Gene Sros_1857 details


Gene Information

Locus tag: Sros_1857
Symbol:
ID: 8665135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1971193
End bp: 1972659
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: Site-specific recombinase DNA invertase Pin
Protein accession: YP_003337588
Protein GI: 271963392
COG category:
COG ID:
TIGRFAM ID:
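
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record states neither convention explicitly, but both are consistent with its numbers. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1971193
END_BP = 1972659
GENE_LENGTH_BP = 1467
PROTEIN_LENGTH_AA = 488


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1467
print(expected_protein_length(GENE_LENGTH_BP))  # 488
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.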


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.830217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0036727
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCCCCCC AGGACGCTCC CACGTACGCC ATCGTCTACG TGCGCATTTC CCGAGATAAG 
GAAGGTGCCG GCCTCGGCAT CGACCGGCAG CGCGAGGACT GCGAAGCCCT CGCCGAGCGG
CTCGGCTTTC GCGTCATCGC CGTCTACGAC GACAACGACA TCAGCGCGTA CTCCGGCAAG
GCTCGCCCCG GCTACCGGCA ACTCCTCGCC GACCTCGAAG CCGGCCGCGC AACCGCCGTC
CTGGCCTGGC ACACTGACCG CCTTCACCGC TCCCCCGTCG AGCTGGAGGA GTACATCACC
GTGTGCGAGC CGCGCGGGAT CGTCACGCAC ACGGTGAAAG CTGGCCCGAT CGACCTCGCC
ACCCCCTCCG GTCGGATGAT CGCCCGGCAG CTCGGCGCCG TCGCACGCTT CGAGTCCGAG
CACAAGTCCG ACCGGGCCCG GCGCAAGCGC GACCAGATGG CCCAGGCCGG ACAGTGGAAG
GGCGGGCGGC GCCCCTTCGG CTACGAGGAC AACGGCGTCA CAGTCCGACC AGACGAAGCC
GCCGAGGTGA AGCAGGCGAG CGAGAGTCTC CTGTCCGGGA TGAGCATGCA CGCGATCGTG
CGGGACTGGA ACGCGCGAGG AGTGAAGACG ACGACCGGGG CCACCTGGTC AACGAGGAGC
GTGCGCAACG TCCTGACCCG GCCACGTAAC GCCGGGATCA TGGAGCACCG CGGCGAGGAG
CTCGGCCCCG CCGAGTGGCC AGCCATCGTC GACGAGATGA CATGGCGATC CCTGTGCGCC
CTGCTGGCCG ACCCTAATCG GCGCACAAGC CCCGGTTCCG AACGGCGATG GTTGCTGTCA
GGCATTGCCG AGTGCGGCAT CTGCGAGGTT GGAGGAGTGC TCACCTATCT GAGGGCGGGG
ACCGCAGGGA AGAACACGGG TGAAGATCGG AAGACCGCCC CGGCCTATCG GTGCCGCCTC
GACGATGCAG CCAAGCACGT GGTACGAAAC GCGATCCACC TGGACAAGTA CGTATCCGCC
GTTGCCGTGG AAAGACTTTC GCAGCCAGGT GCCGCCGCGG AGCTCACCAT ACGTAACGAC
GGCGGCGCCG CCCGGGAGCG GGCGATCGAG CGGGAGGCAC TCCGAATCCA TGAGCAGGAG
GCCGGCGAGA TGTTCGCCGC CCGGGAGATG ACGCGCGCCC AGCTCGCCAC GACAAACCGG
GAGATCGCCG CCCGGCGCGC GGAACTCGAC TTGGCCGACG CTGCCGCAGC CCGAGTGACA
GCTCTGGCCC CGTTCGCCAG CGGGGCCGAC GCATCGCAAG TCTGGGAGGA CTTGACGTTG
GATCAGCGTC GCGCGGTTAT CTCCCAGGTC ATGCGCGTGG TTGTGCTTCC GGCAGGGAAA
GGGCGGCCGA AGGGCTGGAC TCCGGAGTAC GGCAAGGAGT GGGGCTACTT TGATCCCGAG
GCCGTCCGAA TTGGCTGGAA GGGATAA
 
Protein sequence
MAPQDAPTYA IVYVRISRDK EGAGLGIDRQ REDCEALAER LGFRVIAVYD DNDISAYSGK 
ARPGYRQLLA DLEAGRATAV LAWHTDRLHR SPVELEEYIT VCEPRGIVTH TVKAGPIDLA
TPSGRMIARQ LGAVARFESE HKSDRARRKR DQMAQAGQWK GGRRPFGYED NGVTVRPDEA
AEVKQASESL LSGMSMHAIV RDWNARGVKT TTGATWSTRS VRNVLTRPRN AGIMEHRGEE
LGPAEWPAIV DEMTWRSLCA LLADPNRRTS PGSERRWLLS GIAECGICEV GGVLTYLRAG
TAGKNTGEDR KTAPAYRCRL DDAAKHVVRN AIHLDKYVSA VAVERLSQPG AAAELTIRND
GGAARERAIE REALRIHEQE AGEMFAAREM TRAQLATTNR EIAARRAELD LADAAAARVT
ALAPFASGAD ASQVWEDLTL DQRRAVISQV MRVVVLPAGK GRPKGWTPEY GKEWGYFDPE
AVRIGWKG
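
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below, in plain Python with no external libraries, shows one way to clean a block, translate it, and compute a GC fraction. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, and that does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the gene sequence is embedded for illustration.

```python
from itertools import product

# Codon table built from the usual TCAG ordering. The amino acid assignments
# are those of the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGGCCCCCC AGGACGCTCC CACGTACGCC ATCGTCTACG TGCGCATTTC CCGAGATAAG"
print(translate(first_line))  # MAPQDAPTYAIVYVRISRDK
```

The printed peptide matches the first twenty residues of the protein sequence listed in this record. Applying `gc_fraction` to the full gene sequence would allow a comparison with the reported GC content of 68%.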