Gene Sros_1551 details

Gene Information

Locus tag: Sros_1551
Symbol:
ID: 8664827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1646453
End bp: 1648168
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003337287
Protein GI: 271963091
COG category:
COG ID:
TIGRFAM ID:
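
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions match the numbers on this record but are not stated by it. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption here)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1646453, 1648168)
print(length)                     # 1716
print(protein_length_aa(length))  # 571
```

Both results agree with the Gene Length and Protein Length fields listed for Sros_1551.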


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATGCGA GACCCCGTCT CCGTCACCCC TCGACCATCC TCATCCCCGC ACTGGCGTCG 
GCACTGATCG CGTCGGTGCT CGTCCTGGCG GCACCGCCCG GCTTCGCCGC CGACTCCCTC
CTGTCCCAGG GCCGTCCGGC CACCGCCTCC TCCAGCGAGG ACTCCACGCT GGTCCCGGCC
AAGGCCTTCG ACGGAAGCGG CTCGACCCGC TGGGCCTCCG TGGAGGGCCA CGACCCCGAG
TGGCTCCGCG TCGACCTCGG CCAGTCCGCC ACGATCTCCC GGGTCAAGCT GACCTGGGAG
GCGGCGTACG GCAAGGCGTA CCGGATCCAG ACCTCGGCCG ACGGCTCGGC CTGGACCGAC
GTCTACTCGA CCACCGCCGG CGACGGCGCC GTGGACGACC TGACCCTGTC CGGCACCGGC
CGCTACGTGC GGCTGTACGG CACGGCCCGC GGCACCGCCT ACGGCTACTC GCTGTACGAG
ATGGAGGTCT ACGGCAGCAC CGGCGGGAAC CCCAGCCCCA CCCCGACGGT CACCCCCACC
CCCACCCCGA CGCCGACCGC GGGCGGACCG GCGGTGCCCT TCGGCGGGCA CACGATCCCC
TACGCCGCCG GGATGCTGCG TCCCGGCGGC GGCCAGGCCG CGCTCGACCA GAAGGTGGTC
GACTACTACA AGCGCTGGAA GGCCGCCTTC GTCAAGCAGA ACTGCGGCAA CGGCTGGTAC
CAGATCATCT CCCCCGACGC CGACCACCCG TACGTGGCCG AGGCGCAGGG CTACGGCATG
GTCATCGCCG CCACCATGGC GGGGGCCGAC CCCGACGCCA AGAAGATCTT CGACGGCCTG
CTGAAGTACG TCCTGGCCCA TCCCTCGTCG ATCACCCCCG GCCTGCTCGC CGCCGAGCAG
GACACCTCCT GCAAGAGCGT CAACGGCGGG GACTCCGCCA CCGACGGCGA CCTGGACGTC
GCCTACGGCC TGCTCCTGGC CGACCGGCAG TGGGGCAGCG CGGGCGCCTA CGACTACAAG
CAGCTGGCGA TCAAGACCAT CAACGCCATC AAGTCCGGCG AGGTCAACCC GACCACCAAG
CTGATGAAGC TCGGCGACTG GACGAGCTCC GGCGACCAGT ACTACTGGAT CAGCCGCTCC
TCGGACTGGA TGATCGACCA CTTCCGGGCG TTCCGTAAGG CCACCGGCGA CGCGACCTGG
GACACCGTCC GCACCAACCA CCAGAATCTC ATCACCTCCC AGCAGGCGAC CTACGCCGCG
AGCACCGGCC TGCTGGCCGA CTTCGTGGTC AACACCAACA CCACCCCGAA GCCCGCCTCT
GGCCAGGTCC TGGAGGACCC CAACGACGGC AAGTACTGGT GGAACGCCTG CCGTGACCCG
TGGCGGATCG GCGCCGACGC GGTGACCAGC GGCGACGCCA AGTCGCTCGC CGCCGCACGC
AAGCTGAACA CCTGGATCAA GGGCAAGACC GGCGGCGACC CGAGCAAGAT CGCCGTCGGC
TACTCCCTCA GCGGCACGCA GATCTCCAGC GGCAGCGAGC CGGCCTACTT CGCCCCGTTC
GCGGTGGCGG CGATGACCGA CTCCGGCAGC CAGGCCTGGC TGGACGCCCT CTGGAACAAG
ATGCTGAACA CCTCGTTCAC GCCCACCGAC TACTTCTCCA CCAGCATCCA GCTCCAGGTC
ATGATCACGG TGACCGGCAA CCACTGGGTG CCCTGA
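
The gene sequence above is printed in space-separated groups of ten bases, so it needs light cleaning before use. The following is a minimal sketch of that cleaning plus two sanity checks: a GC fraction, and a test that the sequence has the shape of a complete CDS. The helper names are hypothetical. The start codon set is an assumed common subset of bacterial initiation codons (the sequence here begins with GTG), and the stop codons are the three standard ones.

```python
# Assumed subset of initiation codons used in bacterial CDSs.
START_CODONS = {"ATG", "GTG", "TTG"}
# Standard stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Excerpt only: the first and last lines of the gene sequence block.
excerpt = """
GTGCATGCGA GACCCCGTCT CCGTCACCCC TCGACCATCC TCATCCCCGC ACTGGCGTCG
ATGATCACGG TGACCGGCAA CCACTGGGTG CCCTGA
"""
seq = clean_sequence(excerpt)
print(len(seq))                      # 96
print(looks_like_complete_cds(seq))  # True
```

Pasting the full sequence block in place of the excerpt lets `gc_fraction` be compared against the GC content field of the record.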
 
Protein sequence
MHARPRLRHP STILIPALAS ALIASVLVLA APPGFAADSL LSQGRPATAS SSEDSTLVPA 
KAFDGSGSTR WASVEGHDPE WLRVDLGQSA TISRVKLTWE AAYGKAYRIQ TSADGSAWTD
VYSTTAGDGA VDDLTLSGTG RYVRLYGTAR GTAYGYSLYE MEVYGSTGGN PSPTPTVTPT
PTPTPTAGGP AVPFGGHTIP YAAGMLRPGG GQAALDQKVV DYYKRWKAAF VKQNCGNGWY
QIISPDADHP YVAEAQGYGM VIAATMAGAD PDAKKIFDGL LKYVLAHPSS ITPGLLAAEQ
DTSCKSVNGG DSATDGDLDV AYGLLLADRQ WGSAGAYDYK QLAIKTINAI KSGEVNPTTK
LMKLGDWTSS GDQYYWISRS SDWMIDHFRA FRKATGDATW DTVRTNHQNL ITSQQATYAA
STGLLADFVV NTNTTPKPAS GQVLEDPNDG KYWWNACRDP WRIGADAVTS GDAKSLAAAR
KLNTWIKGKT GGDPSKIAVG YSLSGTQISS GSEPAYFAPF AVAAMTDSGS QAWLDALWNK
MLNTSFTPTD YFSTSIQLQV MITVTGNHWV P