Gene Sros_1737 details

Gene Information

Locus tag: Sros_1737
Symbol:
ID: 8665014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1851226
End bp: 1853985
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003337471
Protein GI: 271963275
COG category:
COG ID:
TIGRFAM ID:
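
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 1851226
end_bp = 1853985

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon, which does not add an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.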


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.254418
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTCG ATGGGATCTT GGTTCCCGAC TGGGCCAAGC CGTACGTCGG CTGGGTGGTC 
GGGATGGACT GGCCCGAAGG CGATGAGGAA GGGTGTTTCC GGCTCGCCGA CGCCTGCGTG
ACCGCCGCTC ATCGGGTCGT TGAGGGGACC GGCGCCGACC AACTCCACAG CGCCCAGAAG
ATCGGCGCGG CCTGGGACGG CGAGGCCCAT ACCGCCTTCG CCGCGCACGT CACCAGGGTG
GTGGGCGGGC GGGTCGCCGA GCTGGTCACC CGGCTGGTCA ATGCCGCCGT CGCGCTCAAC
AACGTCGGCG TGCAGATCCA GTACGCCAAG TACATGATCG AGGCCACGGT ATGGCTGCTG
ATCCTGCAGC TGGGTTACCT GCTGGCCGCC GCCGTGACCA GTGGGGGCGC GAGCCTGGCG
CTCATCCCGG CCCGGTTGCA GCTCGCCCGT CTGACGGTCG CGCAGATCGC CGAGCGGGCC
CTGGTCAACA TGCGGCTGTT CGCCGTCATC GTCGGCGGCA TGGACGCCGG CGTCCAGTCC
CTGCAGATCG CCCAGGGCCG CCGCGACGAC TTCGACCGGT GGCAGCTGCT GATGTCCACG
CTGGCCGGCG GTGCGATGGG CGCCATGATG GGGGGACTGT CGGGTGGGCT GAGCCAGCTG
GCCACTCCGG CCCTGCAGGC GGGTCTTTCC CGAGCCGAGA TGAGCCTGGC GGAGAAGCTG
CTGGCCGCCG CCACCAACTC CTTCTACGGC CAGGCCGCCC AGTACGCGCT GACCGGCGGC
ATCACCACCG CCGGCAGCAT GCTCGTCAAT GGCGACTTCA GCTGGGACCT GCTCGCCAAG
GGCATCACCT CCAGCGCGCT GGGCGCCGAC GGCCAGCACC TGGCCGCACC CATCACCCAC
GGTGGCGGCG CGCCCCCGCC CCATCCCACC CCCCTCCTGC CCGGCTCCGA TCCCCACCCC
GGCCCGGGGT CCCCGTCCGG TTCCGGCCGT GACGTCTCGC TCGGCTCGGA GTCCCCGTCC
GGTTCCGGCC GTGACGCCTC GCTCAGCCAG GCCGACACCC CCACCCCCGA GGCGGCCCGG
CCTCCCTCCA CCACGCACGG TGGACCGGAC CCCTCTTCCC CCGCCGGCCA GAACGGTTTC
CCCGTGGGAG GCCGCGCGCA GGCCGCACCG GAGGGCCATG CCGTACCGAA CGGCAGCGCG
CGGCCGGACG GGAGCCAGGC ACCGCGGGCG GCGCAACCCC CGCCGGCACA CGCGAGAACG
GACGGCGGCG CGGCCGGAGG GGGGCGCCCC GCCGCCGGGA GGACGGACAG CGCGCCGGGC
CCGGGAGGGC GGCCGGACGG CGCGGCGCCG CTCTCGCGCA TCGACCAGTT GATCAACCGG
CCGGCCGAGC AGCCGAGCGC GCCGGCGCCA CAACGCTTTC CGGAGCGATC GGCGCACGCC
GAGGAGGGGC CGCAGACCTG GAACGACGGC GGTTGGGAGG GTTCCCGTGA CGTTCCGGAC
ACTCCGCGCG GGGGAGAGCC GGACACGGTG GCCGCGTCGG TCCCCGACCG CGCCCCCGTA
CACCCGGGTG AGGGGACCGC GGCCTCGAAT CGGGGGCCGG TCAGGGATGT CGAGAACGTG
GACCTCCATC CGGGCTCCGC GGCGACGTGG CCCGACGTGG GTCCGGCGCC GATGAGGACC
GCCGAGGGGG TCAGGCCCCT CGACGCCGCC AGGGACGCCC TTGCCGGCGG CGACGTCGCG
GGTGCCCGGT CGGCGGCGGA CGCGGCCAGG GCGGCCGCCG ACGAGGTGCG ACGATCGGGA
ACGGCTCCCA GGGACGAGGT CGCACGGGCC GAACGGGACG CCCTCCAGGC CGAGCGGATC
GCCGAGGTCG TCGCGGAGCT GGACGGTATC GGCGGTCTGC GCGCGGAGAA CCTGGCCAAG
ATCAGAGAAT TCGGCCTCAA CATCGAGATA CGGACCTTCG ATGAGCAGGT CAGGGGGCAG
GAGAACTTCA TCCCGGTCAC CTATGACGCG GACACGAACA CGCTGACGGT CCAGCAGCAC
ATGGCATGGC GGTCGCCGGC CGAGGAGCTC GCCGACTTCG TGGGCGGGCG CTTCCAGCCG
TTCGAGGACA TGATCAGCCA ACGCGAGTTC GTGGCGCAGG ATGTCAGCGA CTGGCCCGGG
CTGCCCGCGT GGAGCGAGGG GGCTGTGCGC TTCCGCACCC AGGCGGAGTA CGACGCCTGG
GTGGACGGAG CCATGGTGCC GCAGTCCGAG CGGTTCACCG ACGCCCAGAA GGACTCGCTG
GACGCCTACC GCAGGGAGCC GACCTACAGA GAGATCAACG ATCCGCTCCG CGGCCACGGC
CACCACTCGC CCGCCGCGGC CGAGCACACC GCTCACATCG ACTCGGCGAT GCACCAGTCC
GTCATCCCCG AGGACGTCGT CGTCGCCCGG CACGTCTCTC CGGCCGCTTT CGACCGGCCG
ATCGCGCAGT TGGAAGGGAC GGTCCAGGGG GATCTCGCCT ATGTGTCCAC CTCGACCGCC
AAGAACCCCG AACGCTACAT GCACCTGGCC CTCGAGCTGG AGTCGCTGGT CAAGCTCTGG
CTCCGCGTCC CCGAGGGAAC GCACGCGGTT CACATGACCG GTCTCCATCC GAACACCGAG
ATGTTCGGCC CGACCCACGA GCTGCTGCTC GACCGCGGTG TCCGCTACCG GGTCGACAAG
GTGGTGTACG AGGACGGTGG CTGGAAGGTG TTCGGTACGG TGCTGGGCAA GGAGGGCTGA
 
Protein sequence
MGFDGILVPD WAKPYVGWVV GMDWPEGDEE GCFRLADACV TAAHRVVEGT GADQLHSAQK 
IGAAWDGEAH TAFAAHVTRV VGGRVAELVT RLVNAAVALN NVGVQIQYAK YMIEATVWLL
ILQLGYLLAA AVTSGGASLA LIPARLQLAR LTVAQIAERA LVNMRLFAVI VGGMDAGVQS
LQIAQGRRDD FDRWQLLMST LAGGAMGAMM GGLSGGLSQL ATPALQAGLS RAEMSLAEKL
LAAATNSFYG QAAQYALTGG ITTAGSMLVN GDFSWDLLAK GITSSALGAD GQHLAAPITH
GGGAPPPHPT PLLPGSDPHP GPGSPSGSGR DVSLGSESPS GSGRDASLSQ ADTPTPEAAR
PPSTTHGGPD PSSPAGQNGF PVGGRAQAAP EGHAVPNGSA RPDGSQAPRA AQPPPAHART
DGGAAGGGRP AAGRTDSAPG PGGRPDGAAP LSRIDQLINR PAEQPSAPAP QRFPERSAHA
EEGPQTWNDG GWEGSRDVPD TPRGGEPDTV AASVPDRAPV HPGEGTAASN RGPVRDVENV
DLHPGSAATW PDVGPAPMRT AEGVRPLDAA RDALAGGDVA GARSAADAAR AAADEVRRSG
TAPRDEVARA ERDALQAERI AEVVAELDGI GGLRAENLAK IREFGLNIEI RTFDEQVRGQ
ENFIPVTYDA DTNTLTVQQH MAWRSPAEEL ADFVGGRFQP FEDMISQREF VAQDVSDWPG
LPAWSEGAVR FRTQAEYDAW VDGAMVPQSE RFTDAQKDSL DAYRREPTYR EINDPLRGHG
HHSPAAAEHT AHIDSAMHQS VIPEDVVVAR HVSPAAFDRP IAQLEGTVQG DLAYVSTSTA
KNPERYMHLA LELESLVKLW LRVPEGTHAV HMTGLHPNTE MFGPTHELLL DRGVRYRVDK
VVYEDGGWKV FGTVLGKEG
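
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the tool used to produce the record: the helper names `clean` and `translate` are hypothetical, and the codon table is built from the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons; this sketch does not handle them, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGATTCG ATGGGATCTT GGTTCCCGAC TGGGCCAAGC CGTACGTCGG CTGGGTGGTC"
last_line = "GTGGTGTACG AGGACGGTGG CTGGAAGGTG TTCGGTACGG TGCTGGGCAA GGAGGGCTGA"

print(translate(first_line))  # MGFDGILVPDWAKPYVGWVV
print(translate(last_line))   # VVYEDGGWKVFGTVLGKEG
```

The two printed strings match the start and end of the protein sequence listed above, once the grouping spaces are removed.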