Gene Sros_3328 details

Gene Information

Locus tag: Sros_3328
Symbol:
ID: 8666616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3625910
End bp: 3627700
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: maltodextrin glucosidase
Protein accession: YP_003339010
Protein GI: 271964814
COG category:
COG ID:
TIGRFAM ID:
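
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers are consistent with that reading) and that the final codon of the CDS is a stop codon. The function names `span_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(3625910, 3627700)
print(gene_length)                  # 1791
print(protein_length(gene_length))  # 596
```

Both values agree with the Gene Length and Protein Length fields of this record.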


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.307817
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGATC GGGGATTCCA CCACGACGGC TCGCCGGACC ACGTCAGCAC GCAGGCGCCC 
GAACTCGGCG AGACGGTCAC CGTCTTCGTC CGCACCCCGC GCGACCAGCG TTCCGGCGTG
CACGTGCGGA CCACACCCGA CGGCGAACCG CACTTCACCG AAGCGGTGAT CGACCGCGAG
ACCGAGCACG AGACCTGGTG GCGGGCCGAC GTCGTGGCCC GCCATCCGGT GAGCCGCTAC
CGCTTCCTCA TCGACGGGCC GGGCGGCTAC CGCTGGCTGA CCGCCGCCGG GCTGACCTCC
CACGACGTGC CGGACACCAC CGACTTCCGG CTGGTCACCC ATGCCGCTCC ACCCGCGTGG
GCGACCGGCA GCGTGGTCTA CCAGATCTTC CCCGACCGGT TCGCCTCCTC CGGCGCGGAG
CGCGACCTGC CGTCGTGGGC GATCGCCCGG GGCTGGGACG ACCCCGTCGC CGGGACCGGG
CCCGACACCC CCCGGCAGAT CTTCGGCGGC GACCTGGACG GCATCCGGTC CCGGCTCGAC
CACGTCGCCG CCCTCGGTGC CGACACCCTC TACCTGACGC CCGTCTTCCC CGCCGCCTCC
AACCACCGCT ACAACGCCTC CTCCTTCGCC GAGGTCGACC CGCTGCTCGG CGGTGACGCG
GCCTACCGTG CGCTGATCGA CGAGACGCAC CGCAGGGGCA TGCGCATCCT GGGCGACCTC
ACGACGAACC ACTGCGGCGA CACCCACCCC TGGTTCCGGG CCGCGGCGAA GGACAAGGGC
GCCGCCGAGC GGGAGATGTT CTTCTTCACC GACGACGACC TCGGCTACGA GGCGTGGATG
GGCGTTCCGT CCCTGCCCAA GTTCGACTGG GCGAGCGAGC GGCTGGCCGG GGAGATGGAG
GAGGTCGTCC GGCGGTGGCT CCGCTTCGGC CTGGACGGCT GGCGCATCGA CGTGGCGAAC
ATGACCGGCA GGCTCCGCCG GGCGGACCAC GCCCACGAGG TGGCACGGCG CATCCGGCGG
GCGCTCGCCG CCGAGAGCCC GGACGGGCTG CTCGTCGCCG AGCACGCCCA CGACGCCACC
GGCGACCTCG ACGCCGACGG CTGGCAGGGG ACGATGAACT ACGCCGGATT CACCCGCCCG
GTGTGGAGCT GGCTGCGGGG CCCCGAGCTC CGGCTGCCGT TCCTCGGCGT CCCGGCGGAG
GTGCCGAGGA TCGGCGGCGG CGACGCGGTG GCGACGATGC GGGCGTTCTC CAGCCTCGTG
TCGTGGCGCT CGCTGACGCA CTCCTGGTCG ATCCTCGGCT CCCACGACAC CGCCAGGATC
CGTACGGTGT GCGGAGGGGA TCCCGCCCTG GTGGAGGTCG CCGCCGGGCT GATGTTCACG
CTGCCCGGCA CGCCCATGGT GTTCGCCGGT GACGAGATCG GCCTGGAGGG GGCCTGGGGA
GAGGACGCCC GCCGGACCAT GCCCTGGGAC CGGCCGGAGA GATGGGACCA CGGCACCTTC
GGCGTGTACC GCGACCTGGT CGCGCTGCGC CGGAGCGAGC CCGCCCTGCG CCACGGTGGC
CTGCGCTGGC TGCACGTTTC GGAGGACGCC GTGGTCTACG TCCGGGAGCA CGCGGGCGAG
CGCCTGCTGG TGCTGGCCGC CCGCGCCGCC CACCGGCCGG TACGGCTCCC GCTCGCCGCC
CGTTCGGCCG TCCCCGTGTA CGGCGGCGCG GAGACGGCCC TTTCGGACGA CGGCACCGTC
ACGCTGCCGG CCGACGGGCC GGCCCTCCAC ATCTGGCGGC TGACCCGGTA G
 
Protein sequence
MIDRGFHHDG SPDHVSTQAP ELGETVTVFV RTPRDQRSGV HVRTTPDGEP HFTEAVIDRE 
TEHETWWRAD VVARHPVSRY RFLIDGPGGY RWLTAAGLTS HDVPDTTDFR LVTHAAPPAW
ATGSVVYQIF PDRFASSGAE RDLPSWAIAR GWDDPVAGTG PDTPRQIFGG DLDGIRSRLD
HVAALGADTL YLTPVFPAAS NHRYNASSFA EVDPLLGGDA AYRALIDETH RRGMRILGDL
TTNHCGDTHP WFRAAAKDKG AAEREMFFFT DDDLGYEAWM GVPSLPKFDW ASERLAGEME
EVVRRWLRFG LDGWRIDVAN MTGRLRRADH AHEVARRIRR ALAAESPDGL LVAEHAHDAT
GDLDADGWQG TMNYAGFTRP VWSWLRGPEL RLPFLGVPAE VPRIGGGDAV ATMRAFSSLV
SWRSLTHSWS ILGSHDTARI RTVCGGDPAL VEVAAGLMFT LPGTPMVFAG DEIGLEGAWG
EDARRTMPWD RPERWDHGTF GVYRDLVALR RSEPALRHGG LRWLHVSEDA VVYVREHAGE
RLLVLAARAA HRPVRLPLAA RSAVPVYGGA ETALSDDGTV TLPADGPALH IWRLTR
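
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is pasted in the 10-base blocks shown here, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence from this record.
first_row = "ATGATAGATC GGGGATTCCA CCACGACGGC TCGCCGGACC ACGTCAGCAC GCAGGCGCCC"
print(translate(first_row))  # MIDRGFHHDGSPDHVSTQAP
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.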