Gene Sros_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1938 
Symbol 
ID8665220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2072994 
End bp2074457 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content69% 
IMG OID 
ProductExopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like protein 
Protein accessionYP_003337669 
Protein GI271963473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAC GTATCATCGT CGGGCTTTCA GCCATAGCAC TCACCATCAC GGCGATTCCC 
GCACAAGCCG ACCGCACCGG CGTGACCTTT CCCACCTCGG CGTTCCCACT CGGCCAGAGC
GGGACACCGA TCAAGAACCA GCGCAGCGTG GCCCCCGGAG TCGACCTCTT CAGCGTGATG
TCCGGCACGT CCACCCAGGG GTGGACCGTG ACGGTGCTGA TGCCCAACGG GCACGACGAC
GGCAAGCTGA CCACCGCCCA GGCGAAGGCG GAGGAGGTCA CGGCGGCGGG ATTCACCCCC
AGCGTGCAGA AGATCGTGCA GCCCGCCGCC GCGGACGCCC CGGCCGTGGA GCGCTACCTC
GTCCGCGTCG GCCTGTGGAC GTTCAAGGAG CGCGCCAAGG CCGACAAGGT GGTCAAGGAG
CTCAAGGAGT TCGACATCCG GGCCAAGACC GACTACCTCG GTGACGACGG CCTGGAGACC
ACCGGCCCCT GGGACATGCG CGTGCTCATG GTGGACCCGC GCGCCTTCCG GGGGTCCTTC
AAGACCAGCG TCGGGACCAG TGTCGCCAAG CGCGAGACCA CCACTTCGAT GTCCAAGCTG
ACCAAGGCCA TCGCCGGCGT CAACGGCGGA TTCTTCAACA TCCACACGCC CAAAGCACTC
CAGGGCGACC CGATGGGCAT CTCGGTGGTG GGTGGCAGGC TGCTCAGCGA GGCGGTGCCC
GGCCGCAGCG GCCTGGTCAT CAGCGGTCGC AAGGTCCGGA TCACCGAGCT GAAGACGGTG
ATCACCGCGA TCCCCGCCGA CGGGGCGAAG ACCGAGATCA AGGGCATCAA CCGGGCCGCC
GGAGCGGACG AGCTCGTGCT CTACACCGAG GAGTTCGGCA CCAAGACGGC GGCCGACGGC
GGCGCCGAGA TCGTGGTCGA CGCCCAGGGG AGGATCGTCA AGGCCCGCGC GGCCGGCGGC
GTCGTCCCAC GCGGCACCTA CGTGCTGCAC GGCACCGGCA TCATGGCGAC CTGGCTCCTG
GAGCACGCGC AGGAGACCTC CGTCATGAAG CTGGACACCA AGGTCATCGA CCTGCGGACG
GAACGGGCCG TGCCGCTCAC CCCCGAGACG CACATCATGG GTGGCGGCGT CGGGCTCCTC
AGGAACGGCC GGGTGCGGAT CAGCGCCAAG GCCGACGGGC ACGCGTCGGT CGTCATGATG
CTCCGCCGCC ACCCGCGCAC GATGGTCGGC GTCACGAAGT CCGGCGGCCT GATCCTGGCG
ACGGTGGACG GCCGCAACCC GGGTGTCACC GTGGGTGCCT CCATGGTGGA GGCGGCTCAG
CTGATGCGCT GGCTGGGCGC CAAGCAGGCC ATCAACTTCG ACGGTGGCGG CTCGACCGCG
ATGGTCGTCG GCCACAAGGT GATCAACCGG CCCTCCGACG GCAGCGAGCG GACCGTGGGC
GACGGCCTGT TCATCACCCC CTGA
 
Protein sequence
MSRRIIVGLS AIALTITAIP AQADRTGVTF PTSAFPLGQS GTPIKNQRSV APGVDLFSVM 
SGTSTQGWTV TVLMPNGHDD GKLTTAQAKA EEVTAAGFTP SVQKIVQPAA ADAPAVERYL
VRVGLWTFKE RAKADKVVKE LKEFDIRAKT DYLGDDGLET TGPWDMRVLM VDPRAFRGSF
KTSVGTSVAK RETTTSMSKL TKAIAGVNGG FFNIHTPKAL QGDPMGISVV GGRLLSEAVP
GRSGLVISGR KVRITELKTV ITAIPADGAK TEIKGINRAA GADELVLYTE EFGTKTAADG
GAEIVVDAQG RIVKARAAGG VVPRGTYVLH GTGIMATWLL EHAQETSVMK LDTKVIDLRT
ERAVPLTPET HIMGGGVGLL RNGRVRISAK ADGHASVVMM LRRHPRTMVG VTKSGGLILA
TVDGRNPGVT VGASMVEAAQ LMRWLGAKQA INFDGGGSTA MVVGHKVINR PSDGSERTVG
DGLFITP