Gene Sros_5794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5794 
Symbol 
ID8669088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6350160 
End bp6353576 
Gene Length3417 bp 
Protein Length1138 aa 
Translation table11 
GC content74% 
IMG OID 
ProductExopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like protein 
Protein accessionYP_003341283 
Protein GI271967087 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0776912 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGG CCCTGCTCGC CCCGTTGCTC GCGTTCAGCG CGGCAGGGCC CGCCCAGGCC 
GATCCGTCGC CGTCACCCTC CCCGGCGCGA CCGGAGCAGA CCCTCGTAGA GGACGCGCCG
CGGGCGTCGT CCTACCAGGC CCCCTCCACG CTGCTCGTCG CCCCCGGCGC ACGGCAGCTG
GACACGGCCA AGAAGACCCG GCCCGTCGCG CCGGGGATCA CCCTCTCCTC CTTCGACCGC
TACGACACCG CGGGCTGGCT CCGTGCCGAC GCGATCGCCG CGGACCTCAC GGCCGGCGCG
AGCGTCGACT ACGTGTACTC GGGCGAGGTG TCGAAGACCG AGCCGCTGTC CGGACCGGCG
AAGCGGTCCC GGGCCGTCGC CGCCGTCAAC GGCGACTTCT TCGACATCAA CAGCTCCGGC
GCGGCGCAGG GCATCGGCAT CCACAACGGA GACCTCGTCC AGGCCCCGGT CGCGGGACAC
AACAACGCCG TCGCGGTCAC CGCCGACGGG GTCGGACGCG TGCTGCAGAT GCACTTCGAC
GGCACCGCGA CGCCCGCCGG CGGCAGCCCG ATCACGCTGA CCCAGTTCAA CCAGCTCATC
CAGGGTAACG GCGTCGGCCT GTTCACCCCG CTGTGGGGCT CCTACGGCCG CGGGCGCGCG
GTCGAGGGCG CGGCCGCGGT CACCGAGGTC GTGCTGGAGG GCGGCGTCGT CACCGAGGTG
CGCACCTCCG CGGGCAGCGG CCCGATCCCG GCGGGCACCG CGATCCTGCT CGGCAGGGAC
GCCGGAGCGA GCGCGCTCGC CGCGCTGAAG CCGGGCGACC GCGTCGAGGT GAGATACCAG
CCCAAGCCCT CCGAGGGCGG CGCCGTCAAG GCGGCCGTCG GCGGCAGCCA GATCCTCGTC
AAGGACGGCG TCGCGCAGAC CTCCGCCGAC AACACCGCCC ACCCGCGCAC CGCCGTCGGC
TTCTCCGCCG ACGGCCGGAA GATGTACCTG CTGACGGTCG ACGGCAGGCA GACCGACAGC
AGGGGCGTCA CGCTCACCGA GCTCGGCGCG ATGATGGCCG AGCTCGGCGC GCACGACGCG
CTGAACCTCG ACGGCGGCGG CTCCTCGACG ATGCTCGCCC GGGAGCCCGG CTCCGCCGAC
GTGCAGGTGG AGAACAGCCC CTCCGACGGC GGCGAGCGGC ACGTCCCCAA CGGCCTCGCG
CTCTATGCGC CCGCGGGCAG CGGGAAGCTC AAGGGCTTCT GGGTGGAGAC GGCCGCCGAC
CCGGCGCGGG CCCCCGGCGC CGGTCCGGTC GGCGGCGGCC GCCCCGACCG CGTCTTCCCC
GGTCTCACCC GCAAGCTCAC GGCCGCCGGC TACGACGAGA CGTACGGCCC GGCGGCGGGC
ACCCCGTCCT GGCGGGCCTC CCAGGCGGTA CACGGGCTCG TCGGCCGCGA CGGCACCTTC
CGCGCCCTCG TGCCCGGCCA GACGACCGTC ACCGCCTCCC GCGGCGGCGC GAAGGGCGAG
ATCGCTCTCA CCGTCCTGCA GCCCCTCGTG CGCCTCGGCG CGACGACCGA CCGGGTCGGC
CTGGCGGGCG CGGACGCCTC GGGCACCTTC GGAGTGGTCG GCTACGACCG CAACGGCAAC
TCCGCCCCCG TCGAACCCTC CGACGTGCGG CTCGACTACG ACAGGGACCT GCTCGACGTC
GCCACCTCCG AGCACGGCTT CTTCACGGTG AAGGCGAAGA AGGCGACCGG TTCGGGCCTG
GTGACCGTAC GGGCCGGCGG TTCGAGCGCG GCGGTCCCGG TGACCGTCGG GTACGAGGAC
GTCCCGGTCG CCGGCTTCGA GGACGCGGCG TCGTGGACGT TCGCGGCCGC GCGGGCGACG
GGCTCGCTGT CGGCGGCGCC GGGCCAGAGC GGCGGCGGGC TCAAGCTCGC CTACGACTTC
ACCCAGTCGA CGGCGACGCG GGCCGCCTAC GCGAGCCCGC CCCAGCAGAT CACCGTGCCG
GGCCAGCCGC AGGCGTTCGG CCTCTGGATC CACTCCGGCG GCAAGGGGGA GTGGCCGAGC
CTGGAGTTCT ACGACGCCCA GGGGCAGTCG CAGATCCTCC GCGGCCCGTA TCTCACCTGG
ACCGGCTGGA AGTTCGTCGA GTTCGCGGTG CCGCCCGGGG TGAACTACCC GCTGAAGCTC
CGGCGCTTCT ACGTCGCCGA GACCAAGACC GACCAGAAGT ACACCGGTGA GATCACGATC
GACGGGCTCG TCGCGAAGGT GCCGCCGTCG GTGGACGCTC CGGCCGCCCC GAAGACGGCC
GACCCGGTCA TCCGCCCCGG GGCCGAGGTC GCGGCGATGC CGTGGCGGTT CGCGGTCATG
TCGGACGCGC AGTTCGTGGC ACGCGACCCC GACAGCGACA TCGTCGCCAA CGCCCGCCGC
ACCCTGCGGG AGATCCGGGC CGCCAGGCCC GACTTCCTGC TCATCAACGG TGACCTCGTG
GACGAGGCCT CGCCGGAGGA CTTCGCGCTC GCCAAGCGGA TCCTGGACGA GGAGCTCCAC
GGAGAGCTGC CCGTCCACTA CGTGCCCGGC AACCACGAGG TCATGGGCGG GAAGATCGAC
AACTTCAGGT CGGTCTTCGG CGACACCTAC GGCGGCTTCG ACCACAAGGG GACCCGTTTC
GTGACCCTCG ACACCTCGCG GCTGAACCTG CGCGGCGGAG GGTTCGAGCA GGTGGCCATG
TTGCGCGAGC GTCTCGACGC GGCGGAGAAG GACCCGTCGG TCGGCTCGGT CGCCGTGCTG
TTCCACGTGC CGCCGCGCGA CCCGACGCCC GGCAAGGGCA GCCAGCTCGG CGACCGCAAG
GAGGCCGCGC TGGTCGAGGG CTGGCTCGCG GACTTCCAGC GCACCACCGG CAAGGGGGTC
GCCTACATCG GCGCCCACGT CGGGACCTTC CACGCCTCGC GGGTCGACGC CGTGCCGTAC
TTCATCAACG GCAACTCAGG CAAGAACCCG GCCACGGCGG CCGACGACGG CGGCTTCACC
GGCTGGTCCC TGTGGGGGGT CGACCCGGTG ACCGAGCGGG AGGCGGCACA CGTCCGGCGC
AACTGGTTCG TCGACGCGCC GGCCTGGATC GGCGCGCAGG TCCGCCCGCA CGTCGACGGG
CTGACCCTCA CGGCACCCGC GTCGGTGGAG GTCGGCAAGC CCGGCCGGGT CTCGGCGACC
CTGGCGCAGG GCAGCCGCAC GGTGCCCGTC GCCTACCCCG CCTCGGCGGA CTGGTCGGGC
TCGCCGAACC TTCACGTCGG CACCCGCGCC GGCGCGAAGG CGCGCCACGT CGCGGTGCTC
GATCCCGCCA CGGGGACGCT CACGGCGCTG CGCGCGGGAC AGGTCGTCGT CGCGGTCACC
GTGAACGGCG TCACCCGGCA GGCGACGGTC GCGCTGTCGG CGCGGGCCGC GGCCTGA
 
Protein sequence
MAAALLAPLL AFSAAGPAQA DPSPSPSPAR PEQTLVEDAP RASSYQAPST LLVAPGARQL 
DTAKKTRPVA PGITLSSFDR YDTAGWLRAD AIAADLTAGA SVDYVYSGEV SKTEPLSGPA
KRSRAVAAVN GDFFDINSSG AAQGIGIHNG DLVQAPVAGH NNAVAVTADG VGRVLQMHFD
GTATPAGGSP ITLTQFNQLI QGNGVGLFTP LWGSYGRGRA VEGAAAVTEV VLEGGVVTEV
RTSAGSGPIP AGTAILLGRD AGASALAALK PGDRVEVRYQ PKPSEGGAVK AAVGGSQILV
KDGVAQTSAD NTAHPRTAVG FSADGRKMYL LTVDGRQTDS RGVTLTELGA MMAELGAHDA
LNLDGGGSST MLAREPGSAD VQVENSPSDG GERHVPNGLA LYAPAGSGKL KGFWVETAAD
PARAPGAGPV GGGRPDRVFP GLTRKLTAAG YDETYGPAAG TPSWRASQAV HGLVGRDGTF
RALVPGQTTV TASRGGAKGE IALTVLQPLV RLGATTDRVG LAGADASGTF GVVGYDRNGN
SAPVEPSDVR LDYDRDLLDV ATSEHGFFTV KAKKATGSGL VTVRAGGSSA AVPVTVGYED
VPVAGFEDAA SWTFAAARAT GSLSAAPGQS GGGLKLAYDF TQSTATRAAY ASPPQQITVP
GQPQAFGLWI HSGGKGEWPS LEFYDAQGQS QILRGPYLTW TGWKFVEFAV PPGVNYPLKL
RRFYVAETKT DQKYTGEITI DGLVAKVPPS VDAPAAPKTA DPVIRPGAEV AAMPWRFAVM
SDAQFVARDP DSDIVANARR TLREIRAARP DFLLINGDLV DEASPEDFAL AKRILDEELH
GELPVHYVPG NHEVMGGKID NFRSVFGDTY GGFDHKGTRF VTLDTSRLNL RGGGFEQVAM
LRERLDAAEK DPSVGSVAVL FHVPPRDPTP GKGSQLGDRK EAALVEGWLA DFQRTTGKGV
AYIGAHVGTF HASRVDAVPY FINGNSGKNP ATAADDGGFT GWSLWGVDPV TEREAAHVRR
NWFVDAPAWI GAQVRPHVDG LTLTAPASVE VGKPGRVSAT LAQGSRTVPV AYPASADWSG
SPNLHVGTRA GAKARHVAVL DPATGTLTAL RAGQVVVAVT VNGVTRQATV ALSARAAA