Gene Information |
Locus tag | Sros_5794 |
Symbol | |
ID | 8669088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 6350160 |
End bp | 6353576 |
Gene Length | 3417 bp |
Protein Length | 1138 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein |
Protein accession | YP_003341283 |
Protein GI | 271967087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
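The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 6350160
end_bp = 6353576
gene_length_bp = 3417
protein_length_aa = 1138


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under those assumptions the interval spans 3417 bp, which gives 1139 codons and therefore 1138 residues once the stop codon is excluded, in agreement with the Gene Length and Protein Length fields.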
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0776912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCGG CCCTGCTCGC CCCGTTGCTC GCGTTCAGCG CGGCAGGGCC CGCCCAGGCC GATCCGTCGC CGTCACCCTC CCCGGCGCGA CCGGAGCAGA CCCTCGTAGA GGACGCGCCG CGGGCGTCGT CCTACCAGGC CCCCTCCACG CTGCTCGTCG CCCCCGGCGC ACGGCAGCTG GACACGGCCA AGAAGACCCG GCCCGTCGCG CCGGGGATCA CCCTCTCCTC CTTCGACCGC TACGACACCG CGGGCTGGCT CCGTGCCGAC GCGATCGCCG CGGACCTCAC GGCCGGCGCG AGCGTCGACT ACGTGTACTC GGGCGAGGTG TCGAAGACCG AGCCGCTGTC CGGACCGGCG AAGCGGTCCC GGGCCGTCGC CGCCGTCAAC GGCGACTTCT TCGACATCAA CAGCTCCGGC GCGGCGCAGG GCATCGGCAT CCACAACGGA GACCTCGTCC AGGCCCCGGT CGCGGGACAC AACAACGCCG TCGCGGTCAC CGCCGACGGG GTCGGACGCG TGCTGCAGAT GCACTTCGAC GGCACCGCGA CGCCCGCCGG CGGCAGCCCG ATCACGCTGA CCCAGTTCAA CCAGCTCATC CAGGGTAACG GCGTCGGCCT GTTCACCCCG CTGTGGGGCT CCTACGGCCG CGGGCGCGCG GTCGAGGGCG CGGCCGCGGT CACCGAGGTC GTGCTGGAGG GCGGCGTCGT CACCGAGGTG CGCACCTCCG CGGGCAGCGG CCCGATCCCG GCGGGCACCG CGATCCTGCT CGGCAGGGAC GCCGGAGCGA GCGCGCTCGC CGCGCTGAAG CCGGGCGACC GCGTCGAGGT GAGATACCAG CCCAAGCCCT CCGAGGGCGG CGCCGTCAAG GCGGCCGTCG GCGGCAGCCA GATCCTCGTC AAGGACGGCG TCGCGCAGAC CTCCGCCGAC AACACCGCCC ACCCGCGCAC CGCCGTCGGC TTCTCCGCCG ACGGCCGGAA GATGTACCTG CTGACGGTCG ACGGCAGGCA GACCGACAGC AGGGGCGTCA CGCTCACCGA GCTCGGCGCG ATGATGGCCG AGCTCGGCGC GCACGACGCG CTGAACCTCG ACGGCGGCGG CTCCTCGACG ATGCTCGCCC GGGAGCCCGG CTCCGCCGAC GTGCAGGTGG AGAACAGCCC CTCCGACGGC GGCGAGCGGC ACGTCCCCAA CGGCCTCGCG CTCTATGCGC CCGCGGGCAG CGGGAAGCTC AAGGGCTTCT GGGTGGAGAC GGCCGCCGAC CCGGCGCGGG CCCCCGGCGC CGGTCCGGTC GGCGGCGGCC GCCCCGACCG CGTCTTCCCC GGTCTCACCC GCAAGCTCAC GGCCGCCGGC TACGACGAGA CGTACGGCCC GGCGGCGGGC ACCCCGTCCT GGCGGGCCTC CCAGGCGGTA CACGGGCTCG TCGGCCGCGA CGGCACCTTC CGCGCCCTCG TGCCCGGCCA GACGACCGTC ACCGCCTCCC GCGGCGGCGC GAAGGGCGAG ATCGCTCTCA CCGTCCTGCA GCCCCTCGTG CGCCTCGGCG CGACGACCGA CCGGGTCGGC CTGGCGGGCG CGGACGCCTC GGGCACCTTC GGAGTGGTCG GCTACGACCG CAACGGCAAC TCCGCCCCCG TCGAACCCTC CGACGTGCGG CTCGACTACG ACAGGGACCT GCTCGACGTC GCCACCTCCG AGCACGGCTT CTTCACGGTG AAGGCGAAGA AGGCGACCGG TTCGGGCCTG GTGACCGTAC GGGCCGGCGG TTCGAGCGCG GCGGTCCCGG TGACCGTCGG GTACGAGGAC 
GTCCCGGTCG CCGGCTTCGA GGACGCGGCG TCGTGGACGT TCGCGGCCGC GCGGGCGACG GGCTCGCTGT CGGCGGCGCC GGGCCAGAGC GGCGGCGGGC TCAAGCTCGC CTACGACTTC ACCCAGTCGA CGGCGACGCG GGCCGCCTAC GCGAGCCCGC CCCAGCAGAT CACCGTGCCG GGCCAGCCGC AGGCGTTCGG CCTCTGGATC CACTCCGGCG GCAAGGGGGA GTGGCCGAGC CTGGAGTTCT ACGACGCCCA GGGGCAGTCG CAGATCCTCC GCGGCCCGTA TCTCACCTGG ACCGGCTGGA AGTTCGTCGA GTTCGCGGTG CCGCCCGGGG TGAACTACCC GCTGAAGCTC CGGCGCTTCT ACGTCGCCGA GACCAAGACC GACCAGAAGT ACACCGGTGA GATCACGATC GACGGGCTCG TCGCGAAGGT GCCGCCGTCG GTGGACGCTC CGGCCGCCCC GAAGACGGCC GACCCGGTCA TCCGCCCCGG GGCCGAGGTC GCGGCGATGC CGTGGCGGTT CGCGGTCATG TCGGACGCGC AGTTCGTGGC ACGCGACCCC GACAGCGACA TCGTCGCCAA CGCCCGCCGC ACCCTGCGGG AGATCCGGGC CGCCAGGCCC GACTTCCTGC TCATCAACGG TGACCTCGTG GACGAGGCCT CGCCGGAGGA CTTCGCGCTC GCCAAGCGGA TCCTGGACGA GGAGCTCCAC GGAGAGCTGC CCGTCCACTA CGTGCCCGGC AACCACGAGG TCATGGGCGG GAAGATCGAC AACTTCAGGT CGGTCTTCGG CGACACCTAC GGCGGCTTCG ACCACAAGGG GACCCGTTTC GTGACCCTCG ACACCTCGCG GCTGAACCTG CGCGGCGGAG GGTTCGAGCA GGTGGCCATG TTGCGCGAGC GTCTCGACGC GGCGGAGAAG GACCCGTCGG TCGGCTCGGT CGCCGTGCTG TTCCACGTGC CGCCGCGCGA CCCGACGCCC GGCAAGGGCA GCCAGCTCGG CGACCGCAAG GAGGCCGCGC TGGTCGAGGG CTGGCTCGCG GACTTCCAGC GCACCACCGG CAAGGGGGTC GCCTACATCG GCGCCCACGT CGGGACCTTC CACGCCTCGC GGGTCGACGC CGTGCCGTAC TTCATCAACG GCAACTCAGG CAAGAACCCG GCCACGGCGG CCGACGACGG CGGCTTCACC GGCTGGTCCC TGTGGGGGGT CGACCCGGTG ACCGAGCGGG AGGCGGCACA CGTCCGGCGC AACTGGTTCG TCGACGCGCC GGCCTGGATC GGCGCGCAGG TCCGCCCGCA CGTCGACGGG CTGACCCTCA CGGCACCCGC GTCGGTGGAG GTCGGCAAGC CCGGCCGGGT CTCGGCGACC CTGGCGCAGG GCAGCCGCAC GGTGCCCGTC GCCTACCCCG CCTCGGCGGA CTGGTCGGGC TCGCCGAACC TTCACGTCGG CACCCGCGCC GGCGCGAAGG CGCGCCACGT CGCGGTGCTC GATCCCGCCA CGGGGACGCT CACGGCGCTG CGCGCGGGAC AGGTCGTCGT CGCGGTCACC GTGAACGGCG TCACCCGGCA GGCGACGGTC GCGCTGTCGG CGCGGGCCGC GGCCTGA
|
Protein sequence | MAAALLAPLL AFSAAGPAQA DPSPSPSPAR PEQTLVEDAP RASSYQAPST LLVAPGARQL DTAKKTRPVA PGITLSSFDR YDTAGWLRAD AIAADLTAGA SVDYVYSGEV SKTEPLSGPA KRSRAVAAVN GDFFDINSSG AAQGIGIHNG DLVQAPVAGH NNAVAVTADG VGRVLQMHFD GTATPAGGSP ITLTQFNQLI QGNGVGLFTP LWGSYGRGRA VEGAAAVTEV VLEGGVVTEV RTSAGSGPIP AGTAILLGRD AGASALAALK PGDRVEVRYQ PKPSEGGAVK AAVGGSQILV KDGVAQTSAD NTAHPRTAVG FSADGRKMYL LTVDGRQTDS RGVTLTELGA MMAELGAHDA LNLDGGGSST MLAREPGSAD VQVENSPSDG GERHVPNGLA LYAPAGSGKL KGFWVETAAD PARAPGAGPV GGGRPDRVFP GLTRKLTAAG YDETYGPAAG TPSWRASQAV HGLVGRDGTF RALVPGQTTV TASRGGAKGE IALTVLQPLV RLGATTDRVG LAGADASGTF GVVGYDRNGN SAPVEPSDVR LDYDRDLLDV ATSEHGFFTV KAKKATGSGL VTVRAGGSSA AVPVTVGYED VPVAGFEDAA SWTFAAARAT GSLSAAPGQS GGGLKLAYDF TQSTATRAAY ASPPQQITVP GQPQAFGLWI HSGGKGEWPS LEFYDAQGQS QILRGPYLTW TGWKFVEFAV PPGVNYPLKL RRFYVAETKT DQKYTGEITI DGLVAKVPPS VDAPAAPKTA DPVIRPGAEV AAMPWRFAVM SDAQFVARDP DSDIVANARR TLREIRAARP DFLLINGDLV DEASPEDFAL AKRILDEELH GELPVHYVPG NHEVMGGKID NFRSVFGDTY GGFDHKGTRF VTLDTSRLNL RGGGFEQVAM LRERLDAAEK DPSVGSVAVL FHVPPRDPTP GKGSQLGDRK EAALVEGWLA DFQRTTGKGV AYIGAHVGTF HASRVDAVPY FINGNSGKNP ATAADDGGFT GWSLWGVDPV TEREAAHVRR NWFVDAPAWI GAQVRPHVDG LTLTAPASVE VGKPGRVSAT LAQGSRTVPV AYPASADWSG SPNLHVGTRA GAKARHVAVL DPATGTLTAL RAGQVVVAVT VNGVTRQATV ALSARAAA
|
| |
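The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and is printed in space-separated blocks of 10 bases. The sketch below shows one way to strip that spacing, compute GC content, and translate the sequence. It is an illustration only: the helper names are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares for all 64 codons (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCCGCGG CCCTGCTCGC CCCGTTGCTC"
print(translate(first_blocks))
```

Translating the first 30 bases this way yields MAAALLAPLL, the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same manner.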