Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_2221 |
Symbol | |
ID | 8377895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | - |
Start bp | 2556112 |
End bp | 2557581 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 645001442 |
Product | polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
Protein accession | YP_003158719 |
Protein GI | 256829991 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.120089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATG AACTGTATCA GCAATTTGAA AGGTATGCGC GGATCTTGCT GCAACGCAAG CGCGTTGTGG TCGTCGTGGC TCTGCTGGTC ATGACCCTGG GCGTTATAAC CAGTTATGTC CTGCCCAGGA AATACGAGGC GCAGTCCACG GTTTTCATCG AACAGAGCGT CATCAGCGAA TTGGTCAAGG GCATTGCCAC CACTCCGTCC ATGGAGGCCA AGATCAAGGT TTTGACCGTG GCCATGCTCA GTCGGGAAAC GCTCTTCAAG GTGATGCGCA TTCTGGATAA GGATGTCGAG TTTGCTTCGG ATATGGATCG GGAAGCATAC ATCAAGGATT TGCGGGAGCG GATTTCCATC AGGCTGGACG AAAAACGGGG GATCTTCTTC ATCTCCTTTC AGGACAGCGA CCCGCGTTAT GCCCGTGATT TCGTCAACAC CATCACTCAA GTCTACATCG AGTCGAATAC GGCCTCCAAG CGGGACGAAT CCCTGGAAGC GACCCGCTTT TTATCCGAGC AGATAGAGAG CTTCAAGAAG CGCCTCGACG CGGTGGAGGA TGAGATCAAT CAGTACAGGG CCGAGCATGG CCTCCAGCTG GCGACGGACG AGACCACAAT CCGTTTCGAG ATCGCCGATG CCGAGAGAAA GCTGGAGGCC ATTCGGGCGC GCAAGCTTGA GCTTGAGACC AAGTTGCAGC TCATGCCCTC CGGAGGGGGC CGGTCCGCGC ATCTGGCCGA TATGGAGCGC CAATTGGCGA CGCTTTTGAC GGCCTATACG GATCAGCATC CAAAGGTCGT GCGGCTGCAG GGGCAGATCA GGGCCGTGAA AAGCAGTCCG TCGGGCGGCA TGACCGGGAA CTCGGGAGCA GCCGCCAAGA CACTGGTTCA GGCCGAGATA GAGGCCGCCA CGCTTCAGGA GAAGGCCCAG CTTGCGACCA TCGAGGAAAA GACGGAGCTT CTGCGCCGGA TTCCCACGCT CCGCACGGGG CTGAACGAAC TCTTGCGCAA GAAGGATAAC GAGACCCTGA TCTACAGTCA GCTCGTGACC CGCTACGGCC AGTCGGAAGT TTCCAAACAA ATGGAGATGG AAAACAAGTC CATGAACTTT CGGGTCGTGG ATCCGGCTGT CATGCCCGAC ACCACTGTCA GTCCCAAGCG CGTGCCGATC ATGCTTTTGT CGGCCCTGGC CGGAATCGGG ATAGGCATGG CCGTCATCAT GGTGCCGTAC CTCATGCGCG GGTCGGTGGA GAGCCTGGCC GACCTGCGGT CTCTCAACCA GCGGGTGCTG GCCGTTCTGC CTGCGATTTC CAAGCCAAAG GAGGAGAGGC AGCGCGTGAG GGGGGACCGT ATTTTTATGT CCGGAGCGGC GCTTTATTTC CTCATGCTGG TGACCATCGG CGTCATGGAG GCCCTGGGCA ACTCACCCCT CGATGTGCTG TTTGAAAGGG CGCTCGGCCC TTGGCTGTAG
|
Protein sequence | MNHELYQQFE RYARILLQRK RVVVVVALLV MTLGVITSYV LPRKYEAQST VFIEQSVISE LVKGIATTPS MEAKIKVLTV AMLSRETLFK VMRILDKDVE FASDMDREAY IKDLRERISI RLDEKRGIFF ISFQDSDPRY ARDFVNTITQ VYIESNTASK RDESLEATRF LSEQIESFKK RLDAVEDEIN QYRAEHGLQL ATDETTIRFE IADAERKLEA IRARKLELET KLQLMPSGGG RSAHLADMER QLATLLTAYT DQHPKVVRLQ GQIRAVKSSP SGGMTGNSGA AAKTLVQAEI EAATLQEKAQ LATIEEKTEL LRRIPTLRTG LNELLRKKDN ETLIYSQLVT RYGQSEVSKQ MEMENKSMNF RVVDPAVMPD TTVSPKRVPI MLLSALAGIG IGMAVIMVPY LMRGSVESLA DLRSLNQRVL AVLPAISKPK EERQRVRGDR IFMSGAALYF LMLVTIGVME ALGNSPLDVL FERALGPWL
|
| |