Gene Dbac_2221 details

Gene Information

Locus tag: Dbac_2221
Symbol:
ID: 8377895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 2556112
End bp: 2557581
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 58%
IMG OID: 645001442
Product: polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
Protein accession: YP_003158719
Protein GI: 256829991
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
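
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2556112
end_bp = 2557581

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.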


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.120089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATG AACTGTATCA GCAATTTGAA AGGTATGCGC GGATCTTGCT GCAACGCAAG 
CGCGTTGTGG TCGTCGTGGC TCTGCTGGTC ATGACCCTGG GCGTTATAAC CAGTTATGTC
CTGCCCAGGA AATACGAGGC GCAGTCCACG GTTTTCATCG AACAGAGCGT CATCAGCGAA
TTGGTCAAGG GCATTGCCAC CACTCCGTCC ATGGAGGCCA AGATCAAGGT TTTGACCGTG
GCCATGCTCA GTCGGGAAAC GCTCTTCAAG GTGATGCGCA TTCTGGATAA GGATGTCGAG
TTTGCTTCGG ATATGGATCG GGAAGCATAC ATCAAGGATT TGCGGGAGCG GATTTCCATC
AGGCTGGACG AAAAACGGGG GATCTTCTTC ATCTCCTTTC AGGACAGCGA CCCGCGTTAT
GCCCGTGATT TCGTCAACAC CATCACTCAA GTCTACATCG AGTCGAATAC GGCCTCCAAG
CGGGACGAAT CCCTGGAAGC GACCCGCTTT TTATCCGAGC AGATAGAGAG CTTCAAGAAG
CGCCTCGACG CGGTGGAGGA TGAGATCAAT CAGTACAGGG CCGAGCATGG CCTCCAGCTG
GCGACGGACG AGACCACAAT CCGTTTCGAG ATCGCCGATG CCGAGAGAAA GCTGGAGGCC
ATTCGGGCGC GCAAGCTTGA GCTTGAGACC AAGTTGCAGC TCATGCCCTC CGGAGGGGGC
CGGTCCGCGC ATCTGGCCGA TATGGAGCGC CAATTGGCGA CGCTTTTGAC GGCCTATACG
GATCAGCATC CAAAGGTCGT GCGGCTGCAG GGGCAGATCA GGGCCGTGAA AAGCAGTCCG
TCGGGCGGCA TGACCGGGAA CTCGGGAGCA GCCGCCAAGA CACTGGTTCA GGCCGAGATA
GAGGCCGCCA CGCTTCAGGA GAAGGCCCAG CTTGCGACCA TCGAGGAAAA GACGGAGCTT
CTGCGCCGGA TTCCCACGCT CCGCACGGGG CTGAACGAAC TCTTGCGCAA GAAGGATAAC
GAGACCCTGA TCTACAGTCA GCTCGTGACC CGCTACGGCC AGTCGGAAGT TTCCAAACAA
ATGGAGATGG AAAACAAGTC CATGAACTTT CGGGTCGTGG ATCCGGCTGT CATGCCCGAC
ACCACTGTCA GTCCCAAGCG CGTGCCGATC ATGCTTTTGT CGGCCCTGGC CGGAATCGGG
ATAGGCATGG CCGTCATCAT GGTGCCGTAC CTCATGCGCG GGTCGGTGGA GAGCCTGGCC
GACCTGCGGT CTCTCAACCA GCGGGTGCTG GCCGTTCTGC CTGCGATTTC CAAGCCAAAG
GAGGAGAGGC AGCGCGTGAG GGGGGACCGT ATTTTTATGT CCGGAGCGGC GCTTTATTTC
CTCATGCTGG TGACCATCGG CGTCATGGAG GCCCTGGGCA ACTCACCCCT CGATGTGCTG
TTTGAAAGGG CGCTCGGCCC TTGGCTGTAG
 
Protein sequence
MNHELYQQFE RYARILLQRK RVVVVVALLV MTLGVITSYV LPRKYEAQST VFIEQSVISE 
LVKGIATTPS MEAKIKVLTV AMLSRETLFK VMRILDKDVE FASDMDREAY IKDLRERISI
RLDEKRGIFF ISFQDSDPRY ARDFVNTITQ VYIESNTASK RDESLEATRF LSEQIESFKK
RLDAVEDEIN QYRAEHGLQL ATDETTIRFE IADAERKLEA IRARKLELET KLQLMPSGGG
RSAHLADMER QLATLLTAYT DQHPKVVRLQ GQIRAVKSSP SGGMTGNSGA AAKTLVQAEI
EAATLQEKAQ LATIEEKTEL LRRIPTLRTG LNELLRKKDN ETLIYSQLVT RYGQSEVSKQ
MEMENKSMNF RVVDPAVMPD TTVSPKRVPI MLLSALAGIG IGMAVIMVPY LMRGSVESLA
DLRSLNQRVL AVLPAISKPK EERQRVRGDR IFMSGAALYF LMLVTIGVME ALGNSPLDVL
FERALGPWL
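
The gene and protein sequences can be checked against each other by translating the nucleotides codon by codon. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical helper names, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are allowed). Only the first and last rows of the gene sequence are translated here. The last row is in frame because the 24 preceding rows of 60 bases are a multiple of three.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGAATCATG AACTGTATCA GCAATTTGAA AGGTATGCGC GGATCTTGCT GCAACGCAAG"
last_row = "TTTGAAAGGG CGCTCGGCCC TTGGCTGTAG"

print(translate(first_row))  # MNHELYQQFERYARILLQRK
print(translate(last_row))   # FERALGPWL
```

Both results match the corresponding ends of the protein sequence listed above, once the spacing between ten-residue groups is ignored.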