Gene Dbac_1694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_1694 
Symbol 
ID8377366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp1926905 
End bp1929244 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content60% 
IMG OID645000923 
Productcapsular exopolysaccharide family 
Protein accessionYP_003158202 
Protein GI256829474 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.648011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAAA ATGGCCCTTT GGATTCGCTT CCTTCTGGTA GTCCTTCTTT TTTCCCGGCC 
GTGCACCCAA GTCAGGCCGT GGTGCATGCA TCGGAATTTC CAGCGTCATT TCCGCAGGCG
GAAGACGAGA TCGATCTGCG GGACTGCCTG GAGGTGCTTG TTCGCCGTAA ATGGCTCATC
GGCGGTGTGC TGTTCGCTGT TTTCGTGACC ACGTTCATCG TGACTCTGGC CATGACCCCG
ATTTACAAGG CCACGGGCAA GCTTGAATTC AATCTGCAGC CGCCCAAGGT CACCACGTTT
GAGGACATGC TCGTTCCCCA GACTCAGACC CGGGAGTTCA TGAACACCCA GACCAAGCTG
CTCACCTCCG ACTCGCTGGC GTGGCGGGTG ATCGAAACCC TCGATCTGGT CCACAACCCT
CGCTTCAACA CGGAGATCAC GGCCGAGGGC GAGGATTCCG GCGGCATTCT GAGCGATCTT
CGCCGCATGC TGACCACGAT GCCCGGCCAG GGCGAGGACC TTGATGCGCA GGTCCGGGAG
GCCGAGGTGC AGCAGCGACT GCTCAAGGTC TTCTTGGACA ATCTGGAGAT CAAGGCGGAG
CGGGACACCA CCATCCTGAA CCTGGCCTTT TCCTCCCCGG ACCCGGCGCT GGCCCGGGAC
GTGGTCAACA CGTTCATCGG CTCCTTCATC GCCTGGCAGC TTGACAAAAA GATCGGGGCC
GCCGGCGTGG CCAACGAGCA GTTGCGCAAG CAGGTCGAGG TGGCGCGGAT CCGCCTGGAG
AAATCCGAAG CGGAGATGAA CGCTTTTGCC CAGAAAGCAG GCATCGTCTC TTTGGATAGC
CGCATGAACC TTGTTTACAA GCAGCTTGAG GAGATCAATT CCGCCCTGGC CCTGGCCCAT
GCCCAGCGCC TCGGCAGGGA GGCCCTGGAT GCGCAGGCCC GTGAAGTCGG CGTTTCGGCC
CTGCCCATGG TCATCGACAA TCCGCTCATT CAGGAACTTC GGCAGCAATA CGTCCAGCTT
GAGTCCCAAT ACGAGGATAA GCTTGTCGTC TTCAAGCCCG ATTACCCTGA GGCCAGGCGG
CTGAAGGCCC GGCTCGACGA TGTCGCATCC AAGATCCGCC ATGAGGAGAG CAGGATCCAG
GGCGGCATCC GCAACGACTA TCTTGTGGCC CTCAAAAACG AGGAGTCCCT GCGCACGCAG
GCCGATCTGG CCAAGACCAG CGCCCTGGAT CTGAACGATC GGGCCACGCA GTACAAGATT
CTTGAGCGCG AAGTCGAGAC AAACAAGGAG ATTCATCAGT CCCTGCTGCA GCGCGGCAAG
GAGATCGACG CAACGGTGGG CACGGACATC AGCAATATCC AGGTGGTGGA CCACGCCTTG
CTCCCGGTGA AGGCGGACAA GCCGCGTATC CAGCTGAACC TGATGCTGAG CATCGGGATC
GGTCTGCTGC TTGGCGTGGC CGCCGCCTTC CTGCTCGAAT TTCTGGATAA CACGATCAAA
AGCATTGATG AGATCACCGA CCGCTTCGGC ATCGGCATCC TGGGCGTGCT CCCGGAAGCC
GAAAAGAAAT TCGCGGACAG GCTGGACCGC CTGGTGGTTT CCGATCCCCG GGCGGGGTTT
TCCGAGGCCA TCCGCACCAC CCGCGTCTCC ATTCAGCTGT CCACCGCCGC CGAGGGCGGC
ACCCGTACGC TGCTCATCAC CAGCACTTCG GAAGGGGAGG GCAAGTCGAC CATCGCGGTG
AACATGGCCC TGACCTTTGC CGCCGCAGGC GACAGGGTGC TCATCATCGA CGCGGATCTG
CGCAGGCCCA GATTGCACCA GGTGTTTTCC CCGTCCGCTT CAGGCGGTGG GGGACTGAGC
GAACTGCTCA TCGGCAGCAA GAGTCTGGAC GATGTGATCT GCGCCACCGA GCATGAGAAC
TTGTTTTTCA TCCCGGCGGG ATTGGTGCCC CCCAACCCTG CCGAGCTTCT GGCCTCCAGG
CGGATGCGTA TGTACTTGGA ACAATTGCAT GAGGATTTCG ACCGCATCAT TATCGACGGC
CCGCCATCGG TGGGTTTCGC CGATGTGCTC GTGCTCAGCA GCCTGGCCAG CGGAGTCATT
CTGATAAGCA CCCTGGGCAG GACCCACAGG CAGGCGCTGC GTCTGTTTCG GCGCTCTTTG
CTCAATATCA ACGCCCGGCT GCTGGGCACC ATCGTCAATC GCCTCAATGT GCAGAACCGG
TTTGGGGATT ATTATTCCAA ATATGGCAGG TATTATTACC ATCCTTACGC CTACGGCGAC
GGCAGTACCG AACGTGCGCA GGGCGGCGTT CCGCAACTGG AAGAACCCCG CGCGTCCTGA
 
Protein sequence
MRENGPLDSL PSGSPSFFPA VHPSQAVVHA SEFPASFPQA EDEIDLRDCL EVLVRRKWLI 
GGVLFAVFVT TFIVTLAMTP IYKATGKLEF NLQPPKVTTF EDMLVPQTQT REFMNTQTKL
LTSDSLAWRV IETLDLVHNP RFNTEITAEG EDSGGILSDL RRMLTTMPGQ GEDLDAQVRE
AEVQQRLLKV FLDNLEIKAE RDTTILNLAF SSPDPALARD VVNTFIGSFI AWQLDKKIGA
AGVANEQLRK QVEVARIRLE KSEAEMNAFA QKAGIVSLDS RMNLVYKQLE EINSALALAH
AQRLGREALD AQAREVGVSA LPMVIDNPLI QELRQQYVQL ESQYEDKLVV FKPDYPEARR
LKARLDDVAS KIRHEESRIQ GGIRNDYLVA LKNEESLRTQ ADLAKTSALD LNDRATQYKI
LEREVETNKE IHQSLLQRGK EIDATVGTDI SNIQVVDHAL LPVKADKPRI QLNLMLSIGI
GLLLGVAAAF LLEFLDNTIK SIDEITDRFG IGILGVLPEA EKKFADRLDR LVVSDPRAGF
SEAIRTTRVS IQLSTAAEGG TRTLLITSTS EGEGKSTIAV NMALTFAAAG DRVLIIDADL
RRPRLHQVFS PSASGGGGLS ELLIGSKSLD DVICATEHEN LFFIPAGLVP PNPAELLASR
RMRMYLEQLH EDFDRIIIDG PPSVGFADVL VLSSLASGVI LISTLGRTHR QALRLFRRSL
LNINARLLGT IVNRLNVQNR FGDYYSKYGR YYYHPYAYGD GSTERAQGGV PQLEEPRAS