Gene Dbac_2074 details


Gene Information

Locus tag: Dbac_2074
Symbol:
ID: 8377749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 2378838
End bp: 2380775
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 65%
IMG OID: 645001298
Product: transglutaminase domain protein
Protein accession: YP_003158575
Protein GI: 256829847
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
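
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2378838
END_BP = 2380775
GENE_LENGTH_BP = 1938
PROTEIN_LENGTH_AA = 645


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1938
    print(expected_protein_length(GENE_LENGTH_BP))  # 645
```

Both results agree with the listed gene length (1938 bp) and protein length (645 aa).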


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.618768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCACG ACAAGCGCCG CTTCAGCGTG ACGCTGCTGG CCCTGGCTCT GGCCTTCGCG 
CCGCATCTGC CGCGCGTGCC GGTCTTTGTC GGCTTTTTTG TCTTTCTGGC CTGGGGCTAC
GCCCTGGGGA TGCAGTACCG GGGCTGGCCC GTCCCTCCCC GTTGGCTGCG CGCCATCCTG
GCCCTGGCCT GTCTTGCCCT GGTGCTCTCC ACGTACGGCC GATCCTTTGG ACGTGACGCC
GGAGTGGCCC TGCTGTCACT CATGCTGGGG CTCAAGGCCG TGGAGAGCAA ATCCGTGCGC
GACATGCTGG CCCTCTTGTT CCTGGCATAT TTCGTGGTCG TGACCAACGT GCTTTATTCC
CAGACCCTGG TCATGAGCGC GTACATGTTT TTTTCGGTCA TGGCCGTGAC CGCGGCCCTG
GTTCATCTGC ATTCCGGGGA ACCCCGCCTG CTCCCCGATC TGCGGCGCGG GGGGCTGCTT
CTCGTCCAGG CCCTGCCGTT GGCCTTGATT CTTTTCGTCT TTTTTCCGCG TCTGCAGGGC
GCCCTGTGGG GCGTGCACGA TGAACGGGAC GAAGGTGTCA GCGGCTTCAG CGAGACGCTG
GAGCCGGGTT CGGTGGCCAG TCTGTCCCTG TCCCGGGAGG TGGCCTTCAG GGTCGATTTT
CCCGGCACCA TCCCTGACCG TGACAGTCTG TATTGGCGCG GGCTGGTGCT GGACAGTTTC
GATGGCATGA CATGGTTTCG GGATGTGCCT TTCGACCTCG TTCCTCCGCG TATTGATGCC
CTCCCCGCGC AAAGCGTGTC CTACACGCTG ACCATGGAAC CGCACAACAG GGAGTGGGTT
TTTGCCCTTG ATCTGCCCGT GCTCGCGCCT CGGGGCACGG TGCTGCGGTC CGATCAGACG
CTGGCCAGCC TGCGCATGGT CCGCTCGCGG GTGCGTTACG AACTCGCGGC GGTCCAGGCT
CCCGGTCTCT CGCCTGTTCC TGGCCCTGCG TGGACCGCAC TGCCCGAGGT CGGCAATCCC
AAGGCGCGCG CCCTGGCCGC TGAATGGAAG GACGCGGGCC TCTCGCCGGA CGAGATGGTC
GCGGCCGCGC TCAAACTTTT CCGGGAGGGC GGGTTTGTCT ACAGCCTGCG GCCCGGAGCC
GCGGACAAAG ACATCGTTGA TCAGTTTTTG TTTGCAACCC GCCTGGGATA TTGCGAACAT
TACTCCTCGG CCATGGCCTT TCTGCTTCGC GCCGCCGGGG TTCCGGTCCG GGTCGTGGTC
GGCTATCAGG GCGGGGAAGA GAATCCCATG GGCGGATATC TCATTGTCCG TCAGTCCGAC
GCCCACGCCT GGGTCGAAGT CTGGACGGAC GGCCGCTGGC TGCGCGTCGA CCCCACCTCC
GTGGTCGCTC CGCAGCGTCT GGTGACGGGG GTGGAGTCCT TCGTGCCCCA GGGGCAGGGC
GGAGTGTTGC CCGAAGGGGC TCAGGCCCTG CGCAAAGTGG GACGTTTTTT TCAGCTGGGC
TGGGACGCGG CCAACAACTC CTGGAATCAA TGGGTGCTGG GCTTCAGCCA CGACAGGCAG
CGAAGCTTGT GGGAGCGCCT GGGCATCGAT TCGACCACCA GGGCCGGGGC CGGAAAGCTG
GCGGGCGTCC TGGCCGTGGG GCTGTGCATC ATTCTGGGCG GGGTGTTCGG CGTCATACTG
CGCTCGCGGC ATGGCGAGCG GGATCAGGCT ACGTTTTTGT ACGGCCGTTT TTGCCGCAAA
TTGGCCAGGC TCGGATTGGC CAGGGGACTG GCCGAGGGGC CGCGCGATTA TGCTCGGCGC
ATAGGTAGGC AGCGCCCCGA ACTGGCCCTG GCTGCCCGGT CTATTGTCGA CGCGTATGTC
GCTTTGCGCT ACAGTGGCCG GGGGGATTTG GCGGCATTCA AACGACTGAT CGACGAATTC
ATGGGGAGAA AGATTTGA
 
Protein sequence
MIHDKRRFSV TLLALALAFA PHLPRVPVFV GFFVFLAWGY ALGMQYRGWP VPPRWLRAIL 
ALACLALVLS TYGRSFGRDA GVALLSLMLG LKAVESKSVR DMLALLFLAY FVVVTNVLYS
QTLVMSAYMF FSVMAVTAAL VHLHSGEPRL LPDLRRGGLL LVQALPLALI LFVFFPRLQG
ALWGVHDERD EGVSGFSETL EPGSVASLSL SREVAFRVDF PGTIPDRDSL YWRGLVLDSF
DGMTWFRDVP FDLVPPRIDA LPAQSVSYTL TMEPHNREWV FALDLPVLAP RGTVLRSDQT
LASLRMVRSR VRYELAAVQA PGLSPVPGPA WTALPEVGNP KARALAAEWK DAGLSPDEMV
AAALKLFREG GFVYSLRPGA ADKDIVDQFL FATRLGYCEH YSSAMAFLLR AAGVPVRVVV
GYQGGEENPM GGYLIVRQSD AHAWVEVWTD GRWLRVDPTS VVAPQRLVTG VESFVPQGQG
GVLPEGAQAL RKVGRFFQLG WDAANNSWNQ WVLGFSHDRQ RSLWERLGID STTRAGAGKL
AGVLAVGLCI ILGGVFGVIL RSRHGERDQA TFLYGRFCRK LARLGLARGL AEGPRDYARR
IGRQRPELAL AARSIVDAYV ALRYSGRGDL AAFKRLIDEF MGRKI
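
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any computation. The following is a minimal sketch, not the database's own tooling, of how the listed GC content and protein sequence could be re-derived from the gene sequence. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code; the alternative start codons of table 11 are not handled, which does not matter here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "ATGATTCACG ACAAGCGCCG CTTCAGCGTG ACGCTGCTGG CCCTGGCTCT GGCCTTCGCG"
    last_line = "ATGGGGAGAA AGATTTGA"
    print(translate(first_line))  # MIHDKRRFSVTLLALALAFA
    print(translate(last_line))   # MGRKI
```

The two printed fragments correspond to the first twenty and last five residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the 65% listed under GC content.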