Gene Dbac_1047 details


Gene Information

Locus tag: Dbac_1047
Symbol:
ID: 8376708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 1141869
End bp: 1143158
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 57%
IMG OID: 645000287
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003157576
Protein GI: 256828848
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
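
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1141869
END_BP = 1143158
GENE_LENGTH_BP = 1290
PROTEIN_LENGTH_AA = 429


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span length agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.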


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCAG ACGTACATCA ACTGGCCAAA ATGCTGCATG AACTCGACGA CCAGATGGTC 
GCGTGCATGA AGTGTGGCAT GTGCCAGGCG GTATGCCCGG TGTTTGCCGA AACCATGAAT
GAGGGGGATG TGGCCCGGGG CAAGATCGCG CTCCTGGAAA ACCTGTCCCA TGAAATGATC
AAGGACCCCG AAGGCGTTCA GGAAAAGCTC AACATGTGCC TCTTGTGCGG CTCATGCGCG
GCCAACTGTC CCAGCGGCGT GAAAGTGCTG GACATCTTCC TGAAAGCCCG CGTCATCGTG
AATACGTACA TGGGCTTGCC CGCAGTCAAG AAGGCCATTT TCCAGGGTCT TTTGACCAAG
CCCGGCGTGT TCAATTCCGT GATGGACCTG GCTTCCAAGT TCCAGGGCGT GTTCACCAAG
CCCGCCAACG AAGTCATCGG ATCGTCCTGT TCACGTATCG ATCTGGCTGC CATCGAAGGC
CGCCATTTCA TGCCTCTGGC CAAGAAGTCC TTGCGCAAGC TGGAGCCGTC CCGCAACACC
CGCCCTGGCA AGAGCGGATA CCGCGTGGCC TTTTTTCCGG GCTGCGTCAT CGACAAGATA
TTCCCGCATG TCGGGCAGGC CGTGCTCAAG GCTCTGACGC ATCATGAGGT TGGCATCTAC
ATGCCGACAG GGCAGGCTTG CTGCGGTATC CCGGCTCTGG CTTCGGGCGA CAAGGGGTCT
TTTGACAAGC TTGTGAAGCG TAATCTGGAG ATCTTTGAAA AAGAGAACTT CGATTATCTG
CTCACTGCCT GCGCGACCTG CACGGCGACC ATGCATGAAC TGTGGCCGCT CATGTCCGGG
GACAAGACCC AGAGCATGCA GGATCGCATC GCGGCCATGT CGGCCAAGGT CATGGACGTG
AACCAGTTCA TGGTTGACGT GCTGAAGGTC TCCATGCCTG TCAGCGGACA CGGGACCAAG
GTCACGTATC ATGATCCCTG TCACCTCAAA AAATCCATGA AGGTTTTTGA ACAGCCCCGT
GCGCTCTTGA AGTCCAACCC GAACGTGGAG CTTGTTGAGA TGGCCGATGC GGACCGCTGC
TGCGGTTGCG GCGGCAGCTT CAACCTGCAG CACTACAGCG TATCGAAGAG TATCGGCGAC
CAGAAACGGG ACAATATCGT TGCTTCCGGA GCTCAGGTAG TGGCCACAGG ATGCCCGGCG
TGCATGCTGC AGATTTCCGA CATGCTTTCA CAGCACAAGG ATCAGATCGC AGTCAAACAC
GTCATGGAAA TCTACGCGGA AACGCTTTAA
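
The gene sequence above is printed in space-separated blocks of ten bases. The sketch below shows one way to strip that layout and compute a GC percentage, which can then be compared with the GC content field of the record. The helper names are hypothetical, and how the database rounds its figure is not stated, so treat any comparison as approximate.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, for illustration;
    # paste the full sequence to evaluate the whole gene.
    first_blocks = "ATGACTGCAG ACGTACATCA"
    print(clean_sequence(first_blocks))
    print(f"{gc_content(first_blocks):.1f}% GC")
```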
 
Protein sequence
MTADVHQLAK MLHELDDQMV ACMKCGMCQA VCPVFAETMN EGDVARGKIA LLENLSHEMI 
KDPEGVQEKL NMCLLCGSCA ANCPSGVKVL DIFLKARVIV NTYMGLPAVK KAIFQGLLTK
PGVFNSVMDL ASKFQGVFTK PANEVIGSSC SRIDLAAIEG RHFMPLAKKS LRKLEPSRNT
RPGKSGYRVA FFPGCVIDKI FPHVGQAVLK ALTHHEVGIY MPTGQACCGI PALASGDKGS
FDKLVKRNLE IFEKENFDYL LTACATCTAT MHELWPLMSG DKTQSMQDRI AAMSAKVMDV
NQFMVDVLKV SMPVSGHGTK VTYHDPCHLK KSMKVFEQPR ALLKSNPNVE LVEMADADRC
CGCGGSFNLQ HYSVSKSIGD QKRDNIVASG AQVVATGCPA CMLQISDMLS QHKDQIAVKH
VMEIYAETL
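
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 30 bases of the gene. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons); the function and variable names are illustrative and not part of the database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    gene_start = "ATGACTGCAG ACGTACATCA ACTGGCCAAA"  # first 30 bases of the gene
    gene_end = "GTCATGGAAA TCTACGCGGA AACGCTTTAA"    # last 30 bases of the gene
    print(translate(gene_start))
    print(translate(gene_end))
```

The two fragments translate to the first ten and last nine residues of the protein sequence listed above, with the final codon TAA acting as the stop.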