Gene Dbac_3104 details


Gene Information

Locus tag: Dbac_3104
Symbol:
ID: 8378799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 3513185
End bp: 3514414
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 63%
IMG OID: 645002339
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_003159595
Protein GI: 256830867
COG category: [S] Function unknown
COG ID: [COG3034] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
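
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3513185
END_BP = 3514414
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1230
print(expected_protein_length(GENE_LENGTH_BP))  # 409
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.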


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACCC GCACCTTCAT CCTCATCGCG GCGTTCCTGT CACTCCTGTT TGCCGCCCCG 
GCCCGGCCCC AGAGCTGGAC CGCCTCCCTT GCCCCGCATC CAAAGGCGCC GCCGCTGTTC
ATGGCCGTGG ACATGGCCCG GCAACGCGCC TTTGTGGTCC GCAACAAGGA TGGGGAACTC
AAAAAAATGA AAGACATGTC CTGCACGACA GGCATGCACG GCGGCGGCAA GCTCCTCGAA
GGCGACCGCA AGACCCCTGA AGGCGTCTAT TTCCTGCAGG GCAAGGCTAC GGGAGGGCTC
GATTTCGACA GCTTCGGCAA CACGGCCTTT CCGCTTAATT ACCCCAATCC AGTGGACCGC
ATCCAGGGCA AGACCGGCAA CGGGATCATG ATCCACGGCC GTGGGCGCAG CTTCGGACCG
CGCCAGACCC TTGGCTGCGT TGTCCTTGAA AACGACGACG TGGACACGCT GGACCGGCAT
GTGCGCATCC ACGCCACTCC GGTGGTCATC GCGGAATCGG TGAGTTTGAC CGGCAAGGCA
GGGCCGCCAC CGGAGATCGT TCTCGGCACC TGGGGCTGGA TCAAGGCCCG GGAGCGGCGC
GAGAATGCTT TTTTCGAGAT CTACGATCCC GCGCGCTTTG AAAAATCGAC GGGTATGAGC
TTTGCCCGTT TCCGGCAGAA AATCCTGCAG GAATTCGCCA CCTCCCGGTG GATCGACCTC
CGCATAGAGG ACCTGCAGGT CGTGCAGGGG CCGGACTACA TGGTCTCGGT CTTTGCCGAG
CGCACCCTGC CGCATGGGGA ACAGGGCTGG CGCAGACTGT ACTGGATGCG TCAGGTCGAG
CTCTGGAAGA TCGTGGGCGA GGAGTGGATT CCGCAGAATC TGGGCGGCAG CGTGGACTAC
GCCCAGCTGG TCGGCAAGGA GATTCGTGAG CGGTTGCAGG AATGCGCCCA GGCTTGGGAC
AAGGGTGACC TCAAAACCCT CCTGCGGGCC TACGACCGGA CCGGTAGGCG CAATGACGCG
CAAGGCCGCG AAGCCGTCGC CGCATCGCTT GAACGCGACA TGGCGGCCAA AAAGAAAAAT
CCCTACAGCG CCGAATCCAT GGTGCGGGTC ACCAAACACG GCGTCGAGGC CAAACTCAAG
GCGGACGGAC ACTCCCGGAC CATCCTCTTT CTGCCCGGGG CCTTTAACAC CTGGCTCATC
GTTAGCGACG AGGCGGCGAA GCAGCCATGA
 
Protein sequence
MDTRTFILIA AFLSLLFAAP ARPQSWTASL APHPKAPPLF MAVDMARQRA FVVRNKDGEL 
KKMKDMSCTT GMHGGGKLLE GDRKTPEGVY FLQGKATGGL DFDSFGNTAF PLNYPNPVDR
IQGKTGNGIM IHGRGRSFGP RQTLGCVVLE NDDVDTLDRH VRIHATPVVI AESVSLTGKA
GPPPEIVLGT WGWIKARERR ENAFFEIYDP ARFEKSTGMS FARFRQKILQ EFATSRWIDL
RIEDLQVVQG PDYMVSVFAE RTLPHGEQGW RRLYWMRQVE LWKIVGEEWI PQNLGGSVDY
AQLVGKEIRE RLQECAQAWD KGDLKTLLRA YDRTGRRNDA QGREAVAASL ERDMAAKKKN
PYSAESMVRV TKHGVEAKLK ADGHSRTILF LPGAFNTWLI VSDEAAKQP
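
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It builds the codon table from the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 60 bases of the gene sequence are used as input to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGGACACCC GCACCTTCAT CCTCATCGCG GCGTTCCTGT CACTCCTGTT TGCCGCCCCG"
print(translate(first_line))  # MDTRTFILIAAFLSLLFAAP
```

The 20 residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.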