Gene Dbac_0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_0239 
Symbol 
ID8375879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp281598 
End bp283094 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content64% 
IMG OID644999473 
Productprotein of unknown function DUF814 
Protein accessionYP_003156782 
Protein GI256828054 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCCA GTGTTTTTTG TTTTGTGGCC AGGGAATTGG CCGAGCGCGT CGTCGGCATG 
CGCGTGGAGA AAGTCTTTAC GCCTCTCCCC GAGACCTGGA CCATAGATCT TGGCCGGGCC
GGATACCTTG TGTTGTGCAC CGCCAAGCCC ACGCCGTTTC TCTATCTTTG CTCTCACAAG
CCCGAAAATC CGCCCAATCC GTCAGGACGC GCCATGTGGC TGCGCAAGCG TCTGAAGGGT
CGCCGCGTCC TGGGCCTTGT TTCGGACTGG CCGCTGCGGC GTCTGGCCCT GGAGCTGTCG
CCGGGCGAAG GCAAATGGCT CATCCTTGAT TTGGCGGCGG CTCCGCAGCT GGCGGACGTG
TTGCCGCCCG AATTCGGAAG CGAGCCGGTC TGGCCGGACC TTGAACGCAT CAAAAGCGAG
GACGGCCTGT GGAGAGCGTT GCCTCACCTG ACCCCGCCCT TGCGTCATCA TCTGCGTTCG
GTTTCGCCGG GCGAGGCAAA GTCCGTTTTG GAGAATCTCA AGGCCGGGAC CGTTTCCACT
TTTTATCATG GCTTCGATCA TCAGGGGCGG CCCCAGGTCA GGCTTTGGCC CTTGCAGGAC
GGGGGCGCCT GTTCCAGCGC CCTGGAGGCT GCGCAGAAGG CCCACGGCCA GACGCTGGCG
GGCCTTGAGC GGGTTCATGC CGGTGCGGAC AGCGCCGTGG CCCGCAATAT CCGTCGCATC
CGCCGCGCTC TGGAGCGGGT GCAGGATGAT CACAAACGTC TGCAGGGCAT GATCGAAAAG
CGTCGCGAGG GACTTATGCT GCAGGCCCAT CTGCATCATC TGGACAAGAA TGTCCGGCTG
GCAGTGCTGC GCCTTGCGGA TGAGCACGGC GAGGAAGTGG AGCTTCGCCT TGATCCGGGT
TTGACGGTGC GTGAAAACAT GGAACGTTTT TTTGTACGCG CGGCCAAGGG CGAGAGGGGG
CTTGGCATTG TCGCCGCCCG CGTCCTGGCC CTCCAGCGTG AGCTGGACGC GGCCCGTCAG
GGCGTGGTGC CGTCCGAGTC CCTGCCTGGG CGCTGCGCAA AAGAGCCCGC GCCTGTCGTG
CTCCCGGCCA AGTACCGCAA GATCAAGGTT CAGGCCTATC GCTCCTCCGA CGGGTTTCTC
ATCGTTCGCG GCCGCAGCGC CCAGGCCAAT CATCAGCTCC TGACCCAGGC CGCCAGCCCC
TTCGATTACT GGCTGCACGC CCAGGATGGT CCCGGCGCGC ACGTCATTGT CAAACGCGAC
TTTCCGGCCC AGGAAGTGCC CGAGCGGACC ATCCAGCAGG CGGCGGCGCT GGCGGCTCTT
GCCAGTCATC TGAAAATGGC GGACCGGGGC GAGGTGCTCC TGTGCCTGGT CAAGGATGTG
CGGCCCATCA AGGGCGCGGC TTTGGGGATG GTCGGAGTGG ACAAGGTTTT GCGCACGGTG
CGCCCGGTCA TCGACCCCGC CCTGGAAGAA AGCCTTCGCC TTGAAGGGCA GCGCTGA
 
Protein sequence
MDASVFCFVA RELAERVVGM RVEKVFTPLP ETWTIDLGRA GYLVLCTAKP TPFLYLCSHK 
PENPPNPSGR AMWLRKRLKG RRVLGLVSDW PLRRLALELS PGEGKWLILD LAAAPQLADV
LPPEFGSEPV WPDLERIKSE DGLWRALPHL TPPLRHHLRS VSPGEAKSVL ENLKAGTVST
FYHGFDHQGR PQVRLWPLQD GGACSSALEA AQKAHGQTLA GLERVHAGAD SAVARNIRRI
RRALERVQDD HKRLQGMIEK RREGLMLQAH LHHLDKNVRL AVLRLADEHG EEVELRLDPG
LTVRENMERF FVRAAKGERG LGIVAARVLA LQRELDAARQ GVVPSESLPG RCAKEPAPVV
LPAKYRKIKV QAYRSSDGFL IVRGRSAQAN HQLLTQAASP FDYWLHAQDG PGAHVIVKRD
FPAQEVPERT IQQAAALAAL ASHLKMADRG EVLLCLVKDV RPIKGAALGM VGVDKVLRTV
RPVIDPALEE SLRLEGQR