Gene Dhaf_2235 details


Gene Information

Locus tag: Dhaf_2235
Symbol:
ID: 7259204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 2414422
End bp: 2415600
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 49%
IMG OID: 643562124
Product: Rubrerythrin
Protein accession: YP_002458704
Protein GI: 219668269
COG category: [C] Energy production and conversion
COG ID: [COG1592] Rubrerythrin
TIGRFAM ID:
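
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted as a residue. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2414422, 2415600)
print(length, protein_length_aa(length))  # 1179 392
```

Both values agree with the "Gene Length" and "Protein Length" fields of the record.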


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00102881
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCAA TTAGAAATAT TGATCTCTGT ACCAAGGATT GCCTGTGTCT TTATGTTTGC 
CCGACAGGAG CAACCGATAC AGAAACAGGT CAGATTGACC CTGACAAATG CCTGGACGGA
TGCCGGGCCT GCGTCGACGC CTGCCCATCC CACGCCATAT CCTTTGTGCC TGAGGTATAT
CCTCCTCAGC AAGGAAAATC CCCTTCGGTA AAAAGAGCCA TGTTATCTTT ATCGGCAAGT
AAAACCAAAC AGGAAAAGAT CGCGGCCCAA GTGGCGGAGC GATCAGGCAG CCCGATTTTG
CGGCAGTTTG CCGAAGCCTT AAGCGCCTCC AACCGGCTCA TGGCCGAAGA TATTCTGCGT
GAAGCCGGGT ATCTGCTGCC TCAGAGCGTC AACGCTCAGG ATTTTCTTCA ATCCTTGCTG
GACAGCCCCC AAGGGGAGGA TTTTCCCAGG GAAGCGGCGG CAAGATTATT AGCGAAACTG
AAAACCAATC AGGCAAAAGG GCAGGAGGAG AAAAAAATGA CTCACTATCG TTGTTCAATT
TGTGGTTACC TTCATGAAGG AGAATTAACC GCGGACTTTA AATGTCCAAT CTGTAAACAA
CCCGCTTCCG TATTTCAACT GGTAGAAGAG AAGGGGAGTG CAGGCAATCC TTACGCCGGC
ACCAAAACAG AGAAAAATCT TCTGGACGCC TTTGCCGGAG AAAGCCAGGC CAGAAATAAA
TATACTTATT TCGCCGCCAT AGCCCAAAGA GAGGGATACG ATCAAATTGC CGAACTCTTT
TTGCATACGG CAAGGAATGA GCAGGAACAT GCCCGCATCT GGTATGAAGA GCTGGGCAAT
CTGGGCAGGA CCGCCGAAAA CCTTTTGCAT GCGGCTGAAG GGGAAAACTA TGAATGGACG
GATATGTACG ACCGCTTTGC CAAGGATGCT GAAGCGGAAG GGTTCAAGGA TTTAGCGGCA
AGATTCCGCA AAGTGGGTGC TATCGAGAAA GCCCATGAAG AAAGATACCG TGCCTTGCTG
AAAAACGTGG AAATGCAGCA GGTCTTTGCC AAAGGGGAAG AAGCCATGTG GGAATGCCGT
ATCTGCGGGC ATCTTGTCAT GGGCAGGAAA GCCCCCGATG TTTGCCCGGT ATGTAAGTAT
TCCCAGAGTT ATTTTGAAGT AAGAAAAGAA AACTATTAA
 
Protein sequence
MPAIRNIDLC TKDCLCLYVC PTGATDTETG QIDPDKCLDG CRACVDACPS HAISFVPEVY 
PPQQGKSPSV KRAMLSLSAS KTKQEKIAAQ VAERSGSPIL RQFAEALSAS NRLMAEDILR
EAGYLLPQSV NAQDFLQSLL DSPQGEDFPR EAAARLLAKL KTNQAKGQEE KKMTHYRCSI
CGYLHEGELT ADFKCPICKQ PASVFQLVEE KGSAGNPYAG TKTEKNLLDA FAGESQARNK
YTYFAAIAQR EGYDQIAELF LHTARNEQEH ARIWYEELGN LGRTAENLLH AAEGENYEWT
DMYDRFAKDA EAEGFKDLAA RFRKVGAIEK AHEERYRALL KNVEMQQVFA KGEEAMWECR
ICGHLVMGRK APDVCPVCKY SQSYFEVRKE NY
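
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that relationship, not the database's own pipeline. It assumes that table 11 assigns amino acids to codons the same way as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGCCTGCAA TTAGAAATAT TGATCTCTGT"))  # MPAIRNIDLC
```

Passing the full gene sequence to `translate` and `gc_fraction` is one way to check the listed protein sequence and the reported GC content of 49%.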