Gene Dhaf_3951 details

Gene Information

Locus tag: Dhaf_3951
Symbol:
ID: 7260971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 4191475
End bp: 4192695
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 39%
IMG OID: 643563872
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_002460400
Protein GI: 219669965
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
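
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length = gene_length(4191475, 4192695)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields.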


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGGAGA TTGTTAAAAA ACTAAAATCT TATCCTTTAT CTCGAGATGT TGAAACTAAA 
GAAAGAACAG GATATAGGTA TATCCATTAT GGAGATATTC ATAAGCAAAT AGCTGATTTG
ATAGTACAGG ATGAAGATTT GCCCTCTATA AAAGAAGGGG ATTATATACC ACTTAATCAG
GGTGATTTAG TTTTGGCAGA TGTTTCTGAA GATTACACCG GGATTGCAGA GCCAAGCATC
ATTCTCCATG AACCAAAAAC AAAGATTATT GCAGGATTGC ACACCATTGC AATCCGCCCT
CAGAGTGCCA CCTCTTTGTA TTTGTATTAC TTACTACACA CAGAAAGATT TAAAAAATTC
GGAAGCCATG TCGGGACGGG CTTAAAAGTA TTTGGGATAA CTTTTAATAA TTTATCTTTA
TTTCAAATAA AGACTCCGAG TTTTCCCGAA CAAACCGCCA TCGGCAACTT TTTCCGCACC
CTCGACGATA CTATCACCCT TCATAAGCGT AAGCTGGATA AGCTGAAAGA GTTGAAGAAC
GGCTATCTGC AAAAGCTGTT TCCTCAACCC GGAGAAGATG TGCCAAGGGT GCGCTTTGCC
GGATTCAATG AACCGTGGGA AGTGCGTTCA TTTGAAAATA TTCTTGCCCC AGCCGTGGCC
AGTAACACTC TGTCAAGAGC TGAATTAAGC TATGAAAAAG GCAGCATTAA AAATATCCAT
TATGGTGATA TACTTGTGAG ATTCGGAGTC TATATTGACA TTGCAAGGGA TCCGATTCCT
TGTATCGCCA ACGGAAGAAT TATTGATTAT AAGAATAAAT TGCTCCAAGA AGGAGATGTC
ATATTTGCTG ATACGGCAGA AGATGAGACT GTCGGTAAAG CGGTCGAAAT CACTAATATT
AGTAATTTCC AGGTTGTTTC TGGATTGCAC ACAATGGCAT ACCGACCCAA AATTAAAATG
TCACCTTACT ATTTAGGCTA TTATTTGAAT TCTCATTCAT TTCGCTATCA ATTGCTTCCC
CTTATGCAAG GGGTAAAAGT GTTATCGTTG AGCCGCAAGA ACCTGTCTAA GACACTTATT
CGCTATCCGG CTGTATTAAG CGAGCAGTCT CAAATTGGCG ATTTTTTACG AAATCTGGAT
GAACAAATCT TTACTCTATA CAATAAATTA GGCAAGCTGA AACAATTAAA GTCGTTTTAT
CTGCAAAAGA TGTTTATATG A
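
The record reports a GC content of 39% for the full gene. A figure of this kind could be computed as in the minimal sketch below. The function name is hypothetical, and the database's rounding convention is not known, so the example runs only on the first line of the gene sequence rather than trying to reproduce the reported value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 60 nt of the gene sequence, spacing as displayed in the record.
first_line = "TTGGGGGAGA TTGTTAAAAA ACTAAAATCT TATCCTTTAT CTCGAGATGT TGAAACTAAA"
print(gc_content(first_line))  # 30.0 (first line only, not the whole gene)
```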
 
Protein sequence
MGEIVKKLKS YPLSRDVETK ERTGYRYIHY GDIHKQIADL IVQDEDLPSI KEGDYIPLNQ 
GDLVLADVSE DYTGIAEPSI ILHEPKTKII AGLHTIAIRP QSATSLYLYY LLHTERFKKF
GSHVGTGLKV FGITFNNLSL FQIKTPSFPE QTAIGNFFRT LDDTITLHKR KLDKLKELKN
GYLQKLFPQP GEDVPRVRFA GFNEPWEVRS FENILAPAVA SNTLSRAELS YEKGSIKNIH
YGDILVRFGV YIDIARDPIP CIANGRIIDY KNKLLQEGDV IFADTAEDET VGKAVEITNI
SNFQVVSGLH TMAYRPKIKM SPYYLGYYLN SHSFRYQLLP LMQGVKVLSL SRKNLSKTLI
RYPAVLSEQS QIGDFLRNLD EQIFTLYNKL GKLKQLKSFY LQKMFI
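
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is one of several alternative initiation codons read as methionine at the start of a CDS. The sketch below shows this using only the standard library. The helper name `translate_cds` is illustrative, and the start-codon set is the one listed for NCBI table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in its initiation codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiation codon at position 0 as M.

    Whitespace is ignored; translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene sequence above.
head = "TTGGGGGAGA TTGTTAAAAA ACTAAAATCT TATCCTTTAT CTCGAGATGT TGAAACTAAA"
print(translate_cds(head))  # MGEIVKKLKSYPLSRDVETK
```

The output matches the first 20 residues of the protein sequence. The same function can be applied to the full gene sequence for comparison with the listed protein.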