Gene Dhaf_4664 details

Gene Information

Locus tag: Dhaf_4664
Symbol:
ID: 7261692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 4975545
End bp: 4976735
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 44%
IMG OID: 643564577
Product: transposase IS4 family protein
Protein accession: YP_002461097
Protein GI: 219670662
COG category:
COG ID:
TIGRFAM ID:
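
The length fields in this table can be cross-checked against the coordinates. The Python sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4975545
end_bp = 4976735

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1191 396
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.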


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCATA GCACTATCTT ACCAAAAAAC GAAGCAATGT TCAATTTCTT TAAAAGCCAT 
CGACTGCCCT TATATTTCTC AAAGCCTGTT TTACGACATA TTCAAGAATT TATTGTAGCG
GCCACGGCTA AAGGATATCG TGGTAAGATA GTCGACATCG CGGAGTGGAG TTCGGTTCAT
CGAACCTCTA TTGGTCATTT CCTCTCTCAT GGGGTATGGG ATGAATCTTA TATCCAGAAA
ATTGTTAAAC AGGAATCTCT TCAATTTGTC GTAGCCCACT CCCAAAAGAC GGAGCAGCCC
ATCTTTGTGA TTCATGATGA TACTGTTTGC AATAAGACGA AACCTTCGTC ACAGGCACAA
CGTCCCATCG AGCAAGCAGA TTTCCATTTT TCGCACTTAG AGGGTAAGAG TGTTTGGGGG
CATCAGGTTC AAGCAACCCT TGTTCAATGC GGTGACCACT CGCTCATTCA TGATGTTCAT
CAATACGATA AAACCAAGCT AAGCAAAATT GATGACGCTT GTGAATTGGC TAAAACCATG
CCGATTCCCC CTAAGTCAGG CTATGCCTTG GTTGATTCTT GGTATACCTG CGCCAAGCTG
ATTAACACCT ATGCTGCACG AGGATACCAG CTGATTGGAG CTCTTAAAAC CAACCGCATT
CTCTATCCCC AAGGGATTCG TGTTCGTCTC GATACCTTTG CCTCCTATGT GAACCCAAAG
GAAGTTCACC TTGTGACCGT GAACGGTTCA TCCTACTGGG TTTATCGCTA TGAAGGGGCT
CTAAACGATA TTGAGAATGC CGTAGTGCTG TTTTGTTGGC CTAAAGATGC TTTTCAGGTG
TCTAAAGCCT TGCATGCCTT TTTGTGCACC GATGTTTCAT TAGAAACACA AACTATTTTG
GCTTACTACA GTAAGAGATG GCCCATTGAG ATCTTCTTTC GGCAAGCCAA GGGAAATCTT
GGTTTTAACG GCTACCAAGT ACGCTCAATC CGTTCCATCG AAAGATTCTG GGCTCTACTT
TCTTTCACTC ATTTGTACTG CACCATGGGT TTAGGGAAGC CGCTGCTCTT TGGTGAAGGA
TTGCGGAAAG TCCGAAAAGA GGTAAAAGGG CAATACATTC GATGGATTTA TGAGTGTAGT
AGAAATGGAG TGCCTTTGGA AGATGTTTTA AAACGTCTTA AAGCTGCATA G
 
Protein sequence
MSHSTILPKN EAMFNFFKSH RLPLYFSKPV LRHIQEFIVA ATAKGYRGKI VDIAEWSSVH 
RTSIGHFLSH GVWDESYIQK IVKQESLQFV VAHSQKTEQP IFVIHDDTVC NKTKPSSQAQ
RPIEQADFHF SHLEGKSVWG HQVQATLVQC GDHSLIHDVH QYDKTKLSKI DDACELAKTM
PIPPKSGYAL VDSWYTCAKL INTYAARGYQ LIGALKTNRI LYPQGIRVRL DTFASYVNPK
EVHLVTVNGS SYWVYRYEGA LNDIENAVVL FCWPKDAFQV SKALHAFLCT DVSLETQTIL
AYYSKRWPIE IFFRQAKGNL GFNGYQVRSI RSIERFWALL SFTHLYCTMG LGKPLLFGEG
LRKVRKEVKG QYIRWIYECS RNGVPLEDVL KRLKAA
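
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in Python as follows. This is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the standard NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCTCATA GCACTATCTT ACCAAAAAAC"
print(translate(first_blocks))  # prints: MSHSTILPKN
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison with the protein sequence and the GC content field of the record.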