Gene Dole_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1424 
Symbol 
ID5694259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1695812 
End bp1697719 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content58% 
IMG OID641264017 
ProductZinc finger-domain-containing protein 
Protein accessionYP_001529305 
Protein GI158521435 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR02098] MJ0042 family finger-like domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0877101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATTA CCTGCGAACA GTGCGGTGTC AATTTTAAGC TGGATGACAG TCTTATAAAA 
CCCCAGGGGT CAAAGGTACG ATGTTCAAAG TGCAAGCACG TTTTTCGGGC CTACCCCCCG
CCCCCGGAGC CTGCGCCTGT TCAGCCGCCG CCGGCGAAAA AGTCGCCGAA GCCCACGGCA
GAACCGGCAC CGGAAGCGCC GGTAAAAGCA TCCAAACCTG AAGCAAAGCC CCCCCCGCCT
GAGACCAAAG AGGTGTCTGA TTTTTCCCTG GACGAAGAGG ACCATGGGCC TGAGCCGGAC
GAGGATCTGG ATCTGGGATT GGACGAGGAC CTGGACCTGG GAGAGGATCT GGACCTGGGG
TTGGACGAGG ATCTCGACCT GGGTCCGGGC GAGGATCTGG ATTTGGGGCT GGAAGAGGAT
GCCGGGGCCG GGGCATCTGA CGTATCCGCC GCATCGGACG ATTTCGAACT CTCCATTGAC
GATGACGAGG ACCACGGGCT TGACGATGAT GACGCCGATA CATCCGACGA AGCTGATGAA
GGGGATCTGG ACCTGGACTT TGATGAAACC GGTGAGGACA AAGCGGCCGA CGACCAGGAG
GAGGATTTCG GCCTGGCCCT GGACGACGAC GAAGACCAGC CCGTGGCGGA TGATGAGGAT
GAGGCAGATC TGGCCCTGGA TTTTGATGAC GATGACCTGG GGTTGGGAAG TGATGATCTT
GAACTGGATG AACTGGAGGA CGTCAGCGAG GCGACCCCCA CGGCAAGCTC CGATGATTTT
GAACTTGAAC TGGATGATGA CGACGCGCCA CCCGAGGAGG ATCTCGACTT TTCTGAGGTA
GACAGCCTGC TGGAGGGCGA TGATGACACA TCGGTGGCCG ACACCGTGGA GCTTTCCGCC
GATGAACTGA ACCTGGACCT GGACGACGAT ATCGATGGTG CCGATCCCAT TGAGCAGCAG
GAGATCGACC TGGCCGACCT GGAGCAGACC ATTGAAATGG AGCTGCTGGA GCCCGAAGAC
GAGGATGAGG AGGAGCAGGA GGAACCCGAA GACGTGGAAC TGGCCCTGTC CGATGAGGGG
CAGGACATGG ATGACGATAT TGATTTTTCC GATGCCGACG AGGATGATTT TTCCGATATA
GAGCAGATGC TGGCATCGGA CGAGGACCAG GAAGACGGCG AACCGGCGGC GGCAGCGATG
GAGACGGATG AAGCCGGGGC AGTGGCTGCT GATAAGAAGA AGGCCAAAAA AGACAAGAAG
GCCAAAAAAC CGGAAAAAGC AAAGAAAGAA AAGAAAAAGA AGGAAAAGAA GGAAAAAACG
GTTTCTGCCG GGGCAACCGG CGTTGGCAGG CGTGTACTGA AGGTGATCCT GGTGGTGCTG
CTGGTGCTGG TCCTTATCGC CGGACTGACC GTGGGTGCTT ATTTCCTGGC TTCCAGCATG
GGGGTCAGTG TGCCGCCCAT GGACAAGCTC CCGCTTATTG GGACCCTGCT GGGAAAGGGG
AGCACGTCCA TCGCCGGAAC CGAGGTGGTG GTGGTTAAAA GCTCACTGCA GAACAATTTT
GTCACCAATA CCCATGCGGG CAAACTGCTG ATCATCACCG GTTCCGTTAC CAACAAGTCC
TCCGCGCCGC GGCGGTTCGT CAAAGTGACG GCATCCCTTG CTTCCCAGGG CACCCCCCTT
GCCCGGGAGG TGTCGGCCTA CTGCGGCAAT ATATTGGCCT TTGAGGAGTT GTCCGAGCAG
CCCATGGACG CCATTCAACA GCGGCTGGCC AACCCCTCCG GGGACAACAA CCAGAACGCC
AATATTCGGG CCGGGGCAAC AGTCCCGTTT ATGATCGTCA TTTCCGATCT GCCGCCGGAC
CTGGTCGGAT ATGAGGTCCA GGTCACGGAA GCATCCCCGA TGGCCTGA
 
Protein sequence
MIITCEQCGV NFKLDDSLIK PQGSKVRCSK CKHVFRAYPP PPEPAPVQPP PAKKSPKPTA 
EPAPEAPVKA SKPEAKPPPP ETKEVSDFSL DEEDHGPEPD EDLDLGLDED LDLGEDLDLG
LDEDLDLGPG EDLDLGLEED AGAGASDVSA ASDDFELSID DDEDHGLDDD DADTSDEADE
GDLDLDFDET GEDKAADDQE EDFGLALDDD EDQPVADDED EADLALDFDD DDLGLGSDDL
ELDELEDVSE ATPTASSDDF ELELDDDDAP PEEDLDFSEV DSLLEGDDDT SVADTVELSA
DELNLDLDDD IDGADPIEQQ EIDLADLEQT IEMELLEPED EDEEEQEEPE DVELALSDEG
QDMDDDIDFS DADEDDFSDI EQMLASDEDQ EDGEPAAAAM ETDEAGAVAA DKKKAKKDKK
AKKPEKAKKE KKKKEKKEKT VSAGATGVGR RVLKVILVVL LVLVLIAGLT VGAYFLASSM
GVSVPPMDKL PLIGTLLGKG STSIAGTEVV VVKSSLQNNF VTNTHAGKLL IITGSVTNKS
SAPRRFVKVT ASLASQGTPL AREVSAYCGN ILAFEELSEQ PMDAIQQRLA NPSGDNNQNA
NIRAGATVPF MIVISDLPPD LVGYEVQVTE ASPMA