Gene Dole_2898 details


Gene Information

Locus tag: Dole_2898
Symbol:
ID: 5695756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3483494
End bp: 3484780
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 53%
IMG OID: 641265513
Product: peptidase C14 caspase catalytic subunit p20
Protein accession: YP_001530778
Protein GI: 158522908
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
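
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3483494
END_BP = 3484780


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1287 428
```

Both results agree with the Gene Length and Protein Length values listed in the record.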


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.180767
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CTATCACATT CATTTTAATC GTTGCACTGT TCTTGGTCCT GCCGTCATGC 
GCCACGCTCA CCCAGCTGGG GAAGGCGGCA AGGAGCGGAG ATATCAGACA GGTAGAAACG
CTTTTAAAAA ACGGCGCCGA TGTGAATGAA ACGGCGTTGA TGGGCGTAAC ACCTCTGTAT
GAAGCGGTTT TATATGATGC CCCCATAGAG ATAGTAAGGC TTCTGCTTGA TAACGGCGCC
GACGTCAACA GAGGGATGGG GAATGGATGG AAGCCTATAC ATCTGGCGGT TGATAACGGC
AATGCCGCTG TAGTCAAGCT TCTGATTGAC CGGGGGGCAG ATGTCTCTTT TCAGAATCCC
CATGGCAAAA CCCCGTTACA AATGGCCCAG GAAAACGGCC AGGCCGTTAT GCTTCGCCTG
CTTCAGGATG CAGAGGAGAA GCAGTATAAA GCACTTTTTG CAAAATCTGA TATAGAGGCC
CGACCTTCGA TCGACGGCGG CTCCGTTTCG ATTCTCAAAT CAGACGTCGA TGACCCTCCT
TCCATTCATT CAAAGAACAA CCACAGCGCT TATGCCATCG TCGTCGGCAT TGAAAGTTAT
CGTCAGCAAC TTCCCAAGGC AGATTTTGCC GCCCGGGACG CGCAGACAAT GACCAGTTAT
TTGACAAAAG CCATGGGGTA TCCTGAAGAA AACGTGGTGA CGCTTTTAAA CGACCGGGCG
GCGAAAAGTG ATTTTGAAAA ATATTTTGAA AAATGGCTGT CCAACAACGT GGAGACGGGC
AGTACGGTTT TTGTCTATTT TTCCGGCCAT GGCGCGCCCG ACCCCAAAAC CGGTTCTGCC
TACCTGGTGC CCTATGACGG AGACCCTACA TTTATCGCTG AGACCGGCTA CTCGTTAAGC
AGAATGTATA CCGCCCTGGG CAAACTTCCG GCAAAGGAGA TCATCGTTGC CCTGGACTCC
TGCTTTTCCG GCGCCGGTGG CCGGTCGGTG CTGGCCAAAG GGGCCCGGCC CCTGGTGATG
AACCTTCAGA CCGGAACAGC CATATCAAAA AACATGACCG TGATTGCCGC TTCAGCGGGC
GACCAGATCA GCTCCACCTA TGACGAAAAG GGCCACGGCC TGTTCACCTA CTTTCTGCTC
AAAGGCATCA AGAACGAGGA TGTGCTCAAC CCGGACGGCT CCCTTCGCAT GGACGACCTG
TTCGGCTACA TCTCGCCTCA GGTGGAGCGC ATTGCGCGCA AACAATACAA CAACGAACAG
ACACCGCAGC TGATCGGGGC GAAGTAG
 
Protein sequence
MKKTITFILI VALFLVLPSC ATLTQLGKAA RSGDIRQVET LLKNGADVNE TALMGVTPLY 
EAVLYDAPIE IVRLLLDNGA DVNRGMGNGW KPIHLAVDNG NAAVVKLLID RGADVSFQNP
HGKTPLQMAQ ENGQAVMLRL LQDAEEKQYK ALFAKSDIEA RPSIDGGSVS ILKSDVDDPP
SIHSKNNHSA YAIVVGIESY RQQLPKADFA ARDAQTMTSY LTKAMGYPEE NVVTLLNDRA
AKSDFEKYFE KWLSNNVETG STVFVYFSGH GAPDPKTGSA YLVPYDGDPT FIAETGYSLS
RMYTALGKLP AKEIIVALDS CFSGAGGRSV LAKGARPLVM NLQTGTAISK NMTVIAASAG
DQISSTYDEK GHGLFTYFLL KGIKNEDVLN PDGSLRMDDL FGYISPQVER IARKQYNNEQ
TPQLIGAK