Gene Dole_1571 details

Gene Information

Locus tag: Dole_1571
Symbol:
ID: 5694408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1870400
End bp: 1871578
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 61%
IMG OID: 641264166
Product: amidohydrolase
Protein accession: YP_001529452
Protein GI: 158521582
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
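
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (so the protein has one residue fewer than the number of codons). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1870400
END_BP = 1871578


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1179 392"
```

Both results match the reported Gene Length (1179 bp) and Protein Length (392 aa).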


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0206923
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCACC CATACACCAT CATTGATACC GCTGTTGTGC GGGTGGGCAG TCTGATCGAC 
GGCACGGGTA GACCTGCGCG AAAAGATCTG TTTGTGCGGG TGGAAAAGGG AATGGTTCAG
GCCATAACGG ATACGGTGCC GTCCGGACCG CATGATCTGA TCGACCTGTC CGGCTGCACC
GTGCTGCCCT GCCTGATCGA CAGCCATGTT CACCTTTTTC TGTCCGGCAG CCTGGATTCG
GAAGAACACC GTCGGCAGAT GGCCGCCGGA TTTGAGGATG CGTGCCGGAC CATAGCGGAA
AATATCGATT GCCAGCAGTC CTGCGGCGTG CTGACCGTGC GCGACGGCGG TGATGGCCGG
GCCCATGTGT CGCGTTTTTT GCGGGATAGA GTTGAAAAGG GGCACAGCCT TTTTCTTGCG
CAGACCCCTA GCAGGGGATG GTTCAAGGCG GGCCGTTACG GAAAACTGGT GGGCGGCGAG
CCCCTGCCGG AAAGCGCTTT TCTGGAAGCC ATCACCGGGC AGATGGCCGC CGGTGCTGAT
CATGTCAAGC TGGTCAACTC GGGATTAAAC AGCCTGACCC GATTCGGTGT GCAGACGACG
CCGCAGTTTA CACCGGACGA GCTTGCCGCC ATTGTGGCAT TGGCCCACGG GGCCGGGCGG
CCGGTGATGG TCCATGCCAA TGGCGAAATT CCTGTTCGTC AGGCCGTGGA GGCCGGAGTA
GATTCCATTG AGCACGGGTA TTTCATGGGG ACCGATAACC TGTTACGAAT GGCCGAACGG
CAGACCTTCT GGGTGCCCAC CCTGGCGCCC ATGCATGCTT TTGCCCAGAC CACCGTTGAT
TTCAGCGGCG TGGCGGCCCG CACGCTTGAG CACCAGATGG GGCAGCTTGC CTTTGCCCGC
CGGGTCGGGG TAAAGGTGGC CCTGGGCACG GATGCCGGCA GCCCCGGCGT TTATCACGGC
ACGGGTGTGA TTCGCGAGCT TGAACTTTTT ATGGCCGCTG GCTACACCAT GGAAGAGGCC
GTTGGCTGTG CCGCGGTTTG CAATGCCGAC CTGCTCGGCC TGGCCGACCG TGGAAGAATC
GCACCGGACA TGCCGGCGCT GTGGGCCGTT GTGTCGGGAG ATGCAGGTAG GCTTCCCGCC
AGTCTGGCCC AGGCGGTTGT GTATGCGGGG AAGGGTTGA
 
Protein sequence
MNHPYTIIDT AVVRVGSLID GTGRPARKDL FVRVEKGMVQ AITDTVPSGP HDLIDLSGCT 
VLPCLIDSHV HLFLSGSLDS EEHRRQMAAG FEDACRTIAE NIDCQQSCGV LTVRDGGDGR
AHVSRFLRDR VEKGHSLFLA QTPSRGWFKA GRYGKLVGGE PLPESAFLEA ITGQMAAGAD
HVKLVNSGLN SLTRFGVQTT PQFTPDELAA IVALAHGAGR PVMVHANGEI PVRQAVEAGV
DSIEHGYFMG TDNLLRMAER QTFWVPTLAP MHAFAQTTVD FSGVAARTLE HQMGQLAFAR
RVGVKVALGT DAGSPGVYHG TGVIRELELF MAAGYTMEEA VGCAAVCNAD LLGLADRGRI
APDMPALWAV VSGDAGRLPA SLAQAVVYAG KG
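
The gene and protein sequences above are shown in blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the gene sequence for comparison with the listed protein follows. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ only in permitted start codons); alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATCACC CATACACCAT CATTGATACC"))  # prints "MNHPYTIIDT"
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the two listings against each other.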