Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1571 |
Symbol | |
ID | 5694408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1870400 |
End bp | 1871578 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641264166 |
Product | amidohydrolase |
Protein accession | YP_001529452 |
Protein GI | 158521582 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0206923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCACC CATACACCAT CATTGATACC GCTGTTGTGC GGGTGGGCAG TCTGATCGAC GGCACGGGTA GACCTGCGCG AAAAGATCTG TTTGTGCGGG TGGAAAAGGG AATGGTTCAG GCCATAACGG ATACGGTGCC GTCCGGACCG CATGATCTGA TCGACCTGTC CGGCTGCACC GTGCTGCCCT GCCTGATCGA CAGCCATGTT CACCTTTTTC TGTCCGGCAG CCTGGATTCG GAAGAACACC GTCGGCAGAT GGCCGCCGGA TTTGAGGATG CGTGCCGGAC CATAGCGGAA AATATCGATT GCCAGCAGTC CTGCGGCGTG CTGACCGTGC GCGACGGCGG TGATGGCCGG GCCCATGTGT CGCGTTTTTT GCGGGATAGA GTTGAAAAGG GGCACAGCCT TTTTCTTGCG CAGACCCCTA GCAGGGGATG GTTCAAGGCG GGCCGTTACG GAAAACTGGT GGGCGGCGAG CCCCTGCCGG AAAGCGCTTT TCTGGAAGCC ATCACCGGGC AGATGGCCGC CGGTGCTGAT CATGTCAAGC TGGTCAACTC GGGATTAAAC AGCCTGACCC GATTCGGTGT GCAGACGACG CCGCAGTTTA CACCGGACGA GCTTGCCGCC ATTGTGGCAT TGGCCCACGG GGCCGGGCGG CCGGTGATGG TCCATGCCAA TGGCGAAATT CCTGTTCGTC AGGCCGTGGA GGCCGGAGTA GATTCCATTG AGCACGGGTA TTTCATGGGG ACCGATAACC TGTTACGAAT GGCCGAACGG CAGACCTTCT GGGTGCCCAC CCTGGCGCCC ATGCATGCTT TTGCCCAGAC CACCGTTGAT TTCAGCGGCG TGGCGGCCCG CACGCTTGAG CACCAGATGG GGCAGCTTGC CTTTGCCCGC CGGGTCGGGG TAAAGGTGGC CCTGGGCACG GATGCCGGCA GCCCCGGCGT TTATCACGGC ACGGGTGTGA TTCGCGAGCT TGAACTTTTT ATGGCCGCTG GCTACACCAT GGAAGAGGCC GTTGGCTGTG CCGCGGTTTG CAATGCCGAC CTGCTCGGCC TGGCCGACCG TGGAAGAATC GCACCGGACA TGCCGGCGCT GTGGGCCGTT GTGTCGGGAG ATGCAGGTAG GCTTCCCGCC AGTCTGGCCC AGGCGGTTGT GTATGCGGGG AAGGGTTGA
|
Protein sequence | MNHPYTIIDT AVVRVGSLID GTGRPARKDL FVRVEKGMVQ AITDTVPSGP HDLIDLSGCT VLPCLIDSHV HLFLSGSLDS EEHRRQMAAG FEDACRTIAE NIDCQQSCGV LTVRDGGDGR AHVSRFLRDR VEKGHSLFLA QTPSRGWFKA GRYGKLVGGE PLPESAFLEA ITGQMAAGAD HVKLVNSGLN SLTRFGVQTT PQFTPDELAA IVALAHGAGR PVMVHANGEI PVRQAVEAGV DSIEHGYFMG TDNLLRMAER QTFWVPTLAP MHAFAQTTVD FSGVAARTLE HQMGQLAFAR RVGVKVALGT DAGSPGVYHG TGVIRELELF MAAGYTMEEA VGCAAVCNAD LLGLADRGRI APDMPALWAV VSGDAGRLPA SLAQAVVYAG KG
|
| |