Gene Information |
Locus tag | Cmaq_1545 |
Symbol | |
ID | 5709961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1627234 |
End bp | 1628484 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641276053 |
Product | imidazolonepropionase |
Protein accession | YP_001541358 |
Protein GI | 159042106 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
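The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the CDS but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(1627234, 1628484)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the listed gene length of 1251 bp and protein length of 416 aa.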
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0601909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGG TTTCGCTCTT AATAGGCCCA GTTAATCAAC TGGTTACGGC TAAATCAATA CCCTGGAGGC ATGGTGACTC AGTCATTATA ATTGAGGAGG CTGGCGTTGC CGTTAATGGC CATGAGGTAA TTGATGTTGG CCCATGGAGA GAGATAAGTA GGAGATATGA GGGTAAATGC AGGGTATCAG CAGAGGATTC ACTGGTTACT GCCGGTCTCA TAGATCCCCA TACGCACCTA CTTTTCGCAG GCTCCAGGGA GGATGAACTA GAGCTTAAAC TTCAAGGCAC GCCTTATGAG GAGATACTTA AGAGGGGTGG CGGTATATAC AGGACTGTTA ATGAGACGGT TAATGCAAGT GATGAGGAGT TAAGGGGAAT CCTATTAAGT AGATTACTAG AGGTAGCTAA GTATGGTACA ACGACGATAG AGGTTAAGAG TGGTTACGGC ATTAGCCCTA AGGAGGAGAT TAGGCTTCTA AGAATAATTA ATGACGCTGC CGGTGAAGCA TTAATTGATG TTATACCAAC ATACCTAATT CATGTGCCGC CTAGGAACAT GAATAGGGTA GAGTATATAA ATGCCGTGAC CGCATCATTG GATCAAGCCA AGGAATTAGC TAAGTTCATC GACGTCTTCT GCGATGACGG TGCATTCACT GTTAATGAAA CTAGGGCAAT ACTTAATGAG GCTTCATTAA GAGGCTTTAA ACTTAGGCTT CATGCTGATG AAATCGCCTA CATTGGGTGC AGTGATTTAG TTAATGAATT TAACTTAACG TCGCTTGATC ATTTATTAAA TATGCCTGAG CAGAATGCTA AAGCCATGGC TTTAAGAGGC TCAACAGCCA CATTACTGCC TGCAACAGTA CTATCATTAA TGAGTAGTAA GAGACCACCA GTGAAGGCGC TTAAGGATTC TGGAGTACCT ATAGCCTTAG GCTCAGACTT CAGTCCAAAT ACCTGGGTAT TAAACATGCA GTACGTAATG GAGTTAGCCA CATATTTACT CGGTATGACT CCCCTGGAGG TTTTAATGGC TTCAACAGTA AATGCAGCAT ACAGTCTAGG CTTCAGGGAT AGGGGTATTC TACAGCCGGG TTACCTAGCT AACATGGTTA TTTGGAGTAT ACCTAATTAC AAGTGGCTGA GCTACGAGTT AGGTAGGAAT AAGGCATTAA TGACTATAAG CAGGGGCATT ATCCTTGAAT CAAAGCCCAC CAACCCCAAT GTAACCAAGG AATGTTCATA A
|
Protein sequence | MNKVSLLIGP VNQLVTAKSI PWRHGDSVII IEEAGVAVNG HEVIDVGPWR EISRRYEGKC RVSAEDSLVT AGLIDPHTHL LFAGSREDEL ELKLQGTPYE EILKRGGGIY RTVNETVNAS DEELRGILLS RLLEVAKYGT TTIEVKSGYG ISPKEEIRLL RIINDAAGEA LIDVIPTYLI HVPPRNMNRV EYINAVTASL DQAKELAKFI DVFCDDGAFT VNETRAILNE ASLRGFKLRL HADEIAYIGC SDLVNEFNLT SLDHLLNMPE QNAKAMALRG STATLLPATV LSLMSSKRPP VKALKDSGVP IALGSDFSPN TWVLNMQYVM ELATYLLGMT PLEVLMASTV NAAYSLGFRD RGILQPGYLA NMVIWSIPNY KWLSYELGRN KALMTISRGI ILESKPTNPN VTKECS
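The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how the two fields relate, the snippet below strips that grouping from the first three blocks of the gene sequence and translates them codon by codon. The helper names are hypothetical, and the codon map is deliberately partial: it contains only the ten codons needed for this fragment (their assignments are the same in translation table 11 and the standard code), so it is not a full translation table.

```python
# Partial codon map: only the codons that occur in the fragment below.
PARTIAL_CODON_MAP = {
    "ATG": "M", "AAT": "N", "AAG": "K", "GTT": "V", "TCG": "S",
    "CTC": "L", "TTA": "L", "ATA": "I", "GGC": "G", "CCA": "P",
}


def normalize(seq_field: str) -> str:
    """Remove the display grouping (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq_field.split()).upper()


def translate_fragment(dna: str) -> str:
    """Translate complete codons; raises KeyError for codons outside the partial map."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing incomplete codon
    return "".join(PARTIAL_CODON_MAP[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence in the record above.
fragment = normalize("ATGAATAAGG TTTCGCTCTT AATAGGCCCA")
peptide = translate_fragment(fragment)
print(peptide)
```

The translated fragment corresponds to the first block of the protein sequence, MNKVSLLIGP. For the whole gene, a complete codon table (for example from an established bioinformatics library) would be needed instead of this partial map.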