Gene Cmaq_1545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1545 
Symbol 
ID5709961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1627234 
End bp1628484 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content43% 
IMG OID641276053 
Productimidazolonepropionase 
Protein accessionYP_001541358 
Protein GI159042106 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0601909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGG TTTCGCTCTT AATAGGCCCA GTTAATCAAC TGGTTACGGC TAAATCAATA 
CCCTGGAGGC ATGGTGACTC AGTCATTATA ATTGAGGAGG CTGGCGTTGC CGTTAATGGC
CATGAGGTAA TTGATGTTGG CCCATGGAGA GAGATAAGTA GGAGATATGA GGGTAAATGC
AGGGTATCAG CAGAGGATTC ACTGGTTACT GCCGGTCTCA TAGATCCCCA TACGCACCTA
CTTTTCGCAG GCTCCAGGGA GGATGAACTA GAGCTTAAAC TTCAAGGCAC GCCTTATGAG
GAGATACTTA AGAGGGGTGG CGGTATATAC AGGACTGTTA ATGAGACGGT TAATGCAAGT
GATGAGGAGT TAAGGGGAAT CCTATTAAGT AGATTACTAG AGGTAGCTAA GTATGGTACA
ACGACGATAG AGGTTAAGAG TGGTTACGGC ATTAGCCCTA AGGAGGAGAT TAGGCTTCTA
AGAATAATTA ATGACGCTGC CGGTGAAGCA TTAATTGATG TTATACCAAC ATACCTAATT
CATGTGCCGC CTAGGAACAT GAATAGGGTA GAGTATATAA ATGCCGTGAC CGCATCATTG
GATCAAGCCA AGGAATTAGC TAAGTTCATC GACGTCTTCT GCGATGACGG TGCATTCACT
GTTAATGAAA CTAGGGCAAT ACTTAATGAG GCTTCATTAA GAGGCTTTAA ACTTAGGCTT
CATGCTGATG AAATCGCCTA CATTGGGTGC AGTGATTTAG TTAATGAATT TAACTTAACG
TCGCTTGATC ATTTATTAAA TATGCCTGAG CAGAATGCTA AAGCCATGGC TTTAAGAGGC
TCAACAGCCA CATTACTGCC TGCAACAGTA CTATCATTAA TGAGTAGTAA GAGACCACCA
GTGAAGGCGC TTAAGGATTC TGGAGTACCT ATAGCCTTAG GCTCAGACTT CAGTCCAAAT
ACCTGGGTAT TAAACATGCA GTACGTAATG GAGTTAGCCA CATATTTACT CGGTATGACT
CCCCTGGAGG TTTTAATGGC TTCAACAGTA AATGCAGCAT ACAGTCTAGG CTTCAGGGAT
AGGGGTATTC TACAGCCGGG TTACCTAGCT AACATGGTTA TTTGGAGTAT ACCTAATTAC
AAGTGGCTGA GCTACGAGTT AGGTAGGAAT AAGGCATTAA TGACTATAAG CAGGGGCATT
ATCCTTGAAT CAAAGCCCAC CAACCCCAAT GTAACCAAGG AATGTTCATA A
 
Protein sequence
MNKVSLLIGP VNQLVTAKSI PWRHGDSVII IEEAGVAVNG HEVIDVGPWR EISRRYEGKC 
RVSAEDSLVT AGLIDPHTHL LFAGSREDEL ELKLQGTPYE EILKRGGGIY RTVNETVNAS
DEELRGILLS RLLEVAKYGT TTIEVKSGYG ISPKEEIRLL RIINDAAGEA LIDVIPTYLI
HVPPRNMNRV EYINAVTASL DQAKELAKFI DVFCDDGAFT VNETRAILNE ASLRGFKLRL
HADEIAYIGC SDLVNEFNLT SLDHLLNMPE QNAKAMALRG STATLLPATV LSLMSSKRPP
VKALKDSGVP IALGSDFSPN TWVLNMQYVM ELATYLLGMT PLEVLMASTV NAAYSLGFRD
RGILQPGYLA NMVIWSIPNY KWLSYELGRN KALMTISRGI ILESKPTNPN VTKECS