Gene Cmaq_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0073 
Symbol 
ID5709386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp86231 
End bp87637 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content43% 
IMG OID641274576 
Productaldehyde dehydrogenase 
Protein accessionYP_001539917 
Protein GI159040665 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGTC AAATCCCAGT ATATGAACCA GCGACCGGTG AGGTTTTAGC CTATGTACCG 
GACATGAGTA TTAATGAAGT TAGGGATGCA ATAAGTAAGG CATATGACGC ATTACCCAGG
ATACAGTCGA TACCGGCCTA CGAGAGGGCT AAGCTACTAA TGAAGGTTGC TCAGGCTATT
AGGGCACGTA AGGAGGAGTT AGCTAGGTTA CTAACAAGGG AGATTGGTAG ACCAATTAAG
AGCACTAGGT TAATCCTCGA GAGGACGGCT AGGATTTATG AATTAGCAGC CCAGGAATTA
CCCCATGTCT TAACCGGTGA ATTCATACCC CTTGAGGCTT ATGATTACCC GGCAGGTAAT
GAGAAGAGAA TAGCCTTCAT TAGAAGGGAA CCAGTGGGTG TGGTGGGTGC TATAACACCC
TTTAATTTTC CACCAGACAG TATGGCTCAT AAGGTTGCCC CAGCCTTGGC AATAGGTAAT
ACTGTGGTTC TTAAGCCGAG TAGGAATTCA CCATTAACGG AAACTGAGAT AGCTAAGATA
ATTACTGAGG TGGGGTTCCC TGAGGGTTCA ATTAACGTGG TTACTGGTGA TTCATCAATG
ATTGGTGATG AATTCGTTAA TAACCCTAAG GTATCATTAA TAACGTTCAC TGGTTCATCT
AAGGTTGGTC TTAATTTAGC CAGTAGGGCA ATATTAATGG GTAAGAGGGT TATTATGGAG
CTTGGCGGTA GTGATGCAAT GATAATCCTG GAGGATGCAG ACTTAAATAA GGCTGTCCAA
GCAGCCACAG TGGGTAGGTT TGATTACGCT GGGCAATTCT GCAATGCCAC CAAGAGGTTA
ATAGTGAGGG ATGAGGTGTA TGATGAATTC ATTAAAAGGC TTACTGAGAG TGTTTCGAAA
CTTAAGATAG GTGATCCATT AAGGGAGGAT ACTGATGTTG GTCCATTAAT AAGTAGGGAG
GCTGTGGAGA CTATGGAGTT CTTCGTTAAT GATGCTTTAA GTAAGGGTGG GAGAATCATA
TATAGGGTCA GTGGGGTTCC GGAAAGGGGG TTCTATTATC CACCAACAAT ACTGGAGGCT
CCGTTTAACT CAGCGGTGTG GATTGAGGAG GTTTTCGGCC CAGTATTACC TGTTGCTAGG
GTTAAGGATG ATGATGAGGC TGTTGAATTA GCCAATAGGA CTGAGTATGG GCTTGACGCA
TCAATATTCA GTAGGAATTT CTCAAGGGCA TATAAGTTAG CCACTAGGAT TAAGGCAGGA
ACCATATTCA TTAATGATAC CACTAGGCTC AGGTTTGATA ACCTACCCTT TGGTGGATTT
AAGAAGTCTG GTATTGGACG TGAGAGTGTT AGGGATACTA TGATTGAGAT GAGTGAAGTT
AAGGTTATAT CTTACACATT AGATTGA
 
Protein sequence
MQRQIPVYEP ATGEVLAYVP DMSINEVRDA ISKAYDALPR IQSIPAYERA KLLMKVAQAI 
RARKEELARL LTREIGRPIK STRLILERTA RIYELAAQEL PHVLTGEFIP LEAYDYPAGN
EKRIAFIRRE PVGVVGAITP FNFPPDSMAH KVAPALAIGN TVVLKPSRNS PLTETEIAKI
ITEVGFPEGS INVVTGDSSM IGDEFVNNPK VSLITFTGSS KVGLNLASRA ILMGKRVIME
LGGSDAMIIL EDADLNKAVQ AATVGRFDYA GQFCNATKRL IVRDEVYDEF IKRLTESVSK
LKIGDPLRED TDVGPLISRE AVETMEFFVN DALSKGGRII YRVSGVPERG FYYPPTILEA
PFNSAVWIEE VFGPVLPVAR VKDDDEAVEL ANRTEYGLDA SIFSRNFSRA YKLATRIKAG
TIFINDTTRL RFDNLPFGGF KKSGIGRESV RDTMIEMSEV KVISYTLD