Gene Cmaq_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1779 
Symbol 
ID5708787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1853087 
End bp1854544 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content45% 
IMG OID641276290 
Productaldehyde dehydrogenase 
Protein accessionYP_001541592 
Protein GI159042340 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA GAGGCGTATT CATTGGTAAG CTCATTATTC CACGGGATAG GGAGTTCTAT 
GAAATCAGGA ATCCCGCTGA TACTAGGCAG GTTGTAGCTA AGTTCCCTAG GTTGAGGAGG
GATGATGCAA GGGAGGCTAT AGGTGTGGCT AAGGAGGCCT TTGCTAAGTG GGGTAAGACA
TTGCCTGTGG AGAGGGCTAG GATACTGTAT AAGGTTGCTG ATATTATTGA ATCAAGGGCT
GATGAAATGG CTAGGACTCT AACCCTTGAG GAGGGTAAGA CTCTTCCGGA CAGTATGTTT
GAGGTTGTTA GGACTGTTAA TTTACTAAGG TTCTACGCGG GCTTAATAAC AAGGGGTCAG
GGTAAGGTTA TTTCATCACA GGATAGGAAC ACTATAATAA TGACCACTAG GGAGCCTCTT
GGCGTAATAT CAGTCATAAC GCCGTGGAAC TTCCCATTAT CATTACCAGC TTGGAAAATA
ATACCAGCAA TAGCCACAGG TAACACTGTG GTTTGGAAGC CTGCAAGCAT AACGCCTACT
ATAGCCTATG AGCTTGTTAA GGCATTCTAC GATGCAGGCC TACCTGAGGG TGTACTCAAC
CTGGTCACTG GGTCAGCCAG TGAAGTTGGT GATGAGTTAG TGACTAATAA GGATGTTGAT
GCAATAACCT TCACAGGTAG TTTACAGACT GGTAAGGAGA TTAATGAGAA GGTTGGTAGA
ATGAATAGGT TCATTAGGGT TCAACTTGAG TTAGGTGGTA AGAATGCCAC GGTATTATCT
AAGAATGGTG ATATTAATTT AGCTGTGGAG TTAACTGCTA GGTCAGCCTT TGGTTTAACT
GGCCAAGCCT GCACAGCAAC ATCAAGGTTC CTGGTACCCG AGGACATGCA TGATGAGGTT
TTAAGTAAGC TAATTGAGAG GACTAAGAAG ATTGTTGTGG GTAATGGGTT GAAGAGCGGC
GTGGATATGG GTCCATTAGC CAGTAAGGAG CAGTATGATA AGATACTGAG CTACATTGAG
ATAGGTAGGA ATGAGGGGGC TAAGTTAGTT TACGGTGGTC AACCAATAAA GGGGAGTGAG
GAGTTTGATC ACGGCTACTT CGTAATGCCA ACAATATTTG ACGGCGTAAC ACCAGACATG
AGGATAGCCA AGGAGGAGAT ATTCGGCCCA GTATTAGCGG TAATGACCTA TAGGACTCTT
GATGAGGCAA TCGACATAGT TAACTCCACT GAGTACGGCT TAATAGCAGG CATAGTTACT
AGGGATATTA GTGAAGCAGC TAAGTTCACT GAGGGTGTTA AGGTTGGTGT TGTTAAGGTT
AATAAGCCGA CAATAGGTCT TGAACCATGG GTACCCTACG GTGGTGTTAA GGGTTCAGGG
AACGACATGT ATAAGGAGAT GGGTGAAGAA GCCGTTGAAT TCTTCACCAG GATTAAGGCA
ATCTACGTCG GCTACTAG
 
Protein sequence
MSERGVFIGK LIIPRDREFY EIRNPADTRQ VVAKFPRLRR DDAREAIGVA KEAFAKWGKT 
LPVERARILY KVADIIESRA DEMARTLTLE EGKTLPDSMF EVVRTVNLLR FYAGLITRGQ
GKVISSQDRN TIIMTTREPL GVISVITPWN FPLSLPAWKI IPAIATGNTV VWKPASITPT
IAYELVKAFY DAGLPEGVLN LVTGSASEVG DELVTNKDVD AITFTGSLQT GKEINEKVGR
MNRFIRVQLE LGGKNATVLS KNGDINLAVE LTARSAFGLT GQACTATSRF LVPEDMHDEV
LSKLIERTKK IVVGNGLKSG VDMGPLASKE QYDKILSYIE IGRNEGAKLV YGGQPIKGSE
EFDHGYFVMP TIFDGVTPDM RIAKEEIFGP VLAVMTYRTL DEAIDIVNST EYGLIAGIVT
RDISEAAKFT EGVKVGVVKV NKPTIGLEPW VPYGGVKGSG NDMYKEMGEE AVEFFTRIKA
IYVGY