Gene Cmaq_0979 details


Gene Information

Locus tag: Cmaq_0979
Symbol:
ID: 5710224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1029221
End bp: 1030762
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 46%
IMG OID: 641275480
Product: aldehyde dehydrogenase
Protein accession: YP_001540801
Protein GI: 159041549
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
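
The start, end, gene length and protein length values in this table are related by simple arithmetic, and the sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive (an assumption; the record does not say so), and that the gene length includes the terminal stop codon, which matches the TGA at the end of the listed gene sequence. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1029221
END_BP = 1030762


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, "bp;", PROTEIN_LEN, "aa")
```

Under these assumptions the computed values match the table: 1542 bp and 513 aa.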


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGAGG TTAAGGGTAA AGTTAAGGTT GAGGTTAAGT CGCCTGAACT ATCATCAATA 
CTGAAGACTG GGCCTGATGG TACACCACTG TTCCCAACAT TCATTAATGG GCAATGGTAC
ATGGGTGATA ATTGGCAGGA CGTCAAGTCC CCAATAGACT TATCTGTAAT AGCCAAGGTA
CCGAGACTAC CAGGCAACGT AACTGAGCAG GCTATCGAAA CCACCTATAG GGAGGGTAGG
TGGGCTATAA GGGACATGCC TGGTCAAAGG AGACTCGATG CATTTCATAG GGCGGCTGAC
CTACTTGATA AGTTTAGGGA GGACTTCGTT AATGTACTAG TCTCCAACGC TGGTAAAACA
ACCTCAGCGG CTAATGGGGA GGTTAATTCA GCAATTGAGA GACTGAGGAG ACTGGATTTT
GATGTTGAAG GGGTTCACGG TGACTACGTG CCAGGGGACT GGAGTTTTGA TGCCTTGGAG
AGCGAGGCCA TAGTTAAGAG GGAACCCATT GGGGTTGTGT TAGCCATAGT ACCATTCAAT
TACCCACTCT TCGACACTGT TAATAAAATA GCATACTCAG CAATAGCTGG TAATGCCGTC
TTAATTAAAC CAGCCTCAGC TGACCCATTA CCAACAATAC TCTTCGCCAG GGTACTTGAG
CTAGCTGGAT TCCCAGTTAA GGCACTTGCA GTATTAACAA TACCAGGTAG GGACATGGGC
AAGGTGGTTT CAGACAGGAG GATAGGGGCA ATTTCATTAA CTGGAAGCAC TGAGACCGGT
ATTGAGGTTA TTAGGGAGGC TGGCATTAAG CAGTTTGTAA TGGAGCTTGG TGGCGGTGAC
CCAGCCATAG TGCTTAATGA TGCTGACCCC AAGTGGGCTG CCCAAAGAAT AGCCATAGGC
ATATACAGTT ACGCTGGGCA AAGGTGTGAT GCAGTTAAGT TCATTTTCGC TGAACCAAAC
GTATATGATC AACTTAAGGC AAGCCTAATT GAGGAGTTAT CTAAGGTTAA GGTCGGTGAC
CCAAGAAGCC CAGACACAAC AATGGGTCCA TTAATAGATG AGGCCACGGC TGACGAAGTC
ATTAAGGCGG CTCAAGACGC AGTCTCCAAG GGTGGTAGAA TACTTTACGG TGGTAGGAAA
CTCGGCCCCA CTTACATTGA ACCCACTTTA ATTGAGATTG ATAAGAGTAA GGTTAAGGAC
CTGTACCTCT ACAATAAGGA GGTCTTCGCA GCCATAGCAG TGTTAGTGAA GGTTAATGAC
TTAGATGAGG CCATTGAATT ATCTAATGGT AGGAGGTATG GACTTGATGC AGCAATATTC
AGTAATGATG TAAGTAGGAT AAGGAAGGCA GCTAGGCTAC TTGAGGTTGG GGCAGTGTAC
GTGAATGATT ACCCAAGACA CGGTATAGGC TACTACCCAT TTGGAGGCAG GAAGGATTCA
GGTATTGGGA GGGAGGGACT TGGCTACACC CTTGAGTATG TTACGGCATA TAAGGCAATT
GTATACAATT ATAGAGGTAA GGGTGTCTGG AGGTACTCGT GA
 
Protein sequence
MIEVKGKVKV EVKSPELSSI LKTGPDGTPL FPTFINGQWY MGDNWQDVKS PIDLSVIAKV 
PRLPGNVTEQ AIETTYREGR WAIRDMPGQR RLDAFHRAAD LLDKFREDFV NVLVSNAGKT
TSAANGEVNS AIERLRRLDF DVEGVHGDYV PGDWSFDALE SEAIVKREPI GVVLAIVPFN
YPLFDTVNKI AYSAIAGNAV LIKPASADPL PTILFARVLE LAGFPVKALA VLTIPGRDMG
KVVSDRRIGA ISLTGSTETG IEVIREAGIK QFVMELGGGD PAIVLNDADP KWAAQRIAIG
IYSYAGQRCD AVKFIFAEPN VYDQLKASLI EELSKVKVGD PRSPDTTMGP LIDEATADEV
IKAAQDAVSK GGRILYGGRK LGPTYIEPTL IEIDKSKVKD LYLYNKEVFA AIAVLVKVND
LDEAIELSNG RRYGLDAAIF SNDVSRIRKA ARLLEVGAVY VNDYPRHGIG YYPFGGRKDS
GIGREGLGYT LEYVTAYKAI VYNYRGKGVW RYS
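
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon, as in the minimal sketch below. It is not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which start codons are permitted). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Sequences are accepted in the blocked layout used above, so whitespace is removed first.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_block = "ATGATTGAGG TTAAGGGTAA AGTTAAGGTT GAGGTTAAGT CGCCTGAACT ATCATCAATA"
print(translate(first_block))  # MIEVKGKVKVEVKSPELSSI
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) gives a quick consistency check, and `gc_fraction` can be compared with the GC content reported in the table.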