Gene Cmaq_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1714 
Symbol 
ID5709413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1791180 
End bp1792280 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content44% 
IMG OID641276224 
Productalcohol dehydrogenase 
Protein accessionYP_001541527 
Protein GI159042275 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.656799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTA TTGTAGTTAA ACCCCCGAAA CCCGGAGTGG AGGTTAGGGA TTTAAGTCAA 
GTGATTAGAC ATGGTTCAGG CACTGTTAAG GTGAGGATTC TTGAAAACGG CATATGTGGT
TCTGACCGTG AAATAGTAAA GGGGGAGTTA ACCACAGCTA GGCCGCCTGA GGGTAGGGAT
TGGCTTGTCC TGGGTCATGA AGCCCTAGGT ATTGTTGAGG ATTCAAGTGA CCCAAGGTTT
AAGCCAGGTG ACTTAGTAAT GCCTATTAAT AGGAGGAGTT ACCATGGTAA GTGCCTTAAC
TGCCTTGTGG GTAGACCGGA TTTCTGTGAA GCCAATGAGT TTGTTGAGGC AGGCATGGTT
GGGATGGATG GTTTCATGGT TGAGTACTGG TATGATGACC CCAAGTACCT TGTTAAAGTA
CCTAAGGACA TAGCTGACAT AGCTATTGTG GCTCAACCCC TTTCTGACCT TGAGAAGTCT
GTTGAGGAGA TACTTAATGT TCAGAGGAGA TTCATTTGGA CTTGTGATGA TGGAACATAC
AACTGCAGGA GAAGCATAGT CTTCGGCACA GGCTCAACTG GGATACTGAT TTCACTATTG
CTTAGGACAG TAGGGTTTGA GGTTTATGTG GCTAATAGGA GGGATCCACT TGAGAGTGAG
GCTAAGATCA CTGAGGAGGC TGGTATAATT TACTATAATT ACTCTAAGGA TGGTTTAGAT
AAACTTAAGT CAATGGGCTT CGACCTAGTA GTGGATACCA CTGGTGCATC AGCGTCATTA
ATTGGTCATG AGGTTGAGAT GCTTAAGCCA AATGGTATAC TAGGCCTATT TGGATTCCCA
TCCGAGGGGG AGTTAACGTT AAGGTATGAT GTTATACAAA GGTTCATTTA TAAATCCAAT
GCAATAGTGG GTTTAATAAA TGGGCAGAAG CCACACTTCC AGCAGGCTCT CGCTCACCTG
GCTCAATGGA AGGTTGTTTG GCCCACTGTG GCTAAGTCTT TAATAACTAG GGTTGTTGAC
GTTAATAATG ATAAGGAGTT ACTGCAGGTC CTTAACCATA AGGAGAGGGG GGAGATTAAG
GTTAAGATTA AGTGGAGTTA A
 
Protein sequence
MKAIVVKPPK PGVEVRDLSQ VIRHGSGTVK VRILENGICG SDREIVKGEL TTARPPEGRD 
WLVLGHEALG IVEDSSDPRF KPGDLVMPIN RRSYHGKCLN CLVGRPDFCE ANEFVEAGMV
GMDGFMVEYW YDDPKYLVKV PKDIADIAIV AQPLSDLEKS VEEILNVQRR FIWTCDDGTY
NCRRSIVFGT GSTGILISLL LRTVGFEVYV ANRRDPLESE AKITEEAGII YYNYSKDGLD
KLKSMGFDLV VDTTGASASL IGHEVEMLKP NGILGLFGFP SEGELTLRYD VIQRFIYKSN
AIVGLINGQK PHFQQALAHL AQWKVVWPTV AKSLITRVVD VNNDKELLQV LNHKERGEIK
VKIKWS