Gene Cmaq_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1952 
Symbol 
ID5709856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp2029240 
End bp2030622 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content45% 
IMG OID641276460 
ProductUDP-glucose/GDP-mannose dehydrogenase dimerisation 
Protein accessionYP_001541758 
Protein GI159042506 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.868357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAATC TTAATCTATT GTTGATGAGT AATGAGGATT TAGTTAATGC CTTGAGGAGT 
GGTTCATTAA CCGTATCCGT TTACGGCATG GGTTACGTAG GTACTGCCAT TGCTGCCGTA
TGGCTTAGGG CTGGTGCCAG GGTTATTGGT GTTGATGTTG ATGCTGAGAA GATTAAGAAA
CTGGATGCCT GTGAACTTAA GTTAAGTGAT AGGCAGGTTG AGGAGGAGTT GAGGAGGCTT
AAGGATAGGA TAAGCTACAC CACTGATGGT GTTGAGGCAT CGAGGTTAAG TAATGTGAAG
ATTGTTACTG TCCCAGTTTA CTTAGGTAAG GATAAGAGAC CAATCTTCGA TTCCTTTAAG
GCTTCAATAG AGAATATTGC CAGGGGGCTT AAGGTCGGTG ACTTAGTCAT AATAGAGTCC
TCTGTTCCCC CTGGCACAAC AATGGATGTT GCCTTACCCA TCCTGGAGAG GATTAGTGGA
TTAAGGGTTG AGAGGGATTA TGCATTAGCC TACTCGCCTG AACGCATCTA CGTTGGTAGG
GCTATTGCTG ATATTGAGGA GCGTTATCCT AAGGTTATTG GTGGTGTTGG ACCCATTAGT
AGCAGGGTTG CGTCCACATT ATACGGTGCC ATAGCTAGGA AGGGTACCTT AATTTTATCC
AATCCAACAG CCGCTGAGTT TGAGAAGCTT GCGGAGGGTG CTTATAGGGA CGTGAACATA
GCGTTGGCTA ATGAGCTTGC CCAATTAGCT AGGTTACTTG GATTAGATTT CGATGAGATA
AGGGAGGCGG CTAATAGTCA ACCCTACTCT AATATTCATA AGCCTGGCCC AGGGGTTGGT
GGATCATGCA TACCAGTGTA CCCCTACTTC CTAATGTATG CTGCTGAGAG GGCTGGCTTC
AACATGAAGC TTGTTCAAAC AGCCAGGGGT ATTAATGAGT ATGCGCCCGC TTATGTGGCT
GAGTTGATTA AGACTGCTGC GGGTGAATTA GGGGTTAGTA GGCCTAGGGT GGCTGTGTTG
GGTTTAGCCT TTAGGGGTAA TGTTGATGAC ACTAGGCTTA GTCCATCATA CGACATAATT
AATTACCTAA GGGGTTCAAT GGATATTATT GTTCATGATC CATACGTTAA ATTCGATAAA
ACCCTTGAGG AATGGGGGAT TAGGTTAACT AACAGTATTG AGGATGCGTT AAAGGGGGCT
AACATAGTGG TGATAGCAAC AGATCACAGT GATTACGGTG GGTTAACGTT AAGTAGGATT
ATTCAATTAA CAGGCTTAAG CAGTATTGCC GTAGTGGATT CAAGGCACAT GATTAAGGAT
TGGAGAAACC CACCCCCGGG TGTTGTTTAC CTGGCTGTGG GTAGACCCAC TGCAAAGGCG
TGA
 
Protein sequence
MGNLNLLLMS NEDLVNALRS GSLTVSVYGM GYVGTAIAAV WLRAGARVIG VDVDAEKIKK 
LDACELKLSD RQVEEELRRL KDRISYTTDG VEASRLSNVK IVTVPVYLGK DKRPIFDSFK
ASIENIARGL KVGDLVIIES SVPPGTTMDV ALPILERISG LRVERDYALA YSPERIYVGR
AIADIEERYP KVIGGVGPIS SRVASTLYGA IARKGTLILS NPTAAEFEKL AEGAYRDVNI
ALANELAQLA RLLGLDFDEI REAANSQPYS NIHKPGPGVG GSCIPVYPYF LMYAAERAGF
NMKLVQTARG INEYAPAYVA ELIKTAAGEL GVSRPRVAVL GLAFRGNVDD TRLSPSYDII
NYLRGSMDII VHDPYVKFDK TLEEWGIRLT NSIEDALKGA NIVVIATDHS DYGGLTLSRI
IQLTGLSSIA VVDSRHMIKD WRNPPPGVVY LAVGRPTAKA