Gene Cmaq_0807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0807 
Symbol 
ID5708720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp842454 
End bp843449 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content49% 
IMG OID641275310 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_001540632 
Protein GI159041380 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.021936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.252673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAATAA GGGTAACGCC AGGTAAGATT AATGAAGTTG CATCAGCTCT AGATAAGGCT 
AAGGTAAGGT TCAGGGAGGT TAAGCTGCTT GGTGAGGAGT TAATAGTGAC TTGGCCTGAA
GACCCTGTTG ATGAGGGAGC CATTAGAGTG ATTGACCCTG GGGCAGTACT AGTTAACGTT
AAGGCCAAGT ACCAGTTAGC CAGTAAGCAA TGGAGGCAGA GGAGCATTGT GGATGTTTCA
GGGGTTAAGA TAGGGGGTGA TGATTTAGTG GTTGCAGCTG GGCCATGTGC AGTGGAGAGT
TATGAGCAGG TTAAGGAGAC TGCCGAGGCA GTTAAGGGGG CTGGAGCAAG ACTACTGAGG
GGTGGGGCGT TTAAACCTAG GACAAGTCCC TACAGTTTCC AGGGACTTGG AGTAGATGGC
TTAAAGATAC TGAGGCGAGT CTCAGATGAG GTTGGTTTAC CCGTAGTCTC TGAGGTTATG
GATACTAGGA TGGTTGAGGT GGTGGCCAGT TACGTTGACA TGATTCAGAT AGGGGCTAGG
AATGCCCAGA ATTTTGACCT ACTTAAAGAG GCTGGTAAGA CTGGGAAACC AATACTACTC
AAGAGGGGTA TGGGAAACAC GGTTGAGGAG TGGCTTCAGG CAGCGGAATA CATCATGCTT
GAGGGTAATG GTAACGTAGT GCTTTGTGAA AGAGGGATAA GGACCTTTGA GAACGCCACG
AGATTCACGC TGGACTTAGG TGCAGTGGTG GCGGCTAAGA AATTAACCCA CTTACCAATA
TGCGTGGATC CATCACACCC AGCCGGTAAG AGGGAGTACG TTATTCCACT GGCCTTAGCC
GCAGTGGCAG CTGGGGCAGA TATGATTATT GTTGAGGTTC ACCCAAGGCC GTGGGAGGCT
TTATCAGACT CCGAGCAGCA ATTAACCTTC GATATGTTTA ATGAATTAAT GAGTAAGGCT
AAGGCAGTAG CCCAGGCAAT AGGTAGGGGT ATATGA
 
Protein sequence
MIIRVTPGKI NEVASALDKA KVRFREVKLL GEELIVTWPE DPVDEGAIRV IDPGAVLVNV 
KAKYQLASKQ WRQRSIVDVS GVKIGGDDLV VAAGPCAVES YEQVKETAEA VKGAGARLLR
GGAFKPRTSP YSFQGLGVDG LKILRRVSDE VGLPVVSEVM DTRMVEVVAS YVDMIQIGAR
NAQNFDLLKE AGKTGKPILL KRGMGNTVEE WLQAAEYIML EGNGNVVLCE RGIRTFENAT
RFTLDLGAVV AAKKLTHLPI CVDPSHPAGK REYVIPLALA AVAAGADMII VEVHPRPWEA
LSDSEQQLTF DMFNELMSKA KAVAQAIGRG I