Gene Cmaq_1850 details


Gene Information

Locus tag: Cmaq_1850
Symbol:
ID: 5709596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1927975
End bp: 1929219
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 44%
IMG OID: 641276357
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001541657
Protein GI: 159042405
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
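
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record; treated as 1-based and inclusive (an assumption).
start_bp = 1927975
end_bp = 1929219

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# The final codon is a stop codon and contributes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1245 0 414
```

Under that assumption the computed values match the listed gene length of 1245 bp and protein length of 414 aa.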


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGTTA GGATTAATAA GTCGGTGGCG TTTGGCTCAG TGAAGGCTCC TAGATCAAAG 
AGCTGGGCTA TTAGGCTAAT ATTATTATCA GCGATTAGTG ATGAGGAGAC CACAATCTGC
AGTATACCGG ATTCTGATGA TACTGAAGCC GCCTTAAGGA TGATTGAGGT CTTAGGATCA
AAGGTGATTA GGCAGGGTAA CTGCATTAGG GTAATTCCTA ATTTAAGGCA ATGCGGCGGC
TACGTCAATG TCGGTGGCTC CGGAACGGTG ATGAGGCTTG GAGTAGCCTT GGCGTCATCA
TGTAGAAACC CAGTGATTAT TGATGGTGAT GAAACCCTTA AGAGGAGGCC GATACGGGAA
TTACTGGAAT CATTAAGGAG CCTGGGGGTT AATGTGAATG GTGATTCATT ACCAGTGGCC
ATTAATGGAC CAGTTAAGGG TAATTATGTT GAGATTAGGG GTGATTTAAC AAGCCAGTAT
ATTTCAGGGT TAATAATGCT TGGCCTGGTA TCAGGTATTA CTATACGTGT AATTGGTGAC
TTAGTGTCTA GGCAGTACGT GGATTTGACT AGGAGGATTA TTGAGGAATC AGGGTGTAGT
GTGGGGGTAA GTAATGATGT AATTACTGTT AATGAATGCA TACCGAGGAT TAGTTTAAGT
AATGTGCCTG GTGATTACGC ATTGTCAGGA TTCTACACGG CCTTAGCCTT AGCCACGGGT
GGATTAGTGA CTGTAACTGG GTTACCGAAA CCACTTGGTT ACGGTGATGA TTCACTAGTG
AATATATTTA GCAATGCAGG TGCACGCAGC GTCTTCAGTA ACGGTGATTG GAGTGTTGAG
GGTGGAGGTG AATTAAGGGG TATTGTGGTT GATTTAAAGG ATTCCCCTGA CCTAGCCCCA
GTGGTAGCAT CAATCGCACC CTTTGCCTCG GGTGAAACGG TGATAACTGG GGTGAGGCAT
CTTGCCTTTA AGGAGAGTAA TAGATTGGAG ACAATATCAG ATTCCTTAAG GGCGTTTGGT
GTTAATGTTA ACCATGGTGA TGACTCCTTG AGGATAAGTG GGTCAATAAC CCATGGTGCA
TTAATCAAGT GTCCAAACGA CCACAGGATA GCAATGATGA GTGGTGTAGT TGCCGCAGGT
TCTAACGGTG AGTCAATTAT TCATAATGCT GAGTGTGTTA ATAAAAGTAA TAGATTATTC
TGGAGGGACT TGGTGAAGCT TGGAGTTAAA TTAACCATTA ATTAA
 
Protein sequence
MIVRINKSVA FGSVKAPRSK SWAIRLILLS AISDEETTIC SIPDSDDTEA ALRMIEVLGS 
KVIRQGNCIR VIPNLRQCGG YVNVGGSGTV MRLGVALASS CRNPVIIDGD ETLKRRPIRE
LLESLRSLGV NVNGDSLPVA INGPVKGNYV EIRGDLTSQY ISGLIMLGLV SGITIRVIGD
LVSRQYVDLT RRIIEESGCS VGVSNDVITV NECIPRISLS NVPGDYALSG FYTALALATG
GLVTVTGLPK PLGYGDDSLV NIFSNAGARS VFSNGDWSVE GGGELRGIVV DLKDSPDLAP
VVASIAPFAS GETVITGVRH LAFKESNRLE TISDSLRAFG VNVNHGDDSL RISGSITHGA
LIKCPNDHRI AMMSGVVAAG SNGESIIHNA ECVNKSNRLF WRDLVKLGVK LTIN
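
The gene and protein sequences can be cross-checked against each other. The Python sketch below strips the 10-base block spacing, computes GC content, and translates codons until the first stop codon. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The helper names are illustrative, and only the first three blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard amino acid assignments, ordered with bases T, C, A, G
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
gene_start = "ATGATTGTTA GGATTAATAA GTCGGTGGCG"
protein_start = translate(gene_start)
print(protein_start)  # MIVRINKSVA
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the listed GC content and the complete protein.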