Gene Hlac_0687 details


Gene Information

Locus tag: Hlac_0687
Symbol:
ID: 7401822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 704332
End bp: 705693
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 72%
IMG OID: 643707753
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_002565359
Protein GI: 222479122
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
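
The start, end, and length values in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the last codon of the gene is a stop codon that adds no residue to the protein. A minimal Python sketch of that arithmetic, assuming that convention (the function names are illustrative and are not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(704332, 705693)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```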


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTCA ACGTCACCAG TTCGACCGTG CGGGGTACCA CTCGCGCGCC GCCCTCGAAG 
AGCTACACCC ACCGCGCGCT GCTGGCCGCG GGGTACAGCG ACGGCGCGAC CGTGCGCTCC
CCGCTCGTCT CGGCCGACAC GAAGGCGACC GCCCGGGCCG TGACCGCCTT CGGCGGCGCG
GTCGAGCCGG AATCCGGAGA GCGTTTCGAC GACGCCGACG CGCTCGTCGT CGACGGATTC
GATGGCCGAC CCGCCGTTCC CGACGACGTG ATCGACTGCG CGAACTCCGG GACGACGATG
CGGCTCGTCA CCGCCGCGGC CGCGCTCGCG GACGGGACCA CGGTGTTGAC GGGGGACGAA
TCCCTCCGCT CGCGCCCGCA GGGACCCCTT CTGGAGGCGC TCGGTGACCT CGGCGTGCGC
GCGGAGTCGA CCCGCGGGAA CGGACAGGCG CCGCTCGTCG TGAGCGGACC GCTCGCGGGC
GGCGAAGTCG CGATCCCCGG CAACGTGTCC TCGCAGTATG TCACCGCCCT GTTGATGGCC
GGCGCGGTCA CCGAGGAGGG CGTCGAAATC GACCTGACGA CGCCCCTCAA ATCGGCCCCG
TACGTCGATA TCACGCTCGA ACTACTCGAC GATTTCGGGA TCGAGGCCAC GCCGGTCGGC
GACGGTGGCG ACGCGCTCGA CGGCGCGGCC GGTGCGGCGG GCTTCGTCGT CGACGGTGGA
CAGTCGTACG CGCCCGCGGG CGGGAGCTAC ACCGTTCCCG GCGACTTCTC CTCGATCTCG
TACCTGGTCG CCGCCGGCGC GGTCGCCGCC GAGCCGGGAG AGCCGGTTCG GATCGAGGGC
GCCGTGCCGA GCGCGCAGGG CGACTCCGCG ATCGTGGAAA TCGTCGAGCG CATGGGCGCC
GACATCGAGT GGGACCGCGA GGCCGGCGTC ATCACCGTCC GGCGCTCCGA GCTGTCGGGC
GTCGAGGTCG ACGTGGGTGA CACGCCCGAT CTGCTCCCCA CGATCGCCGC GCTCGGCGCG
GTCGCCGACG GCGACACCCG AATCATGAAC TGCGAGCACG TCCGGTACAA GGAGACGGAT
CGCGTCTCGG CGATGGCCGA GGAGCTGGAG AAGTTGGGCG CAAAGACGAC CGAAGAGCCG
GACACGCTGA CGGTCCACGG TTCCGAGAGC GACCTCCGGG GCGCGAGCGT CGACGGGCGC
GCCGACCACC GGATCGTGAT GGCGCTCGCC GTGGCCGCGC TCGTCGCCGA GGGCACCACG
ACGATCCGCG GCGGCGAGCA CGTCGACGTC TCCTTCCCGA ACTTCTTCGA TGCGATGGCC
GACCTCGGGA TAGCCGTCGA GCGCGACGGC GCGGGCGAGT AG
 
Protein sequence
MDVNVTSSTV RGTTRAPPSK SYTHRALLAA GYSDGATVRS PLVSADTKAT ARAVTAFGGA 
VEPESGERFD DADALVVDGF DGRPAVPDDV IDCANSGTTM RLVTAAAALA DGTTVLTGDE
SLRSRPQGPL LEALGDLGVR AESTRGNGQA PLVVSGPLAG GEVAIPGNVS SQYVTALLMA
GAVTEEGVEI DLTTPLKSAP YVDITLELLD DFGIEATPVG DGGDALDGAA GAAGFVVDGG
QSYAPAGGSY TVPGDFSSIS YLVAAGAVAA EPGEPVRIEG AVPSAQGDSA IVEIVERMGA
DIEWDREAGV ITVRRSELSG VEVDVGDTPD LLPTIAALGA VADGDTRIMN CEHVRYKETD
RVSAMAEELE KLGAKTTEEP DTLTVHGSES DLRGASVDGR ADHRIVMALA VAALVAEGTT
TIRGGEHVDV SFPNFFDAMA DLGIAVERDG AGE
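
As a rough cross-check, the protein sequence can be derived from the gene sequence by removing the spacing between the 10-base blocks, reading the result codon by codon, and stopping at the first stop codon. The sketch below assumes the codon assignments of translation table 11, which match those of the standard genetic code (the two tables differ in the start codons they permit). The helper names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)
```

For example, `translate("ATGGACGTCA ACGTCACCAG TTCGACCGTG")` returns `"MDVNVTSSTV"`, the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the gene information.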