Gene Hlac_1114 details

Gene Information

Locus tag: Hlac_1114
Symbol:
ID: 7400923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1120190
End bp: 1121377
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 69%
IMG OID: 643708179
Product: chorismate synthase
Protein accession: YP_002565778
Protein GI: 222479541
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
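
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. A minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (the function names are hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1120190, 1121377)
print(length)                  # 1188
print(protein_length(length))  # 395
```

Both values agree with the Gene Length and Protein Length fields of the record.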


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGGA ACCGGTTCGG TCGGCTCTTC CAGGTGACGA CGTACGGCGA GAGCCACGGC 
GAGGCGATGG GTGTGACGGT CTCGGGCGTG CCCGCCGGCG TCGAGCTCGA CGAGGAGGCG
ATCCAAGCAC AGCTTGACCG GCGCAAGCCG GGCCAGTCGA TGATCACCAC CTCCCGGGGC
GAGCCCGACG AGGTCGTCGT CAACTCCGGC GTACAGGACG GCTACACCAC CGGAACGCCG
ATCGGAATGG TGATCCAGAA CAAGGACGCG CGCTCGGGGA AGTACGAGCC GTACGTCACC
GCGCCGCGCC CATCGCACGG CGATTACACC TACTCCGCGA AGTTCGGCAC GCGCAACTGG
GGCGGCGGCG GGCGCTCCTC CGCCCGGGAG ACGGTGAACT GGGTCGCGGC CGGCGCGGTC
GCCGAGCAGG TGCTCGACGC CTCCGAGTAC GACGTGGAGA TCAAAGCCCA CGTGAACCAG
ATCGGCGACG TCGAGGCCGA CGACGTGAGC TTCGAGCAGA TACTCGACCA CAGCGAGGAG
AACGACGTGC GCTGTGCCGA CCCCGAGGCG GCCGCCGAGA TGCAGGAGCT GATTGAACGG
TATCAGGAGG CGGGCGACTC CATCGGCGGC TCCATCTACT TCGAGTGCCG CGGCGTCCCC
CGCGGGCTCG GCGCCCCGCG CTTCGACGGC TTCCCGTCCC GGCTCGGGCA GGCGATGTTC
TCGATCCCGG CGACCACGGG CGTCGAGTTC GGACTCGGCA AAGACGCCGT GAACGTGACC
GGGAGCGAGC GCAACGAGGA CTGGACATTT GACGACGGCG AGTCGTTCGA CCATGTCGAA
AGCGAGGAGG GCGATCCGGT CCCCGTCGGG AACGACCACG GCGGGCTCCA GGGCGGGATC
ACGACCGGTG AGCCCATTTA CGGCGAGGCG ACGTGGCACG CGCCCACCTC GATCCCGAAA
AAGCAGCGCT CCGCCGACTG GGAGACGGGC GAGGAGAAGG ACGTGCAGGT CGTCGGCCGG
CACGACCCCG TCCTCCCGCC GCGGGCCGTC CCCGTCGTCG AGGCGATGCT GTACTGCACC
GTCCTCGACT TCATGTTGCT CGCCGGCCGG ATCAACCCCG ACCGCGTCGA CGGCAACCCG
GGCCAGTACG ACACCGACTA CCACCCGAGC AGCCCCGACA ACGATTGA
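
The GC content field of the record can be cross-checked against the gene sequence by counting G and C bases. A minimal sketch, assuming the sequence is supplied as a string that may still contain the spaces and line breaks of the listing (`gc_content` is a hypothetical helper; only the first ten bases are used here, so the printed value is not the full-gene figure):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence, as formatted in the listing.
print(gc_content("ATGAACGGGA"))  # 50.0
```

Passing the full 1188 bp sequence to the same function gives the gene-wide percentage to compare with the reported value.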
 
Protein sequence
MNGNRFGRLF QVTTYGESHG EAMGVTVSGV PAGVELDEEA IQAQLDRRKP GQSMITTSRG 
EPDEVVVNSG VQDGYTTGTP IGMVIQNKDA RSGKYEPYVT APRPSHGDYT YSAKFGTRNW
GGGGRSSARE TVNWVAAGAV AEQVLDASEY DVEIKAHVNQ IGDVEADDVS FEQILDHSEE
NDVRCADPEA AAEMQELIER YQEAGDSIGG SIYFECRGVP RGLGAPRFDG FPSRLGQAMF
SIPATTGVEF GLGKDAVNVT GSERNEDWTF DDGESFDHVE SEEGDPVPVG NDHGGLQGGI
TTGEPIYGEA TWHAPTSIPK KQRSADWETG EEKDVQVVGR HDPVLPPRAV PVVEAMLYCT
VLDFMLLAGR INPDRVDGNP GQYDTDYHPS SPDND