Gene Hhal_0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0760 
Symbol 
ID4711199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp843209 
End bp844501 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content55% 
IMG OID639855221 
ProductEPSP synthase (3-phosphoshikimate 1-carboxyvinyltransferase) 
Protein accessionYP_001002340 
Protein GI121997553 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0484678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATCCCT TTACGGCGAG TGTTCCAAAA GAGAACAACG GCGGTTATCT CAAAATTCTA 
CCTGCACGTT TGAGTGGTAC GGTGCGGCTA TCGGGAGCCA AGAACGCCGC CTTGCGTCAT
TTAGCTGCCA GCTTGCTCAC TGATGAACCA GTGATTCTTA ATAATTTCCC CGCCAGCCTG
CTAGATGCGA AGATTCACGT CGAGATGCTG GAGCGCTTGG GCAAACGTGC GGCATTACGA
AAACAAGACT GCATCATCTT GAGCCAATTA ACGCCTTCAA CATCACGGTT AGAATGGACA
GGCCGATCGA TTCGAAACAC GCTGTTGATC CTGGGTGCAT TAGTCGCACG TACCGGGGCT
GGCGCGGTTC CCTTGCCAGG GGGTTGTAAG CTGGGTGAGC GGAAGTACGA CCTGCATGTT
AGTGTACTCG AGAGTCTGGG GGCTACTGTC TGGGAAGAAG ACGATACACT TTGCGCCGAG
GCCCCTCAAG GCTTAACAGG CATGGATATC CACTTACGTA TTCGCTCAAC AGGAGCGACC
GAGAACGCAA TTCTGTGTGG CACCTTGGCC CGTGGTGTCA CTCGCATCTG GAATCCACAT
ATCCGACCAG AGATCCTCGA CCTGATCGAA CTACTGCGCA AAATGGGCGC ACACATTCGC
GTTTTTGGCC AGGAGCACAT CGAAGTGACG GGTGTAGAGG GCCTCGGCGG TGCAGAGCAT
ACGGTTATCG CAGACAATAT GGAGGCTATA ACTTGGCTGG TTGGGGCTGC GATTTCTGGC
GGCGATGTGG AGATTGAGGG TTTTCCGATT TCAGACATGG AAGTAGTACT TGCTCATTTG
AAATCCGCGG GCGCGAAGAT CTATCAAGGT GCCGGATCGG TCATTGTGCG GGGAGGCCCA
TGCTACCCGC TGGAAATCAG CACCGGACCG CACCCCGGGA TCAATTCTGA TGTGCAACCG
ATCCTTGCGG CATGGGCAGC CCATGCTAAG GGTGAGTCGC GCATCATCGA TTTACGTTTT
CCAGGGCGCT ATGCCTATGC GGAAGAACTG GCTCGTATGG GTCTTTGCCA CCAATTACAT
GGCGATATGT TGCTGATCTA TGGTAATGGC GGCGGGCTGC ACGGGGCTGA AGTACGGGCA
CTTGACTTAC GTGCAGGTGC CGCACAAGTA CTGTGCGGCC TGACGGCAGA AGGCGAGACC
GTGATCCACG ATGCGTGGCA ACTTCTCCGG GGTTATGACC GTTTTGCTGA AAAACTAGCA
GCATTGGGTG TGGAGTGCTG GTGGAAGAAT TGA
 
Protein sequence
MDPFTASVPK ENNGGYLKIL PARLSGTVRL SGAKNAALRH LAASLLTDEP VILNNFPASL 
LDAKIHVEML ERLGKRAALR KQDCIILSQL TPSTSRLEWT GRSIRNTLLI LGALVARTGA
GAVPLPGGCK LGERKYDLHV SVLESLGATV WEEDDTLCAE APQGLTGMDI HLRIRSTGAT
ENAILCGTLA RGVTRIWNPH IRPEILDLIE LLRKMGAHIR VFGQEHIEVT GVEGLGGAEH
TVIADNMEAI TWLVGAAISG GDVEIEGFPI SDMEVVLAHL KSAGAKIYQG AGSVIVRGGP
CYPLEISTGP HPGINSDVQP ILAAWAAHAK GESRIIDLRF PGRYAYAEEL ARMGLCHQLH
GDMLLIYGNG GGLHGAEVRA LDLRAGAAQV LCGLTAEGET VIHDAWQLLR GYDRFAEKLA
ALGVECWWKN