Gene Information |
Locus tag | Hhal_0760 |
Symbol | |
ID | 4711199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 843209 |
End bp | 844501 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639855221 |
Product | EPSP synthase (3-phosphoshikimate 1-carboxyvinyltransferase) |
Protein accession | YP_001002340 |
Protein GI | 121997553 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase |
TIGRFAM ID | [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
|
|
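The coordinate and length fields in the record above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length of an unspliced CDS includes its terminal stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(843209, 844501)
print(length_bp, protein_length(length_bp))  # prints: 1293 430
```

Under these assumptions the result agrees with the Gene Length (1293 bp) and Protein Length (430 aa) rows.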
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0484678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATCCCT TTACGGCGAG TGTTCCAAAA GAGAACAACG GCGGTTATCT CAAAATTCTA CCTGCACGTT TGAGTGGTAC GGTGCGGCTA TCGGGAGCCA AGAACGCCGC CTTGCGTCAT TTAGCTGCCA GCTTGCTCAC TGATGAACCA GTGATTCTTA ATAATTTCCC CGCCAGCCTG CTAGATGCGA AGATTCACGT CGAGATGCTG GAGCGCTTGG GCAAACGTGC GGCATTACGA AAACAAGACT GCATCATCTT GAGCCAATTA ACGCCTTCAA CATCACGGTT AGAATGGACA GGCCGATCGA TTCGAAACAC GCTGTTGATC CTGGGTGCAT TAGTCGCACG TACCGGGGCT GGCGCGGTTC CCTTGCCAGG GGGTTGTAAG CTGGGTGAGC GGAAGTACGA CCTGCATGTT AGTGTACTCG AGAGTCTGGG GGCTACTGTC TGGGAAGAAG ACGATACACT TTGCGCCGAG GCCCCTCAAG GCTTAACAGG CATGGATATC CACTTACGTA TTCGCTCAAC AGGAGCGACC GAGAACGCAA TTCTGTGTGG CACCTTGGCC CGTGGTGTCA CTCGCATCTG GAATCCACAT ATCCGACCAG AGATCCTCGA CCTGATCGAA CTACTGCGCA AAATGGGCGC ACACATTCGC GTTTTTGGCC AGGAGCACAT CGAAGTGACG GGTGTAGAGG GCCTCGGCGG TGCAGAGCAT ACGGTTATCG CAGACAATAT GGAGGCTATA ACTTGGCTGG TTGGGGCTGC GATTTCTGGC GGCGATGTGG AGATTGAGGG TTTTCCGATT TCAGACATGG AAGTAGTACT TGCTCATTTG AAATCCGCGG GCGCGAAGAT CTATCAAGGT GCCGGATCGG TCATTGTGCG GGGAGGCCCA TGCTACCCGC TGGAAATCAG CACCGGACCG CACCCCGGGA TCAATTCTGA TGTGCAACCG ATCCTTGCGG CATGGGCAGC CCATGCTAAG GGTGAGTCGC GCATCATCGA TTTACGTTTT CCAGGGCGCT ATGCCTATGC GGAAGAACTG GCTCGTATGG GTCTTTGCCA CCAATTACAT GGCGATATGT TGCTGATCTA TGGTAATGGC GGCGGGCTGC ACGGGGCTGA AGTACGGGCA CTTGACTTAC GTGCAGGTGC CGCACAAGTA CTGTGCGGCC TGACGGCAGA AGGCGAGACC GTGATCCACG ATGCGTGGCA ACTTCTCCGG GGTTATGACC GTTTTGCTGA AAAACTAGCA GCATTGGGTG TGGAGTGCTG GTGGAAGAAT TGA
|
Protein sequence | MDPFTASVPK ENNGGYLKIL PARLSGTVRL SGAKNAALRH LAASLLTDEP VILNNFPASL LDAKIHVEML ERLGKRAALR KQDCIILSQL TPSTSRLEWT GRSIRNTLLI LGALVARTGA GAVPLPGGCK LGERKYDLHV SVLESLGATV WEEDDTLCAE APQGLTGMDI HLRIRSTGAT ENAILCGTLA RGVTRIWNPH IRPEILDLIE LLRKMGAHIR VFGQEHIEVT GVEGLGGAEH TVIADNMEAI TWLVGAAISG GDVEIEGFPI SDMEVVLAHL KSAGAKIYQG AGSVIVRGGP CYPLEISTGP HPGINSDVQP ILAAWAAHAK GESRIIDLRF PGRYAYAEEL ARMGLCHQLH GDMLLIYGNG GGLHGAEVRA LDLRAGAAQV LCGLTAEGET VIHDAWQLLR GYDRFAEKLA ALGVECWWKN
|
| |
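The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch of that cleaning plus two simple checks, GC fraction and start/stop codon validity. The helper names are hypothetical, and the start codon set is an assumption based on the bacterial genetic code (translation table 11), under which GTG can act as an initiator and is read as methionine at the first position.

```python
# Assumed initiator and stop codons for translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_ends(seq: str) -> bool:
    """True if the length is a multiple of 3 and the CDS starts and stops on allowed codons."""
    seq = clean(seq)
    return (
        len(seq) % 3 == 0
        and seq[:3] in TABLE_11_STARTS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence shown above.
print(gc_fraction("GTGGATCCCT TTACGGCGAG"))
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content row, and `has_valid_ends` would confirm that the sequence begins with GTG and ends with TGA.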