Gene Information |
Locus tag | Haur_0970 |
Symbol | |
ID | 5732856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1112376 |
End bp | 1113671 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278102 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001543746 |
Protein GI | 159897499 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
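The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends in a single untranslated stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1112376
END_BP = 1113671
REPORTED_GENE_LENGTH = 1296      # bp
REPORTED_PROTEIN_LENGTH = 431    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # 1296 431
```

Both computed values agree with the reported Gene Length and Protein Length fields.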
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.138964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA CAGTTAGTCA TGCCAAGCGC CTGCGCGGGG CAATTAGCGT CCCAGGTGAT AAATCGATCT CGCATCGCTC GGTGTTATTT AATGCTTTAG CCGAGGGCAA CGCCGAAATT ACGGGGTTCT TGCCAGGCGC TGATTGTCTT TCGTCAATCG CTTGTTTGCG CCAAATGGGC GTTGAAATTG AACACAGCGA CGATAAGGTA CGGGTGTTCG GGCGGGGTTT GCGTGGCCTG CGTGAGCCAA GCGACGTTTT AGATTGTGGT AATTCGGGTA CAACGCTCCG TTTGTTGGCA GGTTTATTGG CTGGTCAGCC ATTTTTGAGC GTGCTAACTG GCGATGCGTC GTTGCGTTCA CGCCCTCAGA AACGCATTGT TGAACCATTA CGCCAACTAG GAGCCAAGCT CGATGGCCGC GATAACGGCA ACCGTGCACC CTTGGTGATT CGTGGCACAA CCATTCATGG TGGCAACTAT GAATTGCCGA TCGCCAGTGC TCAAGTTAAA TCGGCCTTGC TCTTGGCTGG TTTAACTGGC GATGCGCCAA TGCGTTTATC GGGCAAAATC GTTAGCCGCG ACCATACCGA GCGCATGTTG ATCGCCATGG GAATTGATCT CACCGTTAAA GATGATGAGA TTGTGCTCTA TCCACCGAGC CATCCGGTTT TCCCCTATCC GCTTTCGTTG CATGTTCCAG GCGATCCTTC GTCGGCAACC TTTTGGTGGG TAGCCGCAGC GATTCACCCC GATGCCGAAA TTACCACCTT GGGCGTTGGA TTAAACCCCA GTCGCACTGG AGCGCTCGAT GTGCTCAAGG CCATGGGCGC TGATATTACG ATCAGCAATG AGCGCAATGA AGGTGCAGAG CCTGTTGGCG ATGTAACCGT GCGTGGCGGT GGCTTACGAG GCACACGCAT CGATGGCGAT TTAATTCCGC GTTTGATCGA TGAAATTCCG GTGCTGGCGG TGGCGGCAGC CTGTGCAGTT GGCGAAACCG TGGTTGCCGA TGCCGAAGAA CTGCGGGCCA AAGAAACCGA TCGGGTAGCC ACAGTGGTTA GCGAACTAAC AGCCATGGGT GCGACCCTCG AAGCCACACC CGATGGCATG ATCATCGCTG GTGGTGGCGA ACTCCAAGGC GCTCACGTTC AATCGCATGG TGATCATCGC ATCGCGATGG CCTTGGCGGT GGCTGGCTTA GTGGCCGAAG GCGAAACGAT TATCGACGAA GCTGAAGCCG TGACCGTCTC GTACCCAACA TTCTGGCAGC ATTACGCGCA GATCAAAGAA GCCTGA
|
Protein sequence | MKQTVSHAKR LRGAISVPGD KSISHRSVLF NALAEGNAEI TGFLPGADCL SSIACLRQMG VEIEHSDDKV RVFGRGLRGL REPSDVLDCG NSGTTLRLLA GLLAGQPFLS VLTGDASLRS RPQKRIVEPL RQLGAKLDGR DNGNRAPLVI RGTTIHGGNY ELPIASAQVK SALLLAGLTG DAPMRLSGKI VSRDHTERML IAMGIDLTVK DDEIVLYPPS HPVFPYPLSL HVPGDPSSAT FWWVAAAIHP DAEITTLGVG LNPSRTGALD VLKAMGADIT ISNERNEGAE PVGDVTVRGG GLRGTRIDGD LIPRLIDEIP VLAVAAACAV GETVVADAEE LRAKETDRVA TVVSELTAMG ATLEATPDGM IIAGGGELQG AHVQSHGDHR IAMALAVAGL VAEGETIIDE AEAVTVSYPT FWQHYAQIKE A
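The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard code (the tables differ in which start codons are permitted), so a standard codon table is used here. The `translate` helper is hypothetical, written for illustration only, and it strips the spaces that separate the 10-base blocks in the record.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-separating whitespace
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAACAGA CAGTTAGTCA TGCCAAGCGC"))  # MKQTVSHAKR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison against the complete protein sequence.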
|
| |