Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4081 |
Symbol | |
ID | 5735940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5213440 |
End bp | 5214510 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281233 |
Product | chorismate synthase |
Protein accession | YP_001546841 |
Protein GI | 159900594 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGGAA ATAGTTTTGG TCGGTTGTTC CGAATTTCAA CATGGGGCGA ATCCCATGGA GTTGGCTTGG GCGTGGTGAT CGATGGTTGC CCCGCAGGTT TGGAGCTTGA TTTGGCTGCC ATCCAAGCCC AATTGGATCG CCGTCGGGTT GGACAAAGCC GCATGACTTC GGCTCGGCGG GAGCCTGATG AAGTTGAGAT TTTATCGGGC ATGTTTGAGG GTCGCACGAC TGGTACAGCC TTGGCAATGC TGATTCGCAA TACCAACGCC CGATCCAGCG ATTACGATGC AATCAAACAT TTATATCGAC CTGGCCATGC TGACCATAGT TACGATGCTA AATATGGCTT CCGTGATTAT CGTGGTGGTG GTCGTTCGAG CGCACGCGAA ACCGCCGCGC GGGTTGCGGC TGGCGCAGTC GCCCGCCAAA TCTTGGCCAC AATGGGCATT AGCTTGGTGG CCTATACGCT GAGTTTAGGC CATCTCAAAG CCCAAATCAT CGACGAAAAC GAAATTGAAA ATAACATTAT GCGCTGCCCA GACCCCGCTG TGGCCGAGCA GATGATCGCC TATGTCGATC AAGCCCGCCG CGATTTGGAT TCGCTGGGTG GCGTGGTTGA GGTGCGGGCA CGTGGAGTTC CGGCTGGGCT AGGCGAGCCA GTGTTTGATA AACTTGATGC TTTGATTGGT CATGCCATGT TTAGTATTCC CGCAGTCAAA GCAGTCGAAA TTGGCTCAGG CATCGAGGCA GGCAATGCCC GTGGTTCGCA AAATAACGAT CCATTTATCC AGCGAGCAGA TGGTAGCATT GGCACAAGCA GCAACCATGC TGGCGGGATT TTGGGTGGCA TCAGCAGCAG CGAGGAGATT GTGGTGCGCC TGACGGCCAA ACCACCAGCT TCAATCGCCC AAGAACAAAC CACGGTCGAT CAAGCGGGCG AACCTGCCAC AATTGTGGTC AAAGGCCGCC ACGACCCAAC CGTCTTGCCG CGTTTAGTGC CAGTTGCCGA GGCGATGTTG GCCTTGGTGC TGGTCGATTG TGTCTTGCAA CAACGTGCCG CCCGATTGTA G
|
Protein sequence | MPGNSFGRLF RISTWGESHG VGLGVVIDGC PAGLELDLAA IQAQLDRRRV GQSRMTSARR EPDEVEILSG MFEGRTTGTA LAMLIRNTNA RSSDYDAIKH LYRPGHADHS YDAKYGFRDY RGGGRSSARE TAARVAAGAV ARQILATMGI SLVAYTLSLG HLKAQIIDEN EIENNIMRCP DPAVAEQMIA YVDQARRDLD SLGGVVEVRA RGVPAGLGEP VFDKLDALIG HAMFSIPAVK AVEIGSGIEA GNARGSQNND PFIQRADGSI GTSSNHAGGI LGGISSSEEI VVRLTAKPPA SIAQEQTTVD QAGEPATIVV KGRHDPTVLP RLVPVAEAML ALVLVDCVLQ QRAARL
|
| |