Gene Information |
Locus tag | Haur_3904 |
Symbol | |
ID | 5735765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4894787 |
End bp | 4895950 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281055 |
Product | aminotransferase class I and II |
Protein accession | YP_001546666 |
Protein GI | 159900419 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
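The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_3904.
length = gene_length_bp(4894787, 4895950)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Both results match the Gene Length (1164 bp) and Protein Length (387 aa) fields in the record.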
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATCCCA AATCCGCCAA TCGCTTAGCT GGCTTTGGTA CATCAATTTT TAGTGAAATA AGTGCTTTGG CTGCGCGTTA TCAAGCGATT AATCTCGGCC AAGGCTTTCC TGATTTTGCT GGCCCAGCAT TTTTAAAAGA TGCTGCTTGT AGCGCTATCA ACGCTGATCT CAATCAATAT GCGCCAAGCA CTGGCTTGCC AACCTTGCGA GCGGCGATTG CGCGGACATG GGAACGTCAT AGCAAGGCCT CAGTTGACCC CGACGCTGAA ATTACCGTAA CTAGCGGGGC AACTGAAGCT ATGTTTGCCA TAATTATGGC GTTGATTAAC CCTGGCGATG AGGTTTTGAT TTTCGAGCCG TTCTATGATT CGTATCCGCC GAATGTGCTG ATGGCGGGAG GCATACCACG TTATATTCGC TTGCACGAGC CACGCTGGGA TGTGGATTTT GCTCAAGTTC GTGCCGCAAT TACTCCCCAA ACCAAGGCGA TTATTTTGAA CACGCCGCAT AATCCCACAG GCAAGGTTTG GTCGCGGGCC GAATTGAGCC AATTAGCAAC GATTGCAATC GAGCATGATC TCTTGGTGAT CAGCGATGAA GTTTATGATC GTTTGGTGTT TGAGGATTAT CAGCATTGCT CGATTGCCAC CTTGCCCGGC ATGTGGGATC GTACAATCAC CATCAGTAGC ACTGGCAAAA CCTTTAGCGT CACAGGCTGG AAAATTGGTT ATGCAATTGC CCCCAATTCA TTAACTGAGG CAATTCGGCG GGTGCATCAA TTTGTGACCT TTGCCAGTGC CACGCCCTTG CAAGCAGCAG CGGTGGTTGG TTTAAACGCT GGCGAACCCT ATGAACGCCA ACTATTGCAA TTTTATAATG CCCGCCGCGA GCAATTGGTG AAGGTCTTGC GCGATGCTGG ATTGTATGTG TTGCCGCCGC AAGGCACCTA TTTCGTAATG GCTGATATTC GTGATTTGGG CTGGGAGAAT GATGCAGAAT TTTGCCGTTA TCTGATTAGC GAAATTGGCG TGGCGGCAAT TCCACCCTCA GCGTTTTACC ACGATGGCTA TCAATCAGGG ATGGTACGCT TTTGCTTTGC CAAAAAGCCC GAAACAATTG CTGCCGCCGC TGAAAAACTC AAGCAATTAG GGAGTCGAAG CTAA
|
Protein sequence | MHPKSANRLA GFGTSIFSEI SALAARYQAI NLGQGFPDFA GPAFLKDAAC SAINADLNQY APSTGLPTLR AAIARTWERH SKASVDPDAE ITVTSGATEA MFAIIMALIN PGDEVLIFEP FYDSYPPNVL MAGGIPRYIR LHEPRWDVDF AQVRAAITPQ TKAIILNTPH NPTGKVWSRA ELSQLATIAI EHDLLVISDE VYDRLVFEDY QHCSIATLPG MWDRTITISS TGKTFSVTGW KIGYAIAPNS LTEAIRRVHQ FVTFASATPL QAAAVVGLNA GEPYERQLLQ FYNARREQLV KVLRDAGLYV LPPQGTYFVM ADIRDLGWEN DAEFCRYLIS EIGVAAIPPS AFYHDGYQSG MVRFCFAKKP ETIAAAAEKL KQLGSRS
|
| |
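The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step plus a check of the first and last codons. The helper names are hypothetical, and only short excerpts from the two ends of the gene sequence are used here rather than the full 1164 bp.

```python
def ungroup(seq: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(seq.split()).upper()


def first_and_last_codon(seq: str) -> tuple[str, str]:
    """Return the first three and last three bases of an ungrouped sequence."""
    s = ungroup(seq)
    return s[:3], s[-3:]


# Excerpts copied from the two ends of the gene sequence in the record.
head = "GTGCATCCCA AATCCGCCAA"
tail = "GGAGTCGAAG CTAA"

print(ungroup(head))                             # GTGCATCCCAAATCCGCCAA
print(first_and_last_codon(head + " " + tail))   # ('GTG', 'TAA')
```

The gene begins with GTG, which serves as an alternative start codon under translation table 11 and is read as methionine when it initiates translation. This is why the protein sequence begins with M. The final codon, TAA, is a stop codon and has no counterpart in the protein sequence.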