Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0117 |
Symbol | |
ID | 5732010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 151209 |
End bp | 152342 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641277239 |
Product | aminotransferase class I and II |
Protein accession | YP_001542897 |
Protein GI | 159896650 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00105897 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTTTTG ATGATACTGA TTTAACGCCC AACCGGATTG AAATGGCTCG TCGCGCTGCG ACGGGTCATT TTATTGATCT GACCAGTAGT AACCCAACGC AGCAAGATTT GCTTTTTCCG CCCGATGTGC TGGAGCAAGC CGCCCAAAAA TATTGGTCAC AACGGTGCTA CGAGCCAAAT CCACGCGGTT TGGAGCCAAC TCGGCAGGCA ATTATTGATT ATTATGCTCA GCGCAGGCCA GCTTTGGCGC TAACGCTTGA TGATATTTTT ATTACTGCCA GCACCAGCGA GGCCTATAGT TTGTTGTTCT CGTTGCTCAC CGCACCAGGC GATAATATTC TTGGGCCAAA TGTCACCTAT CCATTGTTTG AATATTTGGC CGATTTGCAT CACGTTGAGT TGCGTACCTA CGAGCTTGAT CCTGCTAACA ATTGGGTGAT CGATCAGGCT TCGTTGCTGG CCGCTGCTGA TCAGAACACA CGGGCGATTT TATTAATTTC ACCGCATAAT CCAACTGGGG CAATTATTAG CGAGCCAATC GCCGCATTAA ATCAACTAGG AATTCCATTG ATTTGCGATG AAGTGTTTGC GCCGTTTGCG TTGGCCAAAT CGCATGTGCC AGCATTAGGC GGGCTGCATC CCGACGTACC CGTATTTCAA TTGAATGGCA TTTCTAAGCT CTTAGCCTTG CCCGACCTCA AACTAGGCTG GATTGCGCTG AATCAAGCGG CTCAAGGCTA TGCAGAGCGT TTAGAACTGA TCAACGATAC CTTTTTGAGT TGTAGCACAT TAATTCAGAC GATGCTGCCC GATTTGTTGC ATGCTGCGCC GCCGTTTATC GATCAGATGC TTGAGCGAGT GCGAGCGAAT ATTGCCTATG CCCGTGAGCA TTTAGCCCAG CACCCACGCT TGATTTGGAG CGAGCCAGAT GGTGGCTATT ATTTATTTTT GCAAGTACGC GATGAATTCG ATGATGAAGC CTTGGTCGTG CGTTTGATCG AACAAGGGGT GTTGGTGCAT CCAGGCTTCT TTTTCGATTG GATCGATGAT TGTCGGATTA TGCTCTCGGC CTTGACCGAA CCGCACCAAT GGCAAGCAGG TATCCAAAAA TTGGCTCAGA TTCTTACAAT TTGA
|
Protein sequence | MVFDDTDLTP NRIEMARRAA TGHFIDLTSS NPTQQDLLFP PDVLEQAAQK YWSQRCYEPN PRGLEPTRQA IIDYYAQRRP ALALTLDDIF ITASTSEAYS LLFSLLTAPG DNILGPNVTY PLFEYLADLH HVELRTYELD PANNWVIDQA SLLAAADQNT RAILLISPHN PTGAIISEPI AALNQLGIPL ICDEVFAPFA LAKSHVPALG GLHPDVPVFQ LNGISKLLAL PDLKLGWIAL NQAAQGYAER LELINDTFLS CSTLIQTMLP DLLHAAPPFI DQMLERVRAN IAYAREHLAQ HPRLIWSEPD GGYYLFLQVR DEFDDEALVV RLIEQGVLVH PGFFFDWIDD CRIMLSALTE PHQWQAGIQK LAQILTI
|
| |