Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0053 |
Symbol | |
ID | 5731925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 68112 |
End bp | 69212 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277174 |
Product | aminotransferase class I and II |
Protein accession | YP_001542833 |
Protein GI | 159896586 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGC CAATCTATCT TGAATGGGCC AAACAACAGC CTGCCACTGC GATTTTGAGC AATAGCAGCG TTTACTTGCC CAGTGTGCGA AACGCAATTC AGCAACAGTT AGCTGATGCT GCGTGGTTTG CAGCCGCCCA ACAAGCCAAC CCTTGGGGAT ATCCCAGCCT TCAGCAAGCC TTGCAAGCAT GGCATCGTAG TAGCTATTCG CCGTTGATTG TGGCTGGCGC TTCGAGTGCC TTGGCGGTGG TATGCCAAGC CTTGTTGCAA CCAGGCGATC ATGCTCTGAT CGAAACCCCG CAATATGAGC CATTCAGCCG TTGTGTCGCC GCCCGAGGAG CCACTTGGAG CGCTTGGCCA CGCCATCCGC AGAGCTTTGA ATTGCAATTA GACCAATTGG CCAGCGGCAT CACGCCTCGG ACCAAGCTGT TGTTAATCAG CAACTTGCAT AACCCCAGCA GCACACTGAG CAACCAATCC CAATTGCTTC GCTTGCAACA AACCCTTACT CAAGCCACAG ATACGCTTGG AATTGCTCCG ATCACGATGG TGGTTGATGA GATTTATTGG CATTTGGTAG CACAAGCTGA GTTTCGTTCG GTAGCCGAAC TTGGGCCGCA GTGGATCGGG ATCAACAGCC TGTCGAAGGT GTATGGGCTA AGCATGCTGC GTTGTGGCTG GATTATGGCC GCGCCCCAGC TGCTCGATCA ACTACGCCCA GCTTATCTCG ATTTGATTAA CATTGGTTCG CCGTTGACTG AATATTTGGC GGCGAGCATT ATTGAACAAT TGGCCAATTA TCAAACTGCT GCCCAAGCCC ATGTTGCCGT CAATCGCCAA ATAGTGCAAC GCTATATGCA GCCGTTGCTT GAGCGTGAAT TGATCAACGG TGTAATTCCT GCGGCGGGCT GCACTTATTT TCCCAAAATC ATGCTTGATC AAACCCAAAT TGACCACGTA GCCCAACAAA CTGGCTGTGT GCCTGGGCGG TTTTTCGGCT CGGCCTATCA GCAGCAGCTA CGAATTGGCT TTGGCGGCCC GAGTAATTTA ATTGAAGCAG CCTTGAAACC CTTTACCCAA ACAATCCTAC CACTTCAATA A
|
Protein sequence | MNQPIYLEWA KQQPATAILS NSSVYLPSVR NAIQQQLADA AWFAAAQQAN PWGYPSLQQA LQAWHRSSYS PLIVAGASSA LAVVCQALLQ PGDHALIETP QYEPFSRCVA ARGATWSAWP RHPQSFELQL DQLASGITPR TKLLLISNLH NPSSTLSNQS QLLRLQQTLT QATDTLGIAP ITMVVDEIYW HLVAQAEFRS VAELGPQWIG INSLSKVYGL SMLRCGWIMA APQLLDQLRP AYLDLINIGS PLTEYLAASI IEQLANYQTA AQAHVAVNRQ IVQRYMQPLL ERELINGVIP AAGCTYFPKI MLDQTQIDHV AQQTGCVPGR FFGSAYQQQL RIGFGGPSNL IEAALKPFTQ TILPLQ
|
| |