Gene Information |
Locus tag | Acid345_3864 |
Symbol | |
ID | 4071016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4575351 |
End bp | 4576475 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985888 |
Product | aminotransferase |
Protein accession | YP_592938 |
Protein GI | 94970890 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
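The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4575351
END_BP = 4576475


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Encoded residues, assuming the last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 1125
print(protein_length_aa(gene_len))  # 374
```

Both values agree with the Gene Length (1125 bp) and Protein Length (374 aa) fields of the record.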
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.502444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACG ACGAAATGGT ACCGGCGCAC ATTCGCGCGC TCGGTCCGTA TGTGCCGGGA AAACCGCTCC GGCAGGCTGA AGAAGAAACT GGAATCAAAT GCACCAAGAT GGCCTCGAAC GAGAACCCGT TCGGGCCATC GCCGCGTGCG CTGGAAGCGA TGGAGCACGC GCTGCGCGAA GTTCACCTCT ACCCTGAGAA CGGCGCGCCG GAACTGGCGA AGAAGATCGC CGAGAACGAA GGCGTAACGC CAGACCACAT CCTGGTGACC GGCGGCTCCA CGCCGTGCCT CGACATGGTA GGCCGCACGC TGCTCGCGCC GGGCAGCAAC GCCATCACCA GTGAGCGTTC GTTCATCGTG TATCCGATCG TGACGCGCGC GGCTGGCGCG ACGCTCAAGC TGGTTCCAAC CAAGGACAAC GGCTTCGATC TCGATGGCGT ACTGAACGCG ATTGATACAG ACACGCGTGT AATTTTCCTC GCGAATCCGA ACAATCCCAC GGGCACGATG TTCACCGCGC AGGAACTCGA TGCATTTCTC GACAAGGTTC CCGACCACGT AGTGGCCGTG ATCGACGAGG CGTATTACGA CTTCGCGAAA TATCTCGCGG AACGGCGCGG CGTGGAGTAC TCGCACTCGG TCCGCTACGT GAATGAAGGC CGCAAGGTCA TCGTGATGCG GACTTTCTCG AAGACGCATG GGCTCGCAGG TGTGCGCGTG GGCTTCGGGA TCGGCGACCC GGTCCTGATG AACTACATGG CGCGGGTGCG CACGGCGTTC CAGACCACCA CCGTCGGCGA AGCCGGGGCA CTGGCCGCGT TGCATGACGA CGACCACTTA CAGCGCACGG TGGTCAACAA CGCAAAGGGC GCGGAATTCC TCGAGCACGG ACTGCGTGAA CTCGGAGTGC ACGTTACGCC GACCTGGGCG AACTTCGTCT TCTTCGAAGT GGATGACAAC GCACCGGGAA TTTCGCAAGC GCTAGAGCAT TCGGGAGTGA TCGTGCGTCC GCTGAAGGGC TGGGGCATTC CGCAGGGTTT GCGGGTTACG ATCGGAAAGC CGGAGCAGAA CCAGAATTTC CTGAGTGCCT TGCGAAGAGT GTTGGAGAAG GAGCCGGTGC GGTAG
|
Protein sequence | MKYDEMVPAH IRALGPYVPG KPLRQAEEET GIKCTKMASN ENPFGPSPRA LEAMEHALRE VHLYPENGAP ELAKKIAENE GVTPDHILVT GGSTPCLDMV GRTLLAPGSN AITSERSFIV YPIVTRAAGA TLKLVPTKDN GFDLDGVLNA IDTDTRVIFL ANPNNPTGTM FTAQELDAFL DKVPDHVVAV IDEAYYDFAK YLAERRGVEY SHSVRYVNEG RKVIVMRTFS KTHGLAGVRV GFGIGDPVLM NYMARVRTAF QTTTVGEAGA LAALHDDDHL QRTVVNNAKG AEFLEHGLRE LGVHVTPTWA NFVFFEVDDN APGISQALEH SGVIVRPLKG WGIPQGLRVT IGKPEQNQNF LSALRRVLEK EPVR
|
| |
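As a rough consistency check between the two sequences, the sketch below translates the first three 10-nt blocks and the last 15 nt of the gene sequence and compares them with the ends of the protein sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the tables differ in their start codons, not in these assignments). The last 15 nt are in frame because 1125 - 15 = 1110 is divisible by 3. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Blocks copied from the gene sequence in the record above.
first_blocks = "ATGAAGTACG ACGAAATGGT ACCGGCGCAC"
last_blocks = "GAGCCGGTGC GGTAG"

print(translate(first_blocks))  # MKYDEMVPAH
print(translate(last_blocks))   # EPVR
```

The two results match the first block (MKYDEMVPAH) and the final four residues (EPVR) of the protein sequence.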