Gene Information |
Locus tag | Acid345_3684 |
Symbol | |
ID | 4070434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4358375 |
End bp | 4359445 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985707 |
Product | histidinol phosphate aminotransferase |
Protein accession | YP_592759 |
Protein GI | 94970711 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
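The coordinate, length, and protein-length fields above can be cross-checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (inferred from the numbers, not stated in the record) and that the CDS ends in a single stop codon. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the record's length fields agree with its coordinates.

    Assumptions (not stated in the record itself):
      - coordinates are 1-based and inclusive
      - the CDS is unspliced and ends in exactly one stop codon
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a complete CDS is a whole number of codons
        return False
    # One codon is the stop codon, so it contributes no amino acid.
    return span // 3 - 1 == protein_length_aa


# Values copied from the Gene Information table.
print(check_cds_lengths(4358375, 4359445, 1071, 356))  # True
```

With these values the span is 1071 bp, or 357 codons, which gives 356 amino acids plus the stop codon.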
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.882414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAAGG CGCGAGAGAC CGTGCAGTCG CTGCCGACGT ATCATCCGCC GCTCGGTGGG CGCGAGGGCC TGCGGCTCGA CTTCAACGAG AATACGGTGG GATGCTCGCC GCGCGTGGCC GAAAAGCTGC GAGAGATCTC GCGAGACGCT CTGGCACGTT ATCCCGAACG GGGAGCGGTC GAAGCGACCG TTGCGGAATT TCTGGGGCGA AACGTCGATG AAGTCCTGCT CACGAACGGC GTGGATGAAG GCATCCACCT GCTGTGCGAG ACCTATCTTG AACCGGGCGA CGAGGTGTTG ATCGTGGTGC CGACGTTCGC GATGTACGAG ATTTACGCAC GGGCAACGGG AGCAAAGGTC ATTAGCATTC CGGCGGGCGA GGATTTTGTT TTCCCCACCG ATGCCGTGCT CTCTGCGATT TCGCCTCGCA CGCGGCTCAT CGCCATCGCA AATCCGAACA ATCCGACGGG CACGGCGGTT TCGAGAGCTG ACTTGCTGAC GATCGCGGAA GCCGCGCCCC ACGCGGCGTT GCTCGTCGAT GAAGCGTACT TCGAATTCCA CGATAAGACG ATGGTTGGCG ACATCGCCCA AGTGCCGAAT CTCTTTATCG CGCGAACGTT TTCGAAGGCC TACGGGCTGG CTGGTCTACG CATAGGAATC CTTGCTGGAG AAGCGGGACA AATGACAATG GTGCGACGCG TGAGTTCCCC TTACAACGTG AACGCCGCGG CGCTCGCTTG TTTGCCAGAG GCGCTCGCAG ATTCGGAGTA CGTGTCGCAG TATGTGCGGG AGAGTGTGAC TAACCGCAGG CGGCTGGAAG AGTTTTTTGC CGCAGAAGGA ATTCCCTTCT GGCCAAGCCG GGCGAATTTC GTTTTGGCCA GGTTCGACGA GCTTCGCGTG CCATTTGTAA AAGGAATGCG CGAACGAGGA ATTCTAGTGA GAGATCGCAA CAGCGACTAC GGATGTGCTG GGTGCGTGCG CGTCACGGCG GGAACCGAGT CACAAATGGA TTTGCTGTTC GAGGCGATGA AGGACGTTTT GCGCGACTTG CGCCAAGGAC AGGTTCGATG A
|
Protein sequence | MLKARETVQS LPTYHPPLGG REGLRLDFNE NTVGCSPRVA EKLREISRDA LARYPERGAV EATVAEFLGR NVDEVLLTNG VDEGIHLLCE TYLEPGDEVL IVVPTFAMYE IYARATGAKV ISIPAGEDFV FPTDAVLSAI SPRTRLIAIA NPNNPTGTAV SRADLLTIAE AAPHAALLVD EAYFEFHDKT MVGDIAQVPN LFIARTFSKA YGLAGLRIGI LAGEAGQMTM VRRVSSPYNV NAAALACLPE ALADSEYVSQ YVRESVTNRR RLEEFFAAEG IPFWPSRANF VLARFDELRV PFVKGMRERG ILVRDRNSDY GCAGCVRVTA GTESQMDLLF EAMKDVLRDL RQGQVR
|
| |
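The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch for turning such a field back into a contiguous string and splitting it into codons is shown below. The helper names `unblock` and `codons` are hypothetical, and the input is just the first three blocks of the gene sequence.

```python
def unblock(seq_field):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_field.split()).upper()


def codons(seq):
    """Split a nucleotide string into complete 3-base codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First three blocks of the gene sequence from the record.
prefix = unblock("ATGCTTAAGG CGCGAGAGAC CGTGCAGTCG")
print(prefix)              # ATGCTTAAGGCGCGAGAGACCGTGCAGTCG
print(codons(prefix)[:3])  # ['ATG', 'CTT', 'AAG']
```

The same `unblock` helper works on the protein sequence field, since it only removes whitespace.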