Gene Acid345_3684 details

Gene Information

Locus tag: Acid345_3684
Symbol:
ID: 4070434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4358375
End bp: 4359445
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 59%
IMG OID: 637985707
Product: histidinol phosphate aminotransferase
Protein accession: YP_592759
Protein GI: 94970711
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
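
The start, end, gene length and protein length fields above agree with each other, which can be checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4358375
end_bp = 4359445

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1071 356
```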


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.882414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAGG CGCGAGAGAC CGTGCAGTCG CTGCCGACGT ATCATCCGCC GCTCGGTGGG 
CGCGAGGGCC TGCGGCTCGA CTTCAACGAG AATACGGTGG GATGCTCGCC GCGCGTGGCC
GAAAAGCTGC GAGAGATCTC GCGAGACGCT CTGGCACGTT ATCCCGAACG GGGAGCGGTC
GAAGCGACCG TTGCGGAATT TCTGGGGCGA AACGTCGATG AAGTCCTGCT CACGAACGGC
GTGGATGAAG GCATCCACCT GCTGTGCGAG ACCTATCTTG AACCGGGCGA CGAGGTGTTG
ATCGTGGTGC CGACGTTCGC GATGTACGAG ATTTACGCAC GGGCAACGGG AGCAAAGGTC
ATTAGCATTC CGGCGGGCGA GGATTTTGTT TTCCCCACCG ATGCCGTGCT CTCTGCGATT
TCGCCTCGCA CGCGGCTCAT CGCCATCGCA AATCCGAACA ATCCGACGGG CACGGCGGTT
TCGAGAGCTG ACTTGCTGAC GATCGCGGAA GCCGCGCCCC ACGCGGCGTT GCTCGTCGAT
GAAGCGTACT TCGAATTCCA CGATAAGACG ATGGTTGGCG ACATCGCCCA AGTGCCGAAT
CTCTTTATCG CGCGAACGTT TTCGAAGGCC TACGGGCTGG CTGGTCTACG CATAGGAATC
CTTGCTGGAG AAGCGGGACA AATGACAATG GTGCGACGCG TGAGTTCCCC TTACAACGTG
AACGCCGCGG CGCTCGCTTG TTTGCCAGAG GCGCTCGCAG ATTCGGAGTA CGTGTCGCAG
TATGTGCGGG AGAGTGTGAC TAACCGCAGG CGGCTGGAAG AGTTTTTTGC CGCAGAAGGA
ATTCCCTTCT GGCCAAGCCG GGCGAATTTC GTTTTGGCCA GGTTCGACGA GCTTCGCGTG
CCATTTGTAA AAGGAATGCG CGAACGAGGA ATTCTAGTGA GAGATCGCAA CAGCGACTAC
GGATGTGCTG GGTGCGTGCG CGTCACGGCG GGAACCGAGT CACAAATGGA TTTGCTGTTC
GAGGCGATGA AGGACGTTTT GCGCGACTTG CGCCAAGGAC AGGTTCGATG A
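
The GC content field in the Gene Information section is the share of G and C bases in the gene sequence. A minimal sketch for recomputing it follows, assuming the sequence is pasted as text with spaces and line breaks as in the listing above. The helper name gc_content is hypothetical, and only the first row of the sequence is used here for brevity, so the result need not match the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence listing (60 bases).
first_row = "ATGCTTAAGG CGCGAGAGAC CGTGCAGTCG CTGCCGACGT ATCATCCGCC GCTCGGTGGG"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 1071-base listing instead of first_row gives the figure to compare against the GC content field.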
 
Protein sequence
MLKARETVQS LPTYHPPLGG REGLRLDFNE NTVGCSPRVA EKLREISRDA LARYPERGAV 
EATVAEFLGR NVDEVLLTNG VDEGIHLLCE TYLEPGDEVL IVVPTFAMYE IYARATGAKV
ISIPAGEDFV FPTDAVLSAI SPRTRLIAIA NPNNPTGTAV SRADLLTIAE AAPHAALLVD
EAYFEFHDKT MVGDIAQVPN LFIARTFSKA YGLAGLRIGI LAGEAGQMTM VRRVSSPYNV
NAAALACLPE ALADSEYVSQ YVRESVTNRR RLEEFFAAEG IPFWPSRANF VLARFDELRV
PFVKGMRERG ILVRDRNSDY GCAGCVRVTA GTESQMDLLF EAMKDVLRDL RQGQVR
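
The protein sequence follows from the gene sequence by reading it three bases at a time under translation table 11. A minimal sketch follows, assuming the NCBI table 11 amino acid assignments and using only the first 60 bases and the last 12 bases of the gene sequence above. The names translate and CODON_TABLE are hypothetical, and alternative start codons are not handled (the gene here starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGCTTAAGG CGCGAGAGAC CGTGCAGTCG CTGCCGACGT ATCATCCGCC GCTCGGTGGG"
gene_end = "CAGGTTCGAT GA"

print(translate(gene_start))  # prints: MLKARETVQSLPTYHPPLGG
print(translate(gene_end))    # prints: QVR
```

Both outputs match the first 20 and last 3 residues of the protein sequence listed above.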