Gene Acid345_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3864 
Symbol 
ID4071016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4575351 
End bp4576475 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content61% 
IMG OID637985888 
Productaminotransferase 
Protein accessionYP_592938 
Protein GI94970890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.502444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACG ACGAAATGGT ACCGGCGCAC ATTCGCGCGC TCGGTCCGTA TGTGCCGGGA 
AAACCGCTCC GGCAGGCTGA AGAAGAAACT GGAATCAAAT GCACCAAGAT GGCCTCGAAC
GAGAACCCGT TCGGGCCATC GCCGCGTGCG CTGGAAGCGA TGGAGCACGC GCTGCGCGAA
GTTCACCTCT ACCCTGAGAA CGGCGCGCCG GAACTGGCGA AGAAGATCGC CGAGAACGAA
GGCGTAACGC CAGACCACAT CCTGGTGACC GGCGGCTCCA CGCCGTGCCT CGACATGGTA
GGCCGCACGC TGCTCGCGCC GGGCAGCAAC GCCATCACCA GTGAGCGTTC GTTCATCGTG
TATCCGATCG TGACGCGCGC GGCTGGCGCG ACGCTCAAGC TGGTTCCAAC CAAGGACAAC
GGCTTCGATC TCGATGGCGT ACTGAACGCG ATTGATACAG ACACGCGTGT AATTTTCCTC
GCGAATCCGA ACAATCCCAC GGGCACGATG TTCACCGCGC AGGAACTCGA TGCATTTCTC
GACAAGGTTC CCGACCACGT AGTGGCCGTG ATCGACGAGG CGTATTACGA CTTCGCGAAA
TATCTCGCGG AACGGCGCGG CGTGGAGTAC TCGCACTCGG TCCGCTACGT GAATGAAGGC
CGCAAGGTCA TCGTGATGCG GACTTTCTCG AAGACGCATG GGCTCGCAGG TGTGCGCGTG
GGCTTCGGGA TCGGCGACCC GGTCCTGATG AACTACATGG CGCGGGTGCG CACGGCGTTC
CAGACCACCA CCGTCGGCGA AGCCGGGGCA CTGGCCGCGT TGCATGACGA CGACCACTTA
CAGCGCACGG TGGTCAACAA CGCAAAGGGC GCGGAATTCC TCGAGCACGG ACTGCGTGAA
CTCGGAGTGC ACGTTACGCC GACCTGGGCG AACTTCGTCT TCTTCGAAGT GGATGACAAC
GCACCGGGAA TTTCGCAAGC GCTAGAGCAT TCGGGAGTGA TCGTGCGTCC GCTGAAGGGC
TGGGGCATTC CGCAGGGTTT GCGGGTTACG ATCGGAAAGC CGGAGCAGAA CCAGAATTTC
CTGAGTGCCT TGCGAAGAGT GTTGGAGAAG GAGCCGGTGC GGTAG
 
Protein sequence
MKYDEMVPAH IRALGPYVPG KPLRQAEEET GIKCTKMASN ENPFGPSPRA LEAMEHALRE 
VHLYPENGAP ELAKKIAENE GVTPDHILVT GGSTPCLDMV GRTLLAPGSN AITSERSFIV
YPIVTRAAGA TLKLVPTKDN GFDLDGVLNA IDTDTRVIFL ANPNNPTGTM FTAQELDAFL
DKVPDHVVAV IDEAYYDFAK YLAERRGVEY SHSVRYVNEG RKVIVMRTFS KTHGLAGVRV
GFGIGDPVLM NYMARVRTAF QTTTVGEAGA LAALHDDDHL QRTVVNNAKG AEFLEHGLRE
LGVHVTPTWA NFVFFEVDDN APGISQALEH SGVIVRPLKG WGIPQGLRVT IGKPEQNQNF
LSALRRVLEK EPVR