Gene Acid345_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3031 
Symbol 
ID4071938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3596922 
End bp3598085 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content59% 
IMG OID637985050 
Productputative aminotransferase 
Protein accessionYP_592106 
Protein GI94970058 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAGA TCGATCGTCG TTCTTTCCTT CGCACGGCTT CGTTCGCCGG AGCAGCTTTG 
GTCGCGAGCA CGGAAGCGCA GTTTGCGTTC GCGCAACGAC GGAGCATGAA AGCCGTTGGT
CCCATCGCCG TGTTTCTGAA TGCGAATGAG AATCCGTTAG GTCCGTGCGA GGCGGCGCGC
GCCGCGATGA CCACAGCGAT TGCGGAGAGC GGACGCTATC ACGACGAGTA CGCGGAGGAG
TTCGTGAGTT TGTTTGCATC CCAGGGTGGA CTGAAAACGG AGTACGTAGA TCCATACTCA
GGATCGAGCC AACCGCTTTC GTACTGTGTT TCGGCATTCT GTAATGCAGA GAAACCACTC
GTGATTGCGG ACCCCGGCTA TGAAGCTGCG GCGCGGACAG CAGATGCCAT GGGCGTGAAG
GTCTTGAAAG TGCCGCTCGC GAAAGACTAC TCGCACGATG TGCGGGCGAT GGTGGCGGCT
TCGCCGAATG CCGGAGTGTA CTATGTCGCG TCGCCGAACA ATCCCACTGG GTCGGTGACG
AGCCGAGCGG ATGTCGAGTG GCTGCTCGCG AACAAGCCCA AGGGCTCGAT CGTAGTGCTC
GATGAAGCCT ACATCCACTT TTCCGATGCG ACGCCTTGCC TCGACCTTGC AGCAGCGGAC
AAAGACATCG TGGTGCTGCG CACCTTCAGC AAGCTTTATG GAATGGCCGG AGCGCGGTTA
GGTGCGGCTG TAGCGCGTCC CGACCTGCTC AAAAAAGTGC GCGAACAAGG TGGATGGACA
ATGTGTCCGG TGACCGCGAC GCAGGCAGGG ATCGCAAGCT TGAAAGATGC GTCGCTGGTT
GCGAAGCGGA AGACGATCAA CGCCGATGCG CGGCAGGAGA CCTTTGACTG GCTGACTTCG
AAGGGATTCG CATATGTGCC ATCGCAGGCG AACCACTTCA TGCTCGATGC GAAGCGTCCG
ACGCAGACGG TGATCGCCGG CATGGCAGGT AAGGGAATCC TGATCGGACG CGCCTGGCCG
ATATGGCAGA CACATGTCCG CGTTACCGTG GGCACGCCGC AAGAGATGGT GCGGTTTCGG
AGCGCTTTCG AGGATGTAAT GCGACAGCCG CAATCGTCGA GCTACGCGCC AGTGCGGCAC
TCGAACGCGA CGCTATTCAG CTAG
 
Protein sequence
MIQIDRRSFL RTASFAGAAL VASTEAQFAF AQRRSMKAVG PIAVFLNANE NPLGPCEAAR 
AAMTTAIAES GRYHDEYAEE FVSLFASQGG LKTEYVDPYS GSSQPLSYCV SAFCNAEKPL
VIADPGYEAA ARTADAMGVK VLKVPLAKDY SHDVRAMVAA SPNAGVYYVA SPNNPTGSVT
SRADVEWLLA NKPKGSIVVL DEAYIHFSDA TPCLDLAAAD KDIVVLRTFS KLYGMAGARL
GAAVARPDLL KKVREQGGWT MCPVTATQAG IASLKDASLV AKRKTINADA RQETFDWLTS
KGFAYVPSQA NHFMLDAKRP TQTVIAGMAG KGILIGRAWP IWQTHVRVTV GTPQEMVRFR
SAFEDVMRQP QSSSYAPVRH SNATLFS