Gene Acid345_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1458 
Symbol 
ID4069608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1760589 
End bp1761803 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content59% 
IMG OID637983467 
Productaminotransferase 
Protein accessionYP_590534 
Protein GI94968486 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGT TGACCAAAGC CACGCCCATT CAAGCCGCAG ACCGCATGGT GAATGTTCGC 
TATGCGATTC GCGATCTTGC CGTTCTTGCC GACCAGGTTG CCTCGCAGGG AAACAAGATC
CTCTACTGCA ACATCGGCGA CCCCTGCAAG TACGACTTCC CTGTCCCCGT GCACATCATG
GAAGCCGCCA TCAAAGCGAT GCGCGACGGC TACAACGGAT ACGGCGAGTC GCTCGGCATC
AAACCGGCGG TCGAAGCGAT CCGAAATGAA GCTGAACGCG ATGGTTTCAA GAACATCCAA
GGTGTGTTCG TTGGCCTCGG CAGCGGTGAA GCCATCGACT CCTGCCTGAC GGCGTTGCTC
AATCCCGGCG AGAATTTTCT CGCGCCCAGC CCCGAATATC CGCTCTACGG CGCCATCACC
GCCAAGCTTG GTGCTGAACC CAACGCCTAT TTCCTCGATG AATCGAACGA CTGGCAGCCC
GACGTGGAAG ACCTCGAGCG CCGCATCAAC GCCAAGACGC GCGCACTGCT GATCATTAAT
CCCAACAACC CGACCGGCGC CGTGTACTCG CGCGAAACGT TGGAGAAAAT TGCCGACGTG
GCGCGCCGCC ACAATTTGCT TCTGATCTCC GACGAGATCT ATAACAAGCT CGTCTTCGAC
CCCAGCGCGA AACACATCTC CATCGCTACG CTCGCGCCGG ATGTTCCGTG TATTACCTTC
AACGGCTTGT CGAAGGCATA TCTCGTGCCC GGATGGCGTA TCGGGTGGGG CGTCGGCACC
GGACCGGCGG AACTGATCAA GCCCTTCCTC GAGAACATCT ACAAGTTGCT TCGTGCGCGT
CTCTCCGCGC CGCATCCGTA CCAGTACGCC GTGAAGGCTG CGCTCGAAGG CCCGCAGGAC
CATCTCAAGT GGGTGAATGA AAAACTCGCA GCGCGCGCCA AGGTCACAAA AGACTGGGCC
GCCAGCGAAC CGCGCGTCAA CCTGGTCGCG CCGAAGGGCG CGTTCTACGC CTTCCCGTCG
CTCGACATCC CGGAAGATGA CCTAACCTTT GTCAGCGAAC TCCTGATACA AAAACACGTC
CTGCTCGTTC ACGGCAGCGG CTTCGGCCAG AAACCCGGCA CACACCACTG CCGTATTGTC
ACGCTGCCGC AGGAAGCAGT GCTGACCAAC GCCTACGCGA AGGTCAGCGA GTTCTTGAAA
GAGCGATACC AGTAG
 
Protein sequence
MSTLTKATPI QAADRMVNVR YAIRDLAVLA DQVASQGNKI LYCNIGDPCK YDFPVPVHIM 
EAAIKAMRDG YNGYGESLGI KPAVEAIRNE AERDGFKNIQ GVFVGLGSGE AIDSCLTALL
NPGENFLAPS PEYPLYGAIT AKLGAEPNAY FLDESNDWQP DVEDLERRIN AKTRALLIIN
PNNPTGAVYS RETLEKIADV ARRHNLLLIS DEIYNKLVFD PSAKHISIAT LAPDVPCITF
NGLSKAYLVP GWRIGWGVGT GPAELIKPFL ENIYKLLRAR LSAPHPYQYA VKAALEGPQD
HLKWVNEKLA ARAKVTKDWA ASEPRVNLVA PKGAFYAFPS LDIPEDDLTF VSELLIQKHV
LLVHGSGFGQ KPGTHHCRIV TLPQEAVLTN AYAKVSEFLK ERYQ