Gene Acid345_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2243 
Symbol 
ID4072988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2663907 
End bp2665106 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID637984259 
Productaminotransferase 
Protein accessionYP_591318 
Protein GI94969270 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGC TTATGACCAA AACTATGACA GAGCTGCCGC TCGCGCGCCG CATGGCCCGA 
CTCGGCACAG AAACCGCGTT CGAAGTCCTG GTCCGCGCCC GCGCGCTGGA AGCCAAGGGT
AAGGACATTG TCCACCTTGA AATTGGCGAG CCCGATTTCG ATACTCCGCG CAACGTCGTA
GACGCTGGCA TAGACGCGCT CAACAAGGGT TTCACCCACT ACGGGCCGTC GGCCGGACTG
CCGATTCTTC GGCAGACGAT CGCGGAAGAA GTCTCGAAGA CGCGGGGCGT GAAGGTCACG
CCGGAAGAAG TCGTCGTCGT CCCGGGCGGC AAGCCGATCA TCTTCTTCAC CATGCTCGCG
CTCGCCGACG AAGGCGACGA GATCATCTAC CCGAATCCCG GGTTCCCGAT CTACGAGAGC
ATGATCAATT TTGTGGGCGC GAAAGCGGTG CCGATCCCGC TTCGCGAAGA GCGCGACTTC
CGTCTCGACG TCAATGAGCT CAAAGATCTC ATCACCGATC GCACCAAGAT GATCATCATC
AATTCGCCGC AGAACCCCAC GGGCGGCGTT TTGACGAAGC AGGACATCGC CGATATCGCC
GACGCCATTG GCGACCGCGA CATCATGGTG ATGAGCGACG AGATCTACAG CCGCCTGATT
TTCGAAGGCC AGCACCACTC CATCATGTCG CTGCCGGGCA TGCAGGAGCG TACCATCCTG
CTCGACGGCT TCTCCAAGAC GTGGGCGATG ACGGGCTGGC GCATGGGCTT CGGCGTAATG
CGCGCGGACC TTGCAGCGCA CATCGCGCGC TTGGCGACGA ACTCCGTCTC CTGCACCGCG
AGCTTCTCGC AAATGGCCGG CGTGGAAGCG ATCAAAGGCG ATCAGTCGTC GGTGGCGGCG
ATGAACAAAG CTTTCCTCGA GCGCCGCGAT TTCTTCATCA AGCGCGTAAA CCAGATTAAA
GGCTTCTCGT GCCGTAGCCC GCATGGCGCG TTCTACACGT TCCCGAACAT CACCAAGACC
GGCTGGCCTT CGAAAAAACT GGCCGACGCT CTCCTTGATG AAGCTGGTGT CGCCGGGCTT
TCGGGCACGG CCTTCGGATC CTTCGGTGAA GGGTATCTGC GTTTCTCAAT CGCGAATTCG
ATCGAGAACA TTGCCAAGGC GCTGGACCGC ATCGACGAGT GGACGAAACA TCGGCTCTAG
 
Protein sequence
MSLLMTKTMT ELPLARRMAR LGTETAFEVL VRARALEAKG KDIVHLEIGE PDFDTPRNVV 
DAGIDALNKG FTHYGPSAGL PILRQTIAEE VSKTRGVKVT PEEVVVVPGG KPIIFFTMLA
LADEGDEIIY PNPGFPIYES MINFVGAKAV PIPLREERDF RLDVNELKDL ITDRTKMIII
NSPQNPTGGV LTKQDIADIA DAIGDRDIMV MSDEIYSRLI FEGQHHSIMS LPGMQERTIL
LDGFSKTWAM TGWRMGFGVM RADLAAHIAR LATNSVSCTA SFSQMAGVEA IKGDQSSVAA
MNKAFLERRD FFIKRVNQIK GFSCRSPHGA FYTFPNITKT GWPSKKLADA LLDEAGVAGL
SGTAFGSFGE GYLRFSIANS IENIAKALDR IDEWTKHRL