Gene Acid345_4548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4548 
Symbol 
ID4071493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5391315 
End bp5392610 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content61% 
IMG OID637986588 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_593622 
Protein GI94971574 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.773793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.133207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCA AACTCGATCG TTCCCGCGAA CTCCAGAAAC GTGCTGAAGC CCTTATTCCC 
GGTGGGGTGA ATTCACCCGT GCGTGCGTTC CGCGCCGTGG GAGGCGAACC GCCGATTTTG
GTGCGCGGCG AAGGGGCGCG GGTGTTTGAC GCCGATGGCA ATGGGTATAT CGATTACGTA
TTGTCGTGGG GGCCGTTGAT TTTGGGGCAT GCGTTCACGC CGGTGATCAA TGCGATTGAG
AAGGCGGCGG AGAAGGGCAC TAGCTTTGGG GCGTCGACGC CGACGGAAGC CGATCTCGCG
GAAGCCGTGG TGCATGCGAT GCCGGCGATC GAGAAGATTC GGTTCGTGAG TTCGGGCACC
GAGGCGACGA TGTCGGCGAT TCGTCTGGCG CGCGGATTTA CCGGGCGCAA GTACATTGTG
AAGTTCGAGG GCTGCTACCA CGGGCACAGC GATTCGCTGC TGGTGAAGGC GGGTTCGGGC
GTGGCGACGC TGGGGATCCC GGGATCGGCG GGCGTTCCCG ATGAATTGGC GCAACTGACG
CTGGCGCTGC CTTACAACAA CGTCGCGGCT GTCGAGCAGG CATTCACGAA ATTCAAAGGA
CAGATTGCAT GCGTGATTGT TGAGCCGATT GTGGGCAACA TGGGCTGTGT GCCTCCGGCC
GCTGATTATT TGCAGGCGTT GAGCGACATC ACGAAGCGCG AAGGTGCGGT GCTGATCGTG
GATGAAGTGA TGACGGGATT CCGCGTGGCG TATGGCGGCG CGCAGGAACT GTATGGACTG
AAGCCGGACC TGGTCACGCT GGGGAAGATC ATCGGCGGTG GGTTACCGGT GGCGGCGTAT
GGCGGGCGTA AGGACATCAT GGACAAGATT GCGCCGTTGG GGCCGGTATA CCAGGCGGGA
ACGCTGTCGG GGAATCCACT GGCGATGGCG GCGGGGTTGG CAATGCTCTG CCATCTACGC
GATAACGCGA TGGAGATCTA TCCGCGACTC GATCAACTGA GCGCAGATCT TGTGAATCGT
GTGCTCGATG CGGCGCGGGA AGCAGGTGTC GCGCTGACGG CGAACCGAGT GGGATCGATG
TTTACCTGGT TCTTTACCGA CAAGCATGTG ACCGATTGGG ACTCGGCAGC GACTTGCGAC
ACGAAGCAAT TCGGACAGTT TCACGGCGCG ATGCTGGATG CAGGGGTGTG GCTGCCGCCG
GCGCAATTCG AGGCGGCGTT CTTGTCATCG GCGCACACCG AGCAGGACAT TGACGATACG
GTGGCGGCGG CGAGAGAGGC GTTCGCGATT CTTTAG
 
Protein sequence
MSRKLDRSRE LQKRAEALIP GGVNSPVRAF RAVGGEPPIL VRGEGARVFD ADGNGYIDYV 
LSWGPLILGH AFTPVINAIE KAAEKGTSFG ASTPTEADLA EAVVHAMPAI EKIRFVSSGT
EATMSAIRLA RGFTGRKYIV KFEGCYHGHS DSLLVKAGSG VATLGIPGSA GVPDELAQLT
LALPYNNVAA VEQAFTKFKG QIACVIVEPI VGNMGCVPPA ADYLQALSDI TKREGAVLIV
DEVMTGFRVA YGGAQELYGL KPDLVTLGKI IGGGLPVAAY GGRKDIMDKI APLGPVYQAG
TLSGNPLAMA AGLAMLCHLR DNAMEIYPRL DQLSADLVNR VLDAAREAGV ALTANRVGSM
FTWFFTDKHV TDWDSAATCD TKQFGQFHGA MLDAGVWLPP AQFEAAFLSS AHTEQDIDDT
VAAAREAFAI L