Gene Acid345_2542 details


Gene Information

Locus tag: Acid345_2542
Symbol:
ID: 4072186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3003111
End bp: 3004322
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 61%
IMG OID: 637984559
Product: phosphoglycerate kinase
Protein accession: YP_591617
Protein GI: 94969569
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0126] 3-phosphoglycerate kinase
TIGRFAM ID:
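
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3003111
END_BP = 3004322
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 3004322 - 3003111 + 1 = 1212, and 1212 / 3 - 1 = 403.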


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAGC TCTCAATTAA AGATCTACAG CTATCCAATA AACGCGTATT CATGCGCGTG 
GATTTCAACG TCCCTCTCGA TGAAAACGGC CGCGTCACCG ACGACACCCG CATCCGCGAA
ACGCTGCCCA CCATCGAATA CGCCCTGCGC CACGGCGCGA AGCTGATCCT CTGCTCGCAC
CTCGGACGCC CGAAGGGCAA ACCCAATCCA AAAATGAGCC TCAAGCCCGT TGCCGAGCGC
CTACGCGTGA TGCTCGACCA CGCGATCAGT CCCGGCCAGA ATGTCGGCTT CTCGCCCGAT
TGCATCGGTA TGCAAGCGCA GGAGATGGCG AAGCAGTTGG AAAAGGGCCA GGCCCTTCTG
CTGGAGAATG TTCGCTTCCA CGCCGAAGAG GAGAAGAACG ATCCGGCCTT CGCGAAGGAA
CTCGCCAGCC TCTGCGAGCT CTACGTGAAC GATGCGTTCG GCTCCGCACA CCGCGCCCAC
GCCTCGACGG AAGGCATTAC GCACTACGTC GAGAAATCGG CTGCGGGCTT GCTGATGCAG
AAGGAACTCG ACTATCTCGG CAAGGCGACC TCGAACCCGG CGAAGCCGTT CGTGGCCATC
CTCGGCGGCG CCAAGGTCAG CGACAAGATC GGCGTCATCC AGAACCTCAT GGCCAAAGTT
GACGCCATCA TCATCGGCGG CGGCATGGCT TACACCTTCC TCAAGGCGCA GGGCCAGGAG
ATCGGTAAGT CCCTCTTCGA GGCCGATAAA CTCGACCTCG CCAAGCAGAT CCTGGCCGAC
GCGCACAAAC GCGGATTGAA GTTCCTGCTG CCCGTCGACC ACGTCACTGC CGACAAGTTC
GACATGCACG CCACCCCCCA TCAGATCGGT GAAGGCCAGT CCATACCAGC CGAGCAGATG
GCGCTGGATA TCGGCCCTAA GACGGTCGCT CTCTTCTCAG AGGAGATCGC GAAGGCGCGC
ACGATCGTGT GGAACGGTCC CATGGGCGTC TTCGAGTTCG ACAACTTCGC CAAGGGCACC
CGTGCCATCG CCAAAGCCGT TGCCGGCAAC AGCGGCGCCA CCTCAATCGT AGGCGGAGGC
GACAGTGTAG CGGCGGTGCA CGATGCCGGC GTTGCCGACA AGATCACCCA CATCTCCACT
GGCGGCGGCG CTTCGCTGGA GTTCCTGGAA GGCAAGAAAC TGCCCGGCGT GGAAGCGCTG
ACCAACAAAT AG
 
Protein sequence
MSKLSIKDLQ LSNKRVFMRV DFNVPLDENG RVTDDTRIRE TLPTIEYALR HGAKLILCSH 
LGRPKGKPNP KMSLKPVAER LRVMLDHAIS PGQNVGFSPD CIGMQAQEMA KQLEKGQALL
LENVRFHAEE EKNDPAFAKE LASLCELYVN DAFGSAHRAH ASTEGITHYV EKSAAGLLMQ
KELDYLGKAT SNPAKPFVAI LGGAKVSDKI GVIQNLMAKV DAIIIGGGMA YTFLKAQGQE
IGKSLFEADK LDLAKQILAD AHKRGLKFLL PVDHVTADKF DMHATPHQIG EGQSIPAEQM
ALDIGPKTVA LFSEEIAKAR TIVWNGPMGV FEFDNFAKGT RAIAKAVAGN SGATSIVGGG
DSVAAVHDAG VADKITHIST GGGASLEFLE GKKLPGVEAL TNK
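
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own tooling: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may serve as starts), and it is applied here to the first and last blocks of the gene sequence only. The names `translate`, `FIRST_BLOCK` and `LAST_BLOCK` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and last 12 bases of the gene sequence, copied from above.
FIRST_BLOCK = "ATGTCGAAGC TCTCAATTAA AGATCTACAG CTATCCAATA AACGCGTATT CATGCGCGTG"
LAST_BLOCK = "ACCAACAAAT AG"

if __name__ == "__main__":
    print(translate(FIRST_BLOCK))  # MSKLSIKDLQLSNKRVFMRV
    print(translate(LAST_BLOCK))   # TNK
```

The two outputs correspond to the first 20 and last 3 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.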