Gene Acid345_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2541 
Symbol 
ID4072185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3001778 
End bp3002782 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content58% 
IMG OID637984558 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_591616 
Protein GI94969568 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCA AGGTTGGCAT CAATGGATTT GGCCGCATCG GCCGTAACGT CTACCGCGCC 
GCGCTCAACG ACTCGAATAT CGAGATCGTT GCCGTTAACG ACTTAACCGA TCCGAAAACA
CTGGCGCACC TCCTGAAGTA CGACTCCATC CTCGGCAATC TTCATCACAA CATCTCCGCC
ACCGCTGACG CCATCACCGT GGACAGCAAG AGCGTAAAGG TGTTCGCAGA GAAAGATCCC
GCTGCCCTCG ATTGGTCCGC ACTCGGCGTG CAGGTCGTCG TGGAATCGAC CGGCCGTTTC
ACCAAGGCCG CTGACGCCAA GAAGCACCTG AAGGGCTCGG TCAAGAAGGT CATCATCTCC
GCCCCGGCCA CCGATGAAGA CATCACCATC GTCCTCGGCG TCAACCAGGA CAAGTACGAT
GCTTCGAAGC ACCACATCCT TTCCAATGCT TCGTGCACCA CGAACTGCCT GGCTCCGGTC
GTGAAGGTCC TTCATGAAAC CTTCGGTATC GTGAGCGGTT TGATGACCAC GATCCACAGC
TACACCAACG ATCAGGTCAT TCTCGACTTC CCGCACAAGG ACCTCCGTCG CGCCCGCGCT
GCCGCGATCA ACATGATCCC GACCTCTACC GGCGCCGCCA AGGCGTTGCG GCTGGTGATC
CCGGAAATGG CTGGCAAGCT CGATGGTTTC GCGATCCGTG TCCCAACCCC GAACGTCTCG
GTTGTCGATC TGACGTTCAC GGCGGAAAAG CCGATCACGA AGGACGCCAT CAACGAAGCA
GTCAAGAAGG CGGCAGAAGG CGAACTGAAG GGCATCCTGG GTTACGAGAC CGCACCGCTG
GTTTCGAGCG ACTTTAAGGG CAATTCCCTG TCGTCGATCT TCGATGCCGA GCTGACCAAG
GTTCTCGGCA ACACCGCGAA AGTGATCTCG TGGTACGACA ACGAGTGGGG TTATTCCTGC
CGCGTACGCG ACCTGATCAC CTTCCTCGGA AACAAAGGCC TGTAA
 
Protein sequence
MAIKVGINGF GRIGRNVYRA ALNDSNIEIV AVNDLTDPKT LAHLLKYDSI LGNLHHNISA 
TADAITVDSK SVKVFAEKDP AALDWSALGV QVVVESTGRF TKAADAKKHL KGSVKKVIIS
APATDEDITI VLGVNQDKYD ASKHHILSNA SCTTNCLAPV VKVLHETFGI VSGLMTTIHS
YTNDQVILDF PHKDLRRARA AAINMIPTST GAAKALRLVI PEMAGKLDGF AIRVPTPNVS
VVDLTFTAEK PITKDAINEA VKKAAEGELK GILGYETAPL VSSDFKGNSL SSIFDAELTK
VLGNTAKVIS WYDNEWGYSC RVRDLITFLG NKGL