Gene Acid345_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1043 
Symbol 
ID4073130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1308627 
End bp1309973 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content60% 
IMG OID637983050 
Producthydroxypyruvate reductase 
Protein accessionYP_590120 
Protein GI94968072 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2379] Putative glycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.346508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAATG CGCAGGACGC CTCAGGGCCG CTCATTTCGG GGATTTCGGC GCATTTTCGT 
GGCGTCGCGC GTGAGATTTT CCAGCATGCG CTGACCGAAT CCAGCATCCG CAAAGCTTTC
GATCGTCACG TGAGTCTCGA TCGTGGCATT TTGCGCGTGG GCGAAGACCT GTTCGATCTC
GGATCGTTCT CGCGCATCTT CGTGGTCGCG ATGGGGAAGG CAGCGCACAC CATGGTCGAA
GCGCTCATGA CCCACCTGGG TGCGGGAGTG ACCGGCATTG TCGCGTGCTC GACGGATCCG
GTGACGCAAG TCTTCGGGTT CCGCTATTAC CGCGGCGGCC ATCCGATGCC GAACAGCGAC
TCCGTGCGCG CAGCGGAAGC GATCCTGAAA TCCCTTTCTA CCCATGCATC GCGTTCACTC
GTGATCTTTC TGGTCAGCGG CGGGGCATCG GCGATCGTCG AGAAACCTGT GGACGACAGC
ATCACGCTGG AAGATCTCAT CGCGACCTAT AAGGTGCTGG TGCATAGCGG TGCGCCGATT
CGCGAAATTA ATGCCGTCCG CAAGCATCTT TCAGCCACCA AGGGCGGACG ATTGGCGCTG
ATGGCCTCGC CCGCCCAACA GGTCTCGATC CTCGTCAGCG ATGTGCCAGA CGGAACGGTG
GACTCACTGG CGTCGGGGCC GACGATGCCA GATACGACGA CGGTCGAGGA GTGCTACGAC
ATCGTCAAGA AACACAAGAT TCTGAAGCAG TTTCCGGCGT CGGTACGTGA TCTGTTCGAA
CGGGTTGAGT TGGAAGAAAC TCCGAAGCAC GGTGACGGTT CGTTCGACCG CTCGCGCTTC
TGGACGATCC TGTCGAACGA AATTGCTCGC AAGCACGCAG TAGCCAAGGC GGCGATGAAT
GGGTTCGCCA TTGAGGTGGA CAACACCTGC GACGACTGGG ACTACGCGGA GGCGGCGGAC
CATCTGCTGA AGAAGTTGCG GACACTGCGC AAAGGTGTCT CGCGCGTCTG CCTGATTTCC
GGTGGCGAAG TGACGGTGAA AGTCACTGGG GAAGCGGGTG TCGGCGGCCG GAACCAGCAG
TTTGCGCTCT ATTGCGCCAC GAAGATCGCG GACGAGGACA TTACCGTCTT GAGCGCTGGG
ACTGACGGCA TCGATGGCAA TAGTCCAGCG GCGGGCGCCA TTGTCGATGG AACTACTCTT
GCGCGCGCAT CCGCAGTTGG CCTCGATGCG CAAACCGCAT TGCAGACGTT CAATGCCTAC
CCGCTGTTCG ACGCCCTCGG AGACGCGATC GTTACCGGAC CGACCGGAAA CAATATTCGC
GACCTGCGAA TCCTGCTGGC GTACTAG
 
Protein sequence
MGNAQDASGP LISGISAHFR GVAREIFQHA LTESSIRKAF DRHVSLDRGI LRVGEDLFDL 
GSFSRIFVVA MGKAAHTMVE ALMTHLGAGV TGIVACSTDP VTQVFGFRYY RGGHPMPNSD
SVRAAEAILK SLSTHASRSL VIFLVSGGAS AIVEKPVDDS ITLEDLIATY KVLVHSGAPI
REINAVRKHL SATKGGRLAL MASPAQQVSI LVSDVPDGTV DSLASGPTMP DTTTVEECYD
IVKKHKILKQ FPASVRDLFE RVELEETPKH GDGSFDRSRF WTILSNEIAR KHAVAKAAMN
GFAIEVDNTC DDWDYAEAAD HLLKKLRTLR KGVSRVCLIS GGEVTVKVTG EAGVGGRNQQ
FALYCATKIA DEDITVLSAG TDGIDGNSPA AGAIVDGTTL ARASAVGLDA QTALQTFNAY
PLFDALGDAI VTGPTGNNIR DLRILLAY