Gene Acid345_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0003 
Symbol 
ID4070013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2996 
End bp3931 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content62% 
IMG OID637982003 
Productglucokinase 
Protein accessionYP_589082 
Protein GI94967034 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000615827 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.192124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGGG CTGTCGATAT CGGCGGAACA AAGATCGCTG TGGGCGTGGT GGACGCAGAC 
GGCGTGGTGA TTGCGAGCGA CGAATGTCCC ACCGAAGCGA AGCGTGGGTA TGCTGATGCG
CTGAACCGGA TCAGTGCGAT GTTGCGTGCC TGTGCCGAGA AAAGCGGCGA GGTGATCACG
GGGGTTGGAA TCGGCAGCAC CGGCCCAGTC GATCCGCTTA CGGGCGAAAT CGGCAACGCC
GAGTTCATCA AGGAGTGGAT GGGCTGCAAT CCGGTGCGCG ACCTGGCCGA ACGGTTCGGC
GTGAAGGTCG CAATGGAGAA CGACGCGGAT GCCGCTGCTC TTGGCGAGGC AGCATGGGGT
GCTGGCCGCG GTCGCAAGCA CATGATCTTC GTAACCGTGG GAACCGGGAT CGGTGGCGGC
ATTATTCTTG GCGGCAGGCT CTATCGTGGC GCAGATGGTG CGCACCCGGA GATTGGACAC
TACACGATGG ATTCTTCTGG CCCTCTCTGC TTCTGCGGCA TCCATGGTTG CTGGGAGGTA
CTGTGCGCAG GACCGGCGAT GGGCGCGTGG ATGACTTCGC AAGCGCCTGC CGATTGGCCG
CCTGAAGACT TCTCTGCCAA GCGCATTTGC GAACGCGCGC GTGAGGGCGA TCCTATTGCG
AAACGGGGGG TGGAGCGGGA AGCACACTAT CTCGGGCTGG GCGTCGCGAA CCTGATCACG
CTATTTACGC CGGAGGTCAT TGTTCTCGGA GGCAACGTGA TGCGAAGTGC GGATTTGTTC
ATGGAACAGA TCCACGCCGA GGTCCGTCGC TGCTGCACCC AGGTTCCCTA CGAGAAGACG
GATATCCGGC TCGCCTCGCT GGGACCTCAA ACCGGACTGG TCGGCGCCGC GCGGGTTTGG
CATCATCGAT TTCGGCAAGA TGGGGAGGTC GCGTGA
 
Protein sequence
MIGAVDIGGT KIAVGVVDAD GVVIASDECP TEAKRGYADA LNRISAMLRA CAEKSGEVIT 
GVGIGSTGPV DPLTGEIGNA EFIKEWMGCN PVRDLAERFG VKVAMENDAD AAALGEAAWG
AGRGRKHMIF VTVGTGIGGG IILGGRLYRG ADGAHPEIGH YTMDSSGPLC FCGIHGCWEV
LCAGPAMGAW MTSQAPADWP PEDFSAKRIC ERAREGDPIA KRGVEREAHY LGLGVANLIT
LFTPEVIVLG GNVMRSADLF MEQIHAEVRR CCTQVPYEKT DIRLASLGPQ TGLVGAARVW
HHRFRQDGEV A