Gene Acid345_0282 details


Gene Information

Locus tag: Acid345_0282
Symbol:
ID: 4068826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 294926
End bp: 295933
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 64%
IMG OID: 637982283
Product: glucokinase
Protein accession: YP_589361
Protein GI: 94967313
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
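
The start, end, gene length, and protein length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(294926, 295933)
print(length)                  # 1008
print(protein_length(length))  # 335
```

Both values match the Gene Length (1008 bp) and Protein Length (335 aa) fields listed for Acid345_0282.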


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGCCA GGGGAGAGGC GTTCCTGGGA GTCGACATCG GCGGCACGAA GGTCGCCGCC 
GGACTGGTGA ACGATAACGG CGAGCTTCTC TACAAGACTC GCAATCCGAT GAATTGCTCG
CGTGGAGCGG ACGAAGCCGT CAATGCGGTG CGGGAGGCGA TTGACCGGAC TATCCGCGAA
AATCCCGAAG CTGAAGTGCG CGCGATTGGA TTGAGTTCAC CTGGCTCGGT GGACCCGCGC
ACCGGCACCG TGGTAATGGC GACCAACCTT CCCTGCTGGA AAAATTTTGG GCTCGCCGAG
ATTATCGCGA AACAGTACGG ACTTCCGACC GAACTGCACA ACGATGCCAA CGCCGCCGGA
CTTGCGGAAG CGGTTTGGGG CAACGGCGTG GGGTACGACT CCGTCTTTTA CGCGACGGTG
GGGACCGGAA TCGGCACGGC GATCTTGTTC GATCGCCAGG TTTATCTCGG ACGCACCGGC
TCGGCAGGCG AAGGCGGCCA CATGAGCATC AACTTCGATC ATCGCGGCCC ACGCTGCGCA
TGCGGCAAGC CCGGATGCAT CGAGTACCTC GCGGCGGGGC CGGGGATCGC GACCCGCGCG
CGGCGGAGAA TCGAGTCGGC CTCGGGCAAT GAAGGCGCGA AGCTCATCGA ACTCGCGGGC
GGGGATGTTT CGAAGATCAC CGGCGAGACC GTGGAAGCCG CGTGGAAAGC GGGCGATCGG
CTGGCGACCG AAGTGTTCGA AGAGACTGCC GATTACATCG CTATCTGGCT GGGCAACATT
GTGGACTTCC TCGAACCCGA TGTGATCGTG ATGGGCGGCG GCGTGGGCAA CATGCTCTCG
CCATGGTATC CGCGGATCCG CGAGTACCTG CGCTCGTGGT CGGTGAATCC GCGCGCGGGC
GAGATCCCGT TCGTGCAGGC GAAGTACGGG CCGGATTCGG GCATCGTTGG CGCGGCTGCG
CTGGTGGTGC ATCCGGGGCA GTACATCATG CACGCGCCTA CGCACTGA
 
Protein sequence
MGARGEAFLG VDIGGTKVAA GLVNDNGELL YKTRNPMNCS RGADEAVNAV REAIDRTIRE 
NPEAEVRAIG LSSPGSVDPR TGTVVMATNL PCWKNFGLAE IIAKQYGLPT ELHNDANAAG
LAEAVWGNGV GYDSVFYATV GTGIGTAILF DRQVYLGRTG SAGEGGHMSI NFDHRGPRCA
CGKPGCIEYL AAGPGIATRA RRRIESASGN EGAKLIELAG GDVSKITGET VEAAWKAGDR
LATEVFEETA DYIAIWLGNI VDFLEPDVIV MGGGVGNMLS PWYPRIREYL RSWSVNPRAG
EIPFVQAKYG PDSGIVGAAA LVVHPGQYIM HAPTH
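
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation such as a GC percentage. The following is a minimal Python sketch, assuming the listing contains only bases and whitespace. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted listing into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGGGAGCCA GGGGAGAGGC GTTCCTGGGA GTCGACATCG GCGGCACGAA GGTCGCCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two helpers to the full gene sequence would give a value that can be compared against the GC content field in the Gene Information table.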