Gene Acid345_3865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3865 
Symbol 
ID4071017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4576617 
End bp4577660 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content63% 
IMG OID637985889 
Productglucokinase 
Protein accessionYP_592939 
Protein GI94970891 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTACG CAATCGGTGT GGACCTCGGC GGTACCAATC TGCGAATCGC AGCCGTCGAA 
GAACGCGGCA CCCTCCTCGA AAAAGTCACG CTTGGCACGC AGGTACAGCG CGGCCGCGAA
TATGTTGTCG GTCAAATGAC CGATGCCATC CGCCACGTCA CTACCAAGTA CCAGGACCAC
GGCAAGCTGA TTGGCATCGG CATCGGTGTC CCGGGCTTCA TTGATATGGA TACCGGCACC
GTGCGGGAAT CGCCGAACCT ACCCGGCTGG TCGAACTATC CCGTGCATAA GGACATCGAG
AGCCGGCTCG GAACCAAGGT CATTCTTGAG AACGACGCCA ACGCCGCCGC GATGGGCGAG
AAGTGGCTCG GCGCCGGCCG CGACACCGAC GACATGGTGA TGTACACGCT CGGCACCGGC
GTAGGTGGTG GAATTATCAT GGCCGGCCGC TTGTGGCACG GGATGAACGG CATGGCCGGG
GAGCTTGGCC ACCATACCGT TTTGCCCGAC GGCCATATCT GCGGCTGCGG CAACCACGGC
TGCCTCGAAC AATATGCCTC GGCGACGGCC GTCGTGCGCA TGGCGCGCGA AGCTGTCGCC
AACGGCCTGT CCGACGCGCT CGCCAATGCC TCACGCAACG ACGTAGAGTT CAGTTCGAAG
GTGATTTACC AGCTCGCCAT CCAGGGTGAC AAGGCCGCGC AGGAGATTTT CAACACCGTC
GGCCACTCGA TCGGCATCGC CGTGGCCAAC ATGGTCAACG CGCTGAATTT CCCGATGTAC
GTGATCGGCG GCGGCGTTGC CAGCGCCTGG GACGCCTTCC ACAATCCGAT GATGGAAGAA
GTACGCAAGA GATCGTTCAT CTATCGCGTC ACCGCGCCGG AAGCGGTCGC TGCCGGCCAG
AAACGCACCA TCGTGACCCG CGCTTTGCTC GGCGGCGATG CCGGTCTGTT CGGCGCCGCC
CGCCTGCCGA TGGTCGTCAA CGGCGAGTCG TCCGCACCCG CCGCGCAATC CAAGGCTGAT
ACACCGGTAG CCGGCACTCG CTAA
 
Protein sequence
MSYAIGVDLG GTNLRIAAVE ERGTLLEKVT LGTQVQRGRE YVVGQMTDAI RHVTTKYQDH 
GKLIGIGIGV PGFIDMDTGT VRESPNLPGW SNYPVHKDIE SRLGTKVILE NDANAAAMGE
KWLGAGRDTD DMVMYTLGTG VGGGIIMAGR LWHGMNGMAG ELGHHTVLPD GHICGCGNHG
CLEQYASATA VVRMAREAVA NGLSDALANA SRNDVEFSSK VIYQLAIQGD KAAQEIFNTV
GHSIGIAVAN MVNALNFPMY VIGGGVASAW DAFHNPMMEE VRKRSFIYRV TAPEAVAAGQ
KRTIVTRALL GGDAGLFGAA RLPMVVNGES SAPAAQSKAD TPVAGTR