Gene Cag_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0199 
Symbol 
ID3746686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp227719 
End bp228699 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content49% 
IMG OID637772726 
Productglucokinase, putative 
Protein accessionYP_378520 
Protein GI78188182 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGTT GGGCACTTGG TATAGATTTT GGTGGAACTG CTATTAAAGC GGCTGTTATT 
AGCGAAGGGC AAGGGTTGGT TGAAGATTGC CGTGTGCCAA CGAACAGCTC GGCAGGTCCC
GAAGCTATTT TTTCGCAGCT TGCTGAGTTA ATAGGCGCAA TGTATCACAA AGGGTGTGCT
ACATGTGATG CCGCAAATTT TGCAGGTGTT GGCTTAGGGG CACCCGGTGT GGTAGATGTG
GAGCGTGGTG TTTTAAAATA TCCACCCAAT TTGCATGGAT GGGGCTTGGT GCCATTGCGT
GAGGAGTTGC AGCAGCGTTT GCAGCAAGAG CATGGTTTGC AGGTGCAGAT TCACTTGGAT
AATGATGCGA ATGTTGCGGC GTTTGGCGAA TCGCGTTATG GGGCAGGGCA ACCATTCCCT
AACTTTTTAA TGGTTACGCT TGGCACGGGC GTTGGTGGTG GCATTGTACT TAATCGCTCA
ATTTATCGAG GCAGTTATGG TACGGCAGGC GAGGTTGGCT TTATGATTGT GGATGTTGAT
AGCCCCCATA CGCATGCTGG TATTCACGGA ACGCTTGAGG GGATGTTGGG CAAAAAGTCA
ATTGTAGCAA TGGCTTGTAG CATGATGCAC AACGCGGCAA CCACTTCCAC TATGGGAAAT
TATTGCAATA ACGACTTTTC ACGCCTTTCG CCTCGCCATA TTGAGTATGC TGCGCGCGAA
GGTGATGCGG TGGCGCTTGC CGTGTGGGAG CGTGTTGGGC ATTTACTTGG TTCAGCACTT
GCCAGCGTTA CAGCTTTAAT GGATATTCGT AAATTTGTTA TTGGAGGTGG AATTTCTGGG
GCTGGTTCCT TGATTTTTGA ACCTGCTCGG CAGCAATTAC TCCACTCAAC GCACCCTTCC
ATGCACGAAG GGCTGGAGCT TGTACCAGCA TTTCTTGGCA ATAAAGCAGG AATGTATGGA
GCGGCATCGC TCTGTTTTTA A
 
Protein sequence
MSRWALGIDF GGTAIKAAVI SEGQGLVEDC RVPTNSSAGP EAIFSQLAEL IGAMYHKGCA 
TCDAANFAGV GLGAPGVVDV ERGVLKYPPN LHGWGLVPLR EELQQRLQQE HGLQVQIHLD
NDANVAAFGE SRYGAGQPFP NFLMVTLGTG VGGGIVLNRS IYRGSYGTAG EVGFMIVDVD
SPHTHAGIHG TLEGMLGKKS IVAMACSMMH NAATTSTMGN YCNNDFSRLS PRHIEYAARE
GDAVALAVWE RVGHLLGSAL ASVTALMDIR KFVIGGGISG AGSLIFEPAR QQLLHSTHPS
MHEGLELVPA FLGNKAGMYG AASLCF