Gene Clim_2184 details

Gene Information

Locus tag: Clim_2184
Symbol:
ID: 6355978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2422516
End bp: 2423502
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 56%
IMG OID: 642669775
Product: ROK family protein
Protein accession: YP_001944187
Protein GI: 189347658
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
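
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon, neither of which the record states explicitly; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2422516
END_BP = 2423502
GENE_LENGTH_BP = 987
PROTEIN_LENGTH_AA = 328


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 987, matches Gene Length
    print(protein_length(GENE_LENGTH_BP))  # 328, matches Protein Length
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.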


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000268105
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAAT GGGCAATTGG TATTGATCTC GGTGGTACGG CTGTCAAAGC GGCAATCGTG 
AGCCGTAAAA AAGGAATTCT CAAAAACAGG ACGGTACCTA CCGATACCGC TTCCGGCCCG
GAGGGGATTG TATCGCAGCT TGCCGTTATG ATCGCTTCGC TTTACACCGA AGCCTCTGCA
GAGCTTTCCC GTCAAGACTT TTCAGGTATC GGTTTCGGAG CTCCGGGAGC TGTTGATATT
GAAGCCGGAA CGCTGAGCTA TCCGCCCAAT CTTCCCGGAT GGACCACCTT TCCCCTGCGC
AGCGAGCTTG AGCGCGCCCT GCTGGCCAAA CTGCCGAAGT CTGTACCGGT GGTCATCGAG
AACGACGCCA ATGCTGCGGC TTACGGTGAA GCGGTCTATG GCGCCGGCCG TAATTTTCGG
GATTTTTTGA TGGTGACACT CGGCACCGGA GTAGGCGGCG GCATCGTTCT GAACCGTAAA
CTGTACCGGG GGCCGAACGG AACGGCCGGT GAAATAGGAT TTATGATTGT CGATTTTCAG
AGTCCGGCTG TGCATGCCGG TATTCACGGC ACCATAGAAG GGATGATCGG CAAAGAGCGC
ATTGTCGAAT ATGCATGCGG CCTTATTCGT GACAACCCTG AAGCCGGCTC GTTGCTTGCG
TCTCTCTGTG GCCAGGATTT TTCATCGCTC TCTCCCCGTC ATATCGAGCA GGCGGCAAAA
ATGGGCGATC AGCTCTCTCT TGCGGTATGG AACCATGTCG GGGCAATTCT CGGAACGGGG
TTCGCTTGCG TTACCTCGCT CATGGATATA CGAAAATTCG TTATCGGGGG GGGGATATCG
GCAGCCGGCA CTCTTATTTT CGAACCGGCT TACCGGCAGT TGCTCCGCTC TACCCTGCCT
TCGATGCATG ACGGGCTCGA ACTGGTTCCG GCCGAACTCG GCAACAGTGC GGGAATATAT
GGCGCGGCGG CGTTGTGTTT CAGTTGA
 
Protein sequence
MSQWAIGIDL GGTAVKAAIV SRKKGILKNR TVPTDTASGP EGIVSQLAVM IASLYTEASA 
ELSRQDFSGI GFGAPGAVDI EAGTLSYPPN LPGWTTFPLR SELERALLAK LPKSVPVVIE
NDANAAAYGE AVYGAGRNFR DFLMVTLGTG VGGGIVLNRK LYRGPNGTAG EIGFMIVDFQ
SPAVHAGIHG TIEGMIGKER IVEYACGLIR DNPEAGSLLA SLCGQDFSSL SPRHIEQAAK
MGDQLSLAVW NHVGAILGTG FACVTSLMDI RKFVIGGGIS AAGTLIFEPA YRQLLRSTLP
SMHDGLELVP AELGNSAGIY GAAALCFS
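
The sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any computation. The sketch below shows one way to do that, compute a GC percentage, and translate a coding sequence with the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the translation deliberately ignores table 11's alternative start codons, which is only safe for a gene that begins with ATG, as this one does.

```python
from itertools import product

# Amino acids of NCBI translation table 11, with codons ordered over T, C, A, G.
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), TABLE_11_AAS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: alternative start codons are not converted to M.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_30 = "ATGTCCCAAT GGGCAATTGG TATTGATCTC"  # start of the gene sequence
    last_27 = "GGCGCGGCGG CGTTGTGTTT CAGTTGA"      # end of the gene sequence
    print(translate(first_30))  # MSQWAIGIDL
    print(translate(last_27))   # GAAALCFS
```

The two translated fragments match the first ten and last eight residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.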