Gene Clim_0158 details


Gene Information

Locus tag: Clim_0158
Symbol:
ID: 6356128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 175599
End bp: 177347
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 51%
IMG OID: 642667785
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_001942236
Protein GI: 189345707
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
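
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 175599
END_BP = 177347
GENE_LENGTH_BP = 1749
PROTEIN_LENGTH_AA = 582


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


# Both reported lengths agree with the coordinates.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assertion failed for another record, the likely causes would be a different coordinate convention, a spliced gene, or a partial CDS.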


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCATGA AAAAAATCTC TCTCATAACG ATAGCACTGG TCGGCTTTGC TGCGCTGCTG 
ATCGCAATTT CCGATCCGGC TACTGCAAAA TCAAAGCCAT CACAGAAAAA ATGGCAGGCA
CAATATATCT TCAGTCGTAA GGATTCAGAC GTAGAAAAAG AACTGCAACG GATGACACTG
GACGAAAAAA TCGGCCAGAT GATCATCGCA CAGATAGAAA GCCGGAACAA CAGCGGCGGA
GACCCGAATT ACCTGCAGCT CCAGCGTTTG GTACAGCAGG GCAAGGTTGG CGGAATCATG
TTCATGAAAG GCGACGCATT CAATGCCGCT ATGATGGCCA ATAACTTTCA GTCGCTTGCC
TCACGCCCTC TTTTAATGAG TGCCGACATG GAGAGAGGAC TTGCCATGCG CCTTTCCGGG
GCTACGGAAT TCCCTCCGAA CATGGCGCTG GCAGCAACGG GGGACCCCGA ACTTGCCTTT
GCAATGGCAA AAGCCATTGC GGGTGAAGCC CGAACGATCG GTATTCACCA AAGTTATTCT
CCAAACGTTG ACCTGAACAT CAATCCGGCC AACCCCGTCA TCAATACCCG CTCATTCGGC
GACAATGTAC CGCTTGCCAT AACCATGAGC AATGCCATGA TCGAAGGATT TCAGACCAAT
AACGTCATTG CAACAGCAAA ACACTTTCCG GGACACGGCG ATGTCACCGT TGACAGCCAT
CTCGCCCTGC CGGTACTCAA TGCAGACAGG CAGCGCCTCT TTGCCTATGA GCTCAGACCA
TTCAAAGCCG CAATCGACCA GGGGATCATC AGCATCATGA CCGGTCACCT TGCAGTTCCC
AGACTGACAG GCTCCATGGA ACCGGCTTCG GTTTCAAAAG CAATCGTCAC CGGACTGCTC
AGAAACGAAC TGGGGTTTCA GGGACTCATC ATCACTGACG CCATGAACAT GAAAGCGCTT
TATAACGGCA ATAACGTGCC GGAAATGTCC GTTAAAGCAG TTCAGGCAGG CAACGACCTT
CTGCTCTTTT CTCCGGCGCC TGAACTTGCA CATGCCGCAA TAATCCGGGC TGTGCAGGAA
GGCGCGATCC CGATGAATCA GATTGACGCA TCGGTCAAGC GAATTCTTCA GGTTAAAAAG
TGGCTCCAGC TCGAAGAAAA AAAACTTGTC GATCTCAACC GGGTACAAAG CCAGATAAGC
ACCAACTCCC ACCGCAAGCT TGCCGCCGAG ATATCTTCCC GATCGCTGAC GGTAGTGAGA
CAGGAGCCTC GCCATCTGCC TCTGAAAAAC GGCAGAGTGC TCAATATCAT CCTTCAGGAC
AAGAGCAATC CGGAACCAGG GCGGGAATAT GCGGAAAAAC TGAACCGCTC GTTTCCATCA
ACAACGGTAC GAATCGATCC AAAAACAGAT CCCCAAACCT ATGCCTCGAC ACTTGCCGCC
GCAGGAACGG CAGATGCCGT GATCATTTCC TCCTATGTGC AGGTTTTTTC CGGATCGGGA
ACGCTCAGAC TTACCGGTCA GCAGCAGCAG TTTATCCATT CGCTTGCGCA ATCGATTCCG
GAACATAAAC CGCTTATATT CATTTCTTTC GGTACGCCGT ATCTGATAAG CGCCTTTCCT
GAAATCAAAA CCGCCGTTTG CACCTACTCA TCAAACAGGG AGAGTGAAGA TTATGCCTTA
CAGCTGCTCA GAGGTGAACT CAAACCCGTG GGACATCTGC CCGTATCGCT CCATGGCATT
ACTCCCTGA
 
Protein sequence
MFMKKISLIT IALVGFAALL IAISDPATAK SKPSQKKWQA QYIFSRKDSD VEKELQRMTL 
DEKIGQMIIA QIESRNNSGG DPNYLQLQRL VQQGKVGGIM FMKGDAFNAA MMANNFQSLA
SRPLLMSADM ERGLAMRLSG ATEFPPNMAL AATGDPELAF AMAKAIAGEA RTIGIHQSYS
PNVDLNINPA NPVINTRSFG DNVPLAITMS NAMIEGFQTN NVIATAKHFP GHGDVTVDSH
LALPVLNADR QRLFAYELRP FKAAIDQGII SIMTGHLAVP RLTGSMEPAS VSKAIVTGLL
RNELGFQGLI ITDAMNMKAL YNGNNVPEMS VKAVQAGNDL LLFSPAPELA HAAIIRAVQE
GAIPMNQIDA SVKRILQVKK WLQLEEKKLV DLNRVQSQIS TNSHRKLAAE ISSRSLTVVR
QEPRHLPLKN GRVLNIILQD KSNPEPGREY AEKLNRSFPS TTVRIDPKTD PQTYASTLAA
AGTADAVIIS SYVQVFSGSG TLRLTGQQQQ FIHSLAQSIP EHKPLIFISF GTPYLISAFP
EIKTAVCTYS SNRESEDYAL QLLRGELKPV GHLPVSLHGI TP
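
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship on the first and last lines of the gene sequence. It assumes the amino acid assignments of table 11 match the standard genetic code (the tables differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
# Codon table laid out in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGTTCATGA AAAAAATCTC TCTCATAACG ATAGCACTGG TCGGCTTTGC TGCGCTGCTG"
last_line = "ACTCCCTGA"

print(translate(first_line))  # MFMKKISLITIALVGFAALL
print(translate(last_line))   # TP
```

Both outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way should reproduce the whole protein, under the stated assumptions.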