Gene Cpha266_2499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2499 
Symbol 
ID4568552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2865580 
End bp2867349 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content51% 
IMG OID639767059 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_912911 
Protein GI119358267 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.964917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA CACCGATACG CACGATTGCC ATCGCCATCC TCTTCCTCAT AGCCGCAGGA 
TCGATTTCTG CCGACGCCTT TGGAAAATCA AAGTCGAAAT CAAAGCCATC ATTCAGCAAA
TGGCAGGCCC AGTCTATTTT TTCCTTCAGC GACTCGAAAA CAGAAAAAAC GCTGAAACAA
ATGACCCTTT CGGAAAAAAT CGGTCAGATG ATTATTGCCC AGACCGAAGC GCGATCCGGA
ATTACCACTG ACAGAGCAAC TCAACAGCTC GGCAGACTGG TACAGGAAGG CAAAGTCGGG
GGCATAATGT TTATGAAAGG CGACGCCTTC AGCGCTGCAC TGCTCTCTAA CTACTTTCAG
TCACTGACCG CGCGCCCGCT GCTCATGAGC GCCGATATGG AACGAGGACT TGCCATGAGG
CTCAGTGGGG CAACCGAATT CCCCCCCAAT ATGGCTCTTG CCGCCACAAA AGAGACCAAA
TTTGCCTTTG AAATGGCAAA AGCTATTGCA AAAGAGGCCC GGATTGTCGG GATACACCAG
AACTATGCCC CAACCGTTGA CCTGAACATC AATCCGGCCA ACCCCATCAT CAACACCCGC
TCCTTCAGCG ACAACCCTGC ACTGGCCATT GCCATGTCAA ACGCCGTTAT CGAAGGTCTC
CAATCAAACG GAATTGCCGC AACGGCAAAA CACTTTCCGG GTCATGGCGA CGTCACCGTT
GACAGTCACC TTTCGCTGCC AGTGCTGAAT GCCGACAGAG CCCGCCTCGA CGCCTATGAA
CTCCAGCCGT TCAAAGCGGC TATCGACCAG GGAATCATCA GTATCATGAC CGGTCATCTT
GCCGTACCGA AACTCACCGG CACCATGGAA CCGGCATCAA TTTCAAAAAC CATTGTTACC
GATCTTCTTC GCAAAGATCT GGGCTTCACG GGATTGATCA TTACCGATGC CATGAACATG
AAAGCGCTCT ACAACGGAAA CAACGTTGCC GAAATATCAG TAAAAGCCGT TCAGGCCGGC
AACGACCTGC TTCTGTTCTC CCCCGATCCC GAACTGGCTC ACAACGCGAT TCTTAATGCC
GTCGAAAACG GAGTAATCCC GAGAGAAAAT ATCGACGCCT CTGTCCGACG AATTCTGCAA
CTCAAGCATT GGCTGGAAAT AGAACACAGA AAACTCGTTG ACCTCAATTC CGTTATGGAC
AACATAAGTC CATCTGCGCA CCGCGACCTT GCCGAAAAAA TCACCCGGAA CTCCATAACG
ATTGCTCAGA ATGCCAATAA CGTTATTCCT TTGAAAATCG GATCTTCCTC AGGCAACATC
CTGAGCATTA TCCTGCAAGA CAAATCAAAC AGCGAAACCG GCAAACACTA TATCGATGAA
ATCAACCGAT ACTATCCTGC CTCCCATCTG AGAATAGACC CGAAAAGCGA TGACCAGACC
TTTGCTGCCG CTCTTGAATT AGCCTCGAAA GCACCGGCAG TTGTTATATC CTCTTACGTA
CAGGTCTTCT CCGGTTCCGG AACCCTGAAG CTCACCCTGA AACAACAGGA ATTCATCCAC
AAACTCGCAC AATCGCTTCC CGCAGGCAAA CCGCTGATCT TCATCTCTTT CGGCACGCCC
TATCTGATCA ATGCCTTTCC TGAAATACAT GCCCACCTGT GCGCATACGC AGCAAACGAA
ACAAGTGAAA CCTATGCAGT CAAGGCACTA CGGGGAGAGC TTAGCCCAAC GGGAACGCTG
CCGGTATCGC TGCAGAGAAA CAGCCGATAA
 
Protein sequence
MNKTPIRTIA IAILFLIAAG SISADAFGKS KSKSKPSFSK WQAQSIFSFS DSKTEKTLKQ 
MTLSEKIGQM IIAQTEARSG ITTDRATQQL GRLVQEGKVG GIMFMKGDAF SAALLSNYFQ
SLTARPLLMS ADMERGLAMR LSGATEFPPN MALAATKETK FAFEMAKAIA KEARIVGIHQ
NYAPTVDLNI NPANPIINTR SFSDNPALAI AMSNAVIEGL QSNGIAATAK HFPGHGDVTV
DSHLSLPVLN ADRARLDAYE LQPFKAAIDQ GIISIMTGHL AVPKLTGTME PASISKTIVT
DLLRKDLGFT GLIITDAMNM KALYNGNNVA EISVKAVQAG NDLLLFSPDP ELAHNAILNA
VENGVIPREN IDASVRRILQ LKHWLEIEHR KLVDLNSVMD NISPSAHRDL AEKITRNSIT
IAQNANNVIP LKIGSSSGNI LSIILQDKSN SETGKHYIDE INRYYPASHL RIDPKSDDQT
FAAALELASK APAVVISSYV QVFSGSGTLK LTLKQQEFIH KLAQSLPAGK PLIFISFGTP
YLINAFPEIH AHLCAYAANE TSETYAVKAL RGELSPTGTL PVSLQRNSR