Gene Cpha266_2578 details

Gene Information

Locus tag: Cpha266_2578
Symbol:
ID: 4569093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2953213
End bp: 2954340
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 54%
IMG OID: 639767143
Product: glycoside hydrolase family 3 protein
Protein accession: YP_912990
Protein GI: 119358346
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
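
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2953213
end_bp = 2954340
reported_gene_length = 1128     # bp
reported_protein_length = 375   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1128 0 375
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # prints: True True
```

Under these assumptions the reported gene length (1128 bp) and protein length (375 aa) agree with the coordinates.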


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAAAGAA ACCGATTCCC GCTCTCACTG CTGATCATGC TCTTGATGCT TCAGACCCCG 
GCTGCCTGGG CGGCAAAAGA GCCTGACAGT CTGGGAATCA AAATCGGCCA GATGATTATG
ACCGGATTCA GAGGTTGCTC TCTTGCGGAA TCGCCGCAAA TTGCATCGGA TATCCGGCGG
CAACGAATAG GCGGGGTCGT ACTCTTCGAC TACGACGTTC CATCCCGCTC GCCCATCCGT
AACATCACAA CGCCCTCCCG GCTCATGAAA CTGACCAGAG AGCTTCAGGG AATAACGGAA
ATTCCGCTCC TTATCGCCAT CGACCAGGAG GGAGGGCGGG TAAACCGCCT CAAACCCGCT
CTCGGCTTTC CCCCGTCGCT CTCGGCCGCC CGGCTCGGAA AACTCGACAA TACCGACAGC
ACAACCGCAG AGGCAGCCAA AACAGCGGAA ACGCTGAAAA CCATGCACCT GTCGATGAAC
CTCGCCCCGG TCGTCGATCT CAACAGCAAC AAAGAGAACC CTGTCATCGG CAAACTTCAG
AGAAGTTTTT CCGACGACCC GGACGTCGTC ACAAGAAACG CCCGGGCCAC CTGCAACGCA
TTCCGCGAAA AAGGAATCAT TGCGACCCTC AAACACTTTC CCGGCCACGG CAGCTCAACC
ACCGATACCC ACAAAGGATT TACCGACATT ACCGGCACCT GGCGCGAAAA CGAGCTCCAG
CCATACCGTC AGCTCATAGC CGGAGGGTAC AACGACGCCA TCATGACCGC ACACGTCTAC
AACGCAACGA TCGACAGCCT CTACCCCGCA ACGCTCTCAA AAAAAACACT CAAAGGAATC
CTTCGTGAAA AACTCGGCTT CAGAGGGGTA ATCATCACCG ACGACATGCA GATGAAAGCG
ATTGCCGACC ATTACGGACT CGAAGAGGCT CTCCGTCTTG CCATCGAAGC CGATGCCGAC
ATCCTGCTGT TCGGCAACAA CACAACCTTT GACCCCGACA TCGCCAGAAA AGCCATTGCC
ATCATCAGAA CGATGGTCAG TAAAAAAATC ATCACCACGG ACCGAATCGA CCGCTCCTAT
CGAAGAATCA TGACGCTCAA AGAACGATAC CTCTTTCAAT GCAAATGA
 
Protein sequence
MKRNRFPLSL LIMLLMLQTP AAWAAKEPDS LGIKIGQMIM TGFRGCSLAE SPQIASDIRR 
QRIGGVVLFD YDVPSRSPIR NITTPSRLMK LTRELQGITE IPLLIAIDQE GGRVNRLKPA
LGFPPSLSAA RLGKLDNTDS TTAEAAKTAE TLKTMHLSMN LAPVVDLNSN KENPVIGKLQ
RSFSDDPDVV TRNARATCNA FREKGIIATL KHFPGHGSST TDTHKGFTDI TGTWRENELQ
PYRQLIAGGY NDAIMTAHVY NATIDSLYPA TLSKKTLKGI LREKLGFRGV IITDDMQMKA
IADHYGLEEA LRLAIEADAD ILLFGNNTTF DPDIARKAIA IIRTMVSKKI ITTDRIDRSY
RRIMTLKERY LFQCK
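
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record. It assumes the sequence is pasted in the same 10-base grouped layout shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function and constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAGAA ACCGATTCCC GCTCTCACTG"))  # prints: MKRNRFPLSL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is a simple way to check the two blocks against each other.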