Gene RSP_2874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2874 
Symbol 
ID3720614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1543355 
End bp1544677 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID640071060 
Productputative Beta-glucosidase A 
Protein accessionYP_352936 
Protein GI77463432 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.355603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTT CCCGCGCCGA CTTCCCCGCC GATTTCCTGT TCGGGGTGGC CACCTCGGCC 
TACCAGATCG AGGGCCACGG CGCGGGGGGC GCAGGACGCA CCCACTGGGA CGATTTCGCC
GCCGCCCCCG GCAACGTGGC TCATGCCGAG GATGGCCGCC GCGCCTGCGA CCATTACCAC
CGGTGGGAGG AGGATCTCGA TTTCGTGCGC GATGCGGGCT TCGACAGCTA CCGCTTCTCG
GCCTCCTGGG CGCGGGTGAT GCCCGAGGGC CGCGGCACGG TGAATGCCGA GGGGCTCGAC
TTCTACGACC GTCTCGTCGA CGGCATGCTC GCCCGCGGCC TGAAGCCCGC CCTCACGCTC
TACCACTGGG AACTGCCCTC GGCGCTGCAG GATCTGGGCG GCTGGCGCAA CCGCGACATC
GCAGGCTGGT TCGCCGATTA TGCCGAGGTA CTGCTCGGGC GCATTGGCGA CCGGGTCTGG
TCCACCGCGC CCGTGAACGA GCCCTGGTGC GTGGCCTGGC TGTCGCACTT CCTCGGCCAT
CATGCGCCGG GACTGCGCGA CATCCGCGCC GCGGCCCGGG CGATGCATCA TGTGCTCCTC
GCCCATGGCG CCGCCGTCGA GAGCGCGCGC GGGCTCGGCG TGGGCAATCT CGGCGCGGTC
TGCAACTTCG AACATGCGAT CCCCGCCGAC GGCAGCGAGG CTTCGGCCGC AGCGACCCGC
CGGCACGACG CCCTGATCAA CCGCTGGTTC GTCTCGGCCC TCTTCAACCG CCAGTATCCC
GAGGAGGCTC TGGACGGGAT CGCGCCGCAC CTGCCCAGCG GATGGGAGAA GGACCTCGAC
CGCATCGCCC AGCCGCTCGA CTGGTTCGGG ATCAACTACT ACACCCGCAA GCTGGTGGCG
GCCGCACCCG GCCCCTGGCC GGGCCTGTCC GAGGTGGAGG GCCCCCTGCC GCGCACCCGG
ATGGGCTGGG AGATCCATCC AGAGGGCCTG AGCGACATCC TGCTCCGCAT TCACGAGGGC
TACACCCGCG GTCTGCCGCT CATCGTGACC GAGAACGGCA TGGCCGCCGC CGACCGGGTG
CAGGCGGGCG AGGTGCAGGA TCCCGACCGC ATCGCCTATC TCGAGGGCCA TCTCGCCGCG
GTGCAGAGGG CCATCGCGCA GGGCGTGCCG GTCCGGGGCT ACCATGTCTG GTCGCTTCTC
GACAATTTCG AGTGGGCCTT CGGCTACGAC CAGCGCTTCG GTCTGGTTCA TGTCGACTTC
CAGAACTTGC AGCGCACCCC GAAAGCATCC TATCACGCCC TGGCCCGCGC GCTGGCGCGG
TAA
 
Protein sequence
MTFSRADFPA DFLFGVATSA YQIEGHGAGG AGRTHWDDFA AAPGNVAHAE DGRRACDHYH 
RWEEDLDFVR DAGFDSYRFS ASWARVMPEG RGTVNAEGLD FYDRLVDGML ARGLKPALTL
YHWELPSALQ DLGGWRNRDI AGWFADYAEV LLGRIGDRVW STAPVNEPWC VAWLSHFLGH
HAPGLRDIRA AARAMHHVLL AHGAAVESAR GLGVGNLGAV CNFEHAIPAD GSEASAAATR
RHDALINRWF VSALFNRQYP EEALDGIAPH LPSGWEKDLD RIAQPLDWFG INYYTRKLVA
AAPGPWPGLS EVEGPLPRTR MGWEIHPEGL SDILLRIHEG YTRGLPLIVT ENGMAAADRV
QAGEVQDPDR IAYLEGHLAA VQRAIAQGVP VRGYHVWSLL DNFEWAFGYD QRFGLVHVDF
QNLQRTPKAS YHALARALAR