Gene Rsph17029_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1520 
Symbol 
ID4897834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1590131 
End bp1591453 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID640112110 
ProductBeta-glucosidase 
Protein accessionYP_001043402 
Protein GI126462288 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0705421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTT CCCGCGCCGA CTTCCCCGCC GATTTCCTGT TCGGGGTGGC CACCTCGGCC 
TACCAGATCG AGGGCCACGG CGCGGGGGGC GCAGGACGCA CCCACTGGGA CGATTTCGCC
GCCACCCCCG GCAACGTGGC TCATGGCGAG GATGGCCGCC GCGCCTGCGA CCATTACCAC
CGGTGGGAGG AGGATCTCGA TCTCGTGCGC GATGCGGGCT TCGACAGCTA CCGCTTCTCG
GCCTCCTGGG CGCGGGTAAT GCCCGAGGGC CGCGGCACGG TGAATGCCGA GGGACTCGAC
TTCTACGACC GTCTCGTCGA CGGCATGCTC GCCCGCGGCC TTAAGCCCGC CCTCACGCTC
TACCACTGGG AGCTGCCCTC GGCGCTGCAG GATCTGGGCG GCTGGCGCAA CCGCGACATC
GCAGGCTGGT TCGCCGATTT TGCCGAGGTG CTGCTCGGGC GCATCGGCGA CCGGGTCTGG
TCCACCGCGC CCGTGAACGA GCCCTGGTGC GTGGCCTGGC TGTCGCACTT CCTCGGCCAT
CATGCGCCGG GACTGCGCGA CATCCGCGCC GCGGCCCGCG CTATGCACCA TGTGCTCCTC
GCCCATGGCG CCGCCGTCGA GGCCGCGCGC GGGCTCGGCG TGGGCAATCT CGGCGCGGTC
TGCAACTTCG AACATGCGAT CCCCGCCGAC GGCAGCGAGG CTTCGGCCGC AGCGACCCGC
CGGCACGACG CCCTGATCAA CCGCTGGTTC GTCTCGGCCC TCTTCAACCG CCAGTATCCC
GAGGAGGCTC TGGACGGGAT CGCGCCGCAC CTGCCCAGCG GATGGGAGAA GGACCGCGAC
CGCATCGCCC AGCCGCTCGA CTGGTTCGGT ATCAACTACT ACACCCGCAA GCTGGTGGCG
GCCGCACCCG GCCCCTGGCC GGGCCTGTCC GAGGTGGAGG GCCCCCTGCC GCGCACCCGG
ATCGGCTGGG AAATCCATCC AGAGGGCCTG AGCGACATCC TGCTCCGCAT TCACGAGGGC
TACACCCGCG GTCTGCCGCT CATCGTGACC GAGAACGGCA TGGCCGCCGC CGACCGGGTT
CAGGCGGGCG AGGTGCAGGA CCCCGACCGC ATCGCCTATC TCGAGGGCCA TCTCGCCGCG
GTGCGCACCG CTCTCGCGCA GGGCGTGCCG GTCCGGGGCT ACCATGTCTG GTCGCTTCTC
GACAATTTCG AGTGGGCCTT CGGCTACGAC CAGCGCTTCG GTCTGGTTCA TGTCGACTTC
CAGAACTTGC AGCGCACCCC GAAAGCATCC TATCACGCCC TCGCCCGCGC GCTGGCGCGG
TAA
 
Protein sequence
MTFSRADFPA DFLFGVATSA YQIEGHGAGG AGRTHWDDFA ATPGNVAHGE DGRRACDHYH 
RWEEDLDLVR DAGFDSYRFS ASWARVMPEG RGTVNAEGLD FYDRLVDGML ARGLKPALTL
YHWELPSALQ DLGGWRNRDI AGWFADFAEV LLGRIGDRVW STAPVNEPWC VAWLSHFLGH
HAPGLRDIRA AARAMHHVLL AHGAAVEAAR GLGVGNLGAV CNFEHAIPAD GSEASAAATR
RHDALINRWF VSALFNRQYP EEALDGIAPH LPSGWEKDRD RIAQPLDWFG INYYTRKLVA
AAPGPWPGLS EVEGPLPRTR IGWEIHPEGL SDILLRIHEG YTRGLPLIVT ENGMAAADRV
QAGEVQDPDR IAYLEGHLAA VRTALAQGVP VRGYHVWSLL DNFEWAFGYD QRFGLVHVDF
QNLQRTPKAS YHALARALAR