Gene Rsph17029_1976 details


Gene Information

Locus tag: Rsph17029_1976
Symbol:
ID: 4895564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2090876
End bp: 2091946
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 72%
IMG OID: 640112570
Product: cellulase
Protein accession: YP_001043852
Protein GI: 126462738
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
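
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2090876
end_bp = 2091946
STOP_CODON_COUNT = 1  # assumption: one terminal stop codon

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - STOP_CODON_COUNT

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1071 0 356
```

Under those assumptions the result agrees with the listed gene length of 1071 bp and protein length of 356 aa.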


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.65444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.379222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA 
CGCCTCTTCG CGCAGTCGGG CGGAGAGGCC TTGTCTGCGG ACCACCCGCT CCAGGCGGCC
TGGCGCAGCT GGAAGGATGC GTTTCTGCTG CCCGCCGGCC GCATCGTCGA CGGGCCGCAG
CAGAATGCGA GCCATTCCGA AGGGCAGGGC TACGGCGCCA CGCTCGCCGC GATCTTCGGC
GACGAGGAGG CCCTGCGGCG CATCGTCGAC TGGACCGAGG CGAACCTTGC GCGGCGCGAG
GACAAGCTTC TGAGCTGGCG CTGGCTGCCC GGTGTGGCGC TGGCCGTGCC CGACGAGAAC
AACGCCACCG ACGGCGATCT CTTCTACGCC TGGGGTCTCG CCATGGCCGC GCAGCGGTTC
GGCAAAGCCG ATTACGCCGG GCGGGCGACC GAACTGGCGC GCGCCATCGC GCTGCATTGC
GTGCGTCCGC ATCCGGACGG CTCCGAGCAG CTCGTGCTGC TGCCGGGGGC CAGCGGCTTC
GAGACGCCGG ACGGGGTGGT GCTCAACCCC TCCTACTACA TGCCCCGCGC CCTGACCGAG
CTCGCCGCCT TCAGCGGCCA GGACCGGCTG GCGCGCTGTG CCCGTGACGG GGCCGACTGG
ATCGCGTCGC TCGGGCTTCC GCCGGACTGG GCGCTGGTGA CGCCCTTCGG CACGCAGCCG
GCGCCGGGTC TGTCCCATAA CAGCGGCTAC GATGCGCTGC GGGTGCCCCT GTTCCTGCTC
TGGTCCGGGC TGACCGCCAA TCCCGCGCTG CGCCGCGCGG TGGAGGCGGC CGGGGACGCC
GCAGCCGGCG ACACGCCGGT GAGGTTCGAC CGCGACACGG GGGCGGTGCT GGAACGGTCC
GCCGATCCGG GCTTCCGCGC CGTGCTCGCG CTTGGCGATT GCGCCCTTTC GGGTCGTCCG
GGGGCGGCGA TCCCGCCCTT CGACGCGCGC CAACCCTACT ATCCCGCGAC GCTGCATCTG
ATGGCGCTCG TGGCACAAGT GGAAGGTTTC TCCGCATGCG TTCCGATCTG A
 
Protein sequence
MRRRTILTSA AAALMLAPAG RLFAQSGGEA LSADHPLQAA WRSWKDAFLL PAGRIVDGPQ 
QNASHSEGQG YGATLAAIFG DEEALRRIVD WTEANLARRE DKLLSWRWLP GVALAVPDEN
NATDGDLFYA WGLAMAAQRF GKADYAGRAT ELARAIALHC VRPHPDGSEQ LVLLPGASGF
ETPDGVVLNP SYYMPRALTE LAAFSGQDRL ARCARDGADW IASLGLPPDW ALVTPFGTQP
APGLSHNSGY DALRVPLFLL WSGLTANPAL RRAVEAAGDA AAGDTPVRFD RDTGAVLERS
ADPGFRAVLA LGDCALSGRP GAAIPPFDAR QPYYPATLHL MALVAQVEGF SACVPI
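
The protein sequence can be compared against the gene sequence by removing the 10-character grouping and translating codon by codon. The following is a minimal sketch, not the method used to produce this record. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code, and it is shown on the first row of the gene sequence only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = clean_sequence(
    "ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA"
)
print(translate(first_row))  # prints: MRRRTILTSAAAALMLAPAG
```

The printed residues match the first two groups of the protein sequence (MRRRTILTSA AAALMLAPAG). Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.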