Gene Rsph17025_0964 details

Gene Information

Locus tag: Rsph17025_0964
Symbol:
ID: 5085031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 989694
End bp: 990764
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 70%
IMG OID: 640482521
Product: cellulase
Protein accession: YP_001167170
Protein GI: 146277011
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
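
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 989694
end_bp = 990764

# Assumption: coordinates are 1-based and inclusive,
# so both end points belong to the gene.
gene_length_bp = end_bp - start_bp + 1

# One codon is three bases. Assumption: the final codon is a stop
# codon, which does not add an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp)     # 1071
print(remainder)          # 0
print(protein_length_aa)  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.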


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.585824
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGTC GCACCCTTCT CCGGCTTGCC GCCGCCACGG CCCTTGCCTC TCCGGCGGGG 
CGGCTCCTGG CTCAGGAGGG CGGTGTGCTG CCGTCCGATC ATCCGCTTCA GACCGCCTGG
GAGAGCTGGA AGGCGGCCTT CCTGCTGCCG GCCGGCCGGA TCGTCGATGG TCCGCAACAG
AATGCCAGCC ACTCCGAGGG GCAGGGCTAT GGTGCGGCGC TGGCGGCCAT CTTCGGCGAC
GAGGCGGCGC TGCGCCGCAT CGTGGACTGG ACCGAGACGA ATCTCGCCCG CCGTGACGAC
AATCTGCTGA GCTGGCGCTG GCTGCCGGGC GTGCCGCTCG CGGTGCCCGA CGAGAACAAC
GCCACCGACG GCGACCTGTT CTACGGCTGG GGTCTTGCGC TTGCCGCCCA GCGGTTCGGC
AACGCCGACC TTGCCAAACG CGCAACCGAG ATCGCCCGCG CCATTGCGCT GCACTGTGTC
CGTCCGCACC CGGATGGCTC CGAGCGGCTG GTGCTGCTGC CGGGCGCCAC AGGGTTCGAG
ACCGAGGAGG GGTTGGTGCT CAACCCGTCC TACTACATGC CGCGCGCCAT GACCGAGCTT
GCCGCCTTCA GCGGACAGGA GCGGCTGGCC CGTTGCGCGC AGGATGGCGC CCTCTGGATT
GGCGGGCTCG GTCTCGCGCC GGACTGGGTG CTGGTGACGT CCACGGGGGA TCTGCCGGCC
AAGGGCCTGT CGGCGCACAG CGGCTATGAT GCGATGCGCG TGCCGCTCTT CCTGCTCTGG
TCCGGCCTTA CGGCGAACCC CGCCCTTCGC CGCTTCATCG AGGTGCAGCG CGAGGCGGAA
CCCGGAACCG GGACCCCCGT CGTCTTCGAC CGCGACACCG GCGCCCTGCT TGAGAGGTCG
GCGGATCCGG GTTTTGCCTC GGTGCCCGCT CTGGCGGACT GTGCGCTGTC CGGGCGGCCC
GGGGCGGCCA TCCCGCCGTT TGACGCGCGG CAGCCCTACT ATCCCGCGAC GCTGCATCTG
ATGACGCTCG TCGCACAAGT GGAAGGTTTT TCCGCATGCG CTCCGATCTG A
 
Protein sequence
MRRRTLLRLA AATALASPAG RLLAQEGGVL PSDHPLQTAW ESWKAAFLLP AGRIVDGPQQ 
NASHSEGQGY GAALAAIFGD EAALRRIVDW TETNLARRDD NLLSWRWLPG VPLAVPDENN
ATDGDLFYGW GLALAAQRFG NADLAKRATE IARAIALHCV RPHPDGSERL VLLPGATGFE
TEEGLVLNPS YYMPRAMTEL AAFSGQERLA RCAQDGALWI GGLGLAPDWV LVTSTGDLPA
KGLSAHSGYD AMRVPLFLLW SGLTANPALR RFIEVQREAE PGTGTPVVFD RDTGALLERS
ADPGFASVPA LADCALSGRP GAAIPPFDAR QPYYPATLHL MTLVAQVEGF SACAPI
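
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of how a GC content figure like the one in this record could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input, so the result is not expected to equal the whole-gene value of 70%.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, used only as a short example.
first_line = "ATGAGACGTC GCACCCTTCT CCGGCTTGCC GCCGCCACGG CCCTTGCCTC TCCGGCGGGG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence through the same two functions would give the value to compare against the GC content field.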