Gene RSP_0331 details

Gene Information

Locus tag: RSP_0331
Symbol:
ID: 3719091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 2057613
End bp: 2059451
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 74%
IMG OID: 640071543
Product: hypothetical protein
Protein accession: YP_353408
Protein GI: 77463904
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
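
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2057613, 2059451)
print(length_bp)                  # 1839
print(protein_length(length_bp))  # 612
```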


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.378113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA 
CGCCTCCTCG CGCAGTCGGG CAGAGAGGCT TTGCCTGCGG ACCACCCGCT CCAGGCGGCC
TGGCGCAGCT GGAAGGATGC GTTCCTGCTG CCCGCCGGCC GCATCGTCGA CGGGCCGCAG
CAGAATGCGA GCCATTCCGA AGGGCAGGGC TACGGAGCCA CGCTCGCCGC GATCTTCGGC
GACGAGGAGG CCCTGCGGCG CATCGTCGAC TGGACCGAGG CGAACCTTGC GCGGCGCGAG
GACAAGCTTC TGAGCTGGCG CTGGCTGCCC GGTGTGGCGC TGGCCGTGCC CGACGAGAAC
AACGCCACCG ACGGCGATCT CTTCTACGCC TGGGGTCTCG CCATGGCCGC GCAGCGGTTC
GGCAAGGCCG ATTACGCCGG GCGGGCGACC GAACTGGCGC GCGCCATCGC GCTGCATTGC
GTGCGTCCGC ATCCGGACGG CTCCGAGCAG CTCGTGCTGC TGCCGGGGGC CAGCGGCTTC
GAGACGCCGG ACGGGGTGGT GCTCAACCCC TCCTACTACA TGCCCCGCGC CCTGACCGAG
CTCGCCGCCT TCAGCGGCCA GGACCGGCTG GCGCGCTGTG CCCGCGACGG GGCGGACTGG
ATCGCGTCGC TCGGGCTTCC GCCGGACTGG GCGCTGGTCA CGCCCTTCGG CACACAGCCG
GCGCCGGGCC TGTCCCACAA CAGCGGCTAC GATGCGCTGC GGGTGCCCCT GTTCCTGCTC
TGGTCCGGGC TGACCGCCAA TCCCGCGCTG CGCCGCGCGG TGGAGGCGGC CGGGGACGCC
GCAGCCGGCG ACACGCCGGT GAGGTTCGAC CGCGACACGG GGGCGGTGCT GGAACGGTCC
GCCGATCCGG GCTTCCGCGC CGTGCTCGCG CTTGGCGATT GCGCCCTTTC GGGTCGTCCG
GGGGCGGCGA TCCCGCCCTT CGACGCGCGC CAACCGGGCG GCGGATGCCG AGCTGCGGCG
CCTCCGCGCG CAGTTTCCCG ACTGGGACGT GCCGTCCGAC CTCACGACGC TGGGCCAGCA
GCGGTCTCCC GCCGCCGAGA TCGACCGGAT CTACCGGCAG ATCGCGGCCG GAGACCTGAC
CGAGGCCCGG CAGGCGATGG ACGAGACGTC GCGCAACTTC CCCGGATGGA CGCCGCCGCC
CGAGATGGAG CGTCTTCTGG CCACGGCCGA GGCACAGGCC GCCTTCGATG CGGCCGCCAG
TGCGGGCAAT GCGGGCGCGG CAATCGAGAT CGCGCGGCGG ACGCCCGCGA TCCTGCGCTG
CGACCGGGTG AACAACGCCT GGCGGCTGGC CGAGCTGCAG GCGGCGGCGG GCCAGAAGGC
GGCCGCGCTG CAGAGCTATC GCGGGGTGAT CGCCTCCTGC TCGGGCCTGT CCGAGGTGAC
GGCGACGCTC GAGAAGGCGG AGGCCGTGGC CAGCGATGCG GAGCTGGTCG AGCTCTTCCG
GCTGGCCAAT GCGCAGCTTC CGGGCTCGGG ACCTGCGCTG AAGGCGCTCG AGACACGGCT
GAGGGCGGGA CGCGGCGACA CGGCGCCCGA GGCATCGGCG CCGGCTGCCG CAGCAACGGG
CGGAGCCAAG CGCACGCCGG GCCGCACTGC GGTGGCCGAG GCGGATCTGC CCGCGGCGGG
GCGCCCGCGC ACTGCGGGCG TGGCGCGCAG CGGCGGAGGG GCGGGGCTGT CCGCGGTCCG
CGCGGCAGCG CAACGCGGCG ACTGGCGGAC CTGCACCGGC CTCACCAGCG GCGCCACCAG
CGCCGACATG CTCTACGAGC GGGCCTGGTG CGTCTATAA
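
The GC content field can in principle be recomputed from the gene sequence above. A minimal sketch, assuming the sequence is pasted as plain text with the same space and line-break grouping; `gc_percent` is a hypothetical helper, and how the source database rounds its reported value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, grouped as in the record.
print(gc_percent("ATGAGGAGAC GGACCATCCT"))  # 55.0
```

Passing the full gene sequence instead of the 20-base excerpt gives the value for the whole gene.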
 
Protein sequence
MRRRTILTSA AAALMLAPAG RLLAQSGREA LPADHPLQAA WRSWKDAFLL PAGRIVDGPQ 
QNASHSEGQG YGATLAAIFG DEEALRRIVD WTEANLARRE DKLLSWRWLP GVALAVPDEN
NATDGDLFYA WGLAMAAQRF GKADYAGRAT ELARAIALHC VRPHPDGSEQ LVLLPGASGF
ETPDGVVLNP SYYMPRALTE LAAFSGQDRL ARCARDGADW IASLGLPPDW ALVTPFGTQP
APGLSHNSGY DALRVPLFLL WSGLTANPAL RRAVEAAGDA AAGDTPVRFD RDTGAVLERS
ADPGFRAVLA LGDCALSGRP GAAIPPFDAR QPGGGCRAAA PPRAVSRLGR AVRPHDAGPA
AVSRRRDRPD LPADRGRRPD RGPAGDGRDV AQLPRMDAAA RDGASSGHGR GTGRLRCGRQ
CGQCGRGNRD RAADARDPAL RPGEQRLAAG RAAGGGGPEG GRAAELSRGD RLLLGPVRGD
GDAREGGGRG QRCGAGRALP AGQCAASGLG TCAEGARDTA EGGTRRHGAR GIGAGCRSNG
RSQAHAGPHC GGRGGSARGG APAHCGRGAQ RRRGGAVRGP RGSATRRLAD LHRPHQRRHQ
RRHALRAGLV RL
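
The opening of the protein sequence can be reproduced from the gene sequence using the codon assignments of translation table 11. A minimal sketch follows, assuming the NCBI compact codon-table layout (bases ordered T, C, A, G), in which table 11 assigns the same amino acids as the standard code; `translate` is an illustrative helper that ignores table 11's alternative start codons.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI ordering).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as shown in the record.
first_line = "ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA"
print(translate(first_line))  # MRRRTILTSAAAALMLAPAG
```

The 20 residues printed match the first two groups of the protein sequence, MRRRTILTSA AAALMLAPAG.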