Gene Rsph17029_1586 details

Gene Information

Locus tag: Rsph17029_1586
Symbol:
ID: 4896856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1667133
End bp: 1668146
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 73%
IMG OID: 640112177
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001043468
Protein GI: 126462354
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
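
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1667133, 1668146)
print(length, protein_length_aa(length))  # 1014 337
```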


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.245014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGG CGGGCCGGAC GGCCGCGATT TTCGGCTGCT CCGGCCCGGT CCTCACGGAT 
GCCGAGCGGC AGTTCTTCCG CGAGGCCGAT CCCTTCGGCT TCATCCTCTT CGCACGCAAC
ATCGACGACC CCGCGCAACT CCTTGCCCTG ACGCAGGAAA TGCGTTCCAC CGTGGGGCGC
GACGCGCCCG TCTTCGTCGA TCAGGAAGGC GGCCGCGTCC AGCGTCTCCG CGCGCCGTAC
TGGCGCGAGT GGCTGCCGCC CCTCGAGGCG GTGGAGCGCG CCGGAGACCG GGCCGCCCGG
ATGCTCTGGC TGCGCTACCG GCTCATCGCC GAGGACCTGC GGGCGGTGGG CATCGACGGC
AACTGCGCGC CCGTGGCCGA CATCCGCACC GCGGCGACCC ATCCGTTTCT CGCCAACCGC
TGCCTCGCCG ACGAGGCCGC GCGCGTGGCG GAGCTTGCCC GCGCGGCGGC CGAGGCGCAT
CTGGCGGGCG GGGTCCTGCC GGTGATGAAG CATCTGCCGG GCCACGGACG GGCCGCGGCC
GACACGCACC ACGACCTGCC CACGGTGACC GCCAGCCGCG AGGAGCTGGC CGCCACCGAC
TTCGCGGCCT TCCGGGCGCT TGCGGATCTG CCCTTGGCCA TGACGGGCCA TGTGGTCTTC
TCCGCCTATG ATGCGCAGCC TGCGACCCTC TCGGCGCCCA TGGTCGGCGT CATCCGCGAG
GAGATCGGCT TTTCCGGCCT TCTCATGACG GACGATCTGT CGATGCAGGC CCTCTCGGGC
GGGATCGGCG CGCGGGCGGG GGCGGCCATC GCGGCCGGCT GCGATCTGGC GCTCCATTGC
AACGGCGAAC TGGCCGAGAT GGAGGCCGTG GCCGCCGCCG CGGGCGCGAT GGGGCCCGGG
GCGCTGGAGC GCGCCGCAGC GGCGCTGGCC CGCCGCAGGC CGCCCGAGCC GGTTGACAGC
CGGGCGCTCG AGGCCGAACA TTCCGTCCTT CTGGGCGGGC ATGGGCATGG CTGA
 
Protein sequence
MSEAGRTAAI FGCSGPVLTD AERQFFREAD PFGFILFARN IDDPAQLLAL TQEMRSTVGR 
DAPVFVDQEG GRVQRLRAPY WREWLPPLEA VERAGDRAAR MLWLRYRLIA EDLRAVGIDG
NCAPVADIRT AATHPFLANR CLADEAARVA ELARAAAEAH LAGGVLPVMK HLPGHGRAAA
DTHHDLPTVT ASREELAATD FAAFRALADL PLAMTGHVVF SAYDAQPATL SAPMVGVIRE
EIGFSGLLMT DDLSMQALSG GIGARAGAAI AAGCDLALHC NGELAEMEAV AAAAGAMGPG
ALERAAAALA RRRPPEPVDS RALEAEHSVL LGGHGHG
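
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method: it strips the 10-character block spacing, computes the G+C fraction, and translates with the standard codon assignments, which translation table 11 shares for sense codons (table 11 differs in its permitted start codons; this gene starts with ATG, so no special handling is attempted here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGAGCGAGG CGGGCCGGAC GGCCGCGATT")
print(translate(first_blocks))  # MSEAGRTAAI
print(gc_fraction(first_blocks))
```

Passing the full gene sequence through `clean` and `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check on the record.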