Gene Rsph17029_1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1113 
Symbol 
ID4895159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1155161 
End bp1156939 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content68% 
IMG OID640111699 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_001042995 
Protein GI126461881 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.618647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.551148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACT GGGCCTGGGG ACCGCGGATC GAGGATGGCC TCGGCCGCTT CCGGCTCTGG 
GCGCCGTCGC AGGAGCGGCT GGTGCTGAGA CTAGACGGAA CGGATCACCC GATGACACGC
AGCGACGACG GGTGGTTCGA GCTGCAGGTG CCCGCAGAGG CGGGCATGGA CTATGGCTTC
GTCCTGGAGA GCGGGCAGGT GGTGCCCGAT CCGGCGGCAC GGGCGCAGGC GGGCGATGTG
CACGGCCTGT CGCGTCTGGT CGCGCCGTCC TTCGACTGGC GGCACGACTG GACGGGGCGG
CCCTGGGCCG AGACCGTGGT GATGGAGCTG CATATCGGCA CCTTCACCGA GGAAGGCACC
TTCCGCGCCG CCATCGAGCA TCTGCCGCAT CTGGCCGAAA TCGGGATCAC GATGATCGAG
CTGATGCCCG TGGCGCAGTT CGGCGGCAAC CGCGGCTGGG GCTACGACGG GGTGCTGCTC
TACGCGCCCC ATCCCGCCTA TGGCACGCCC GAGGATCTGA AGGCTCTGGT CGACGCGGCC
CACGGCCTCG GCATGAGCGT GGTGCTCGAT GTGGTCTACA ACCATTTCGG ACCGGACGGG
AATTACCTTG GGGCCTATGC CGCGGACTTC TTCGACCCGG AACGGCACAC GCCCTGGGGC
AGCGCCATCG CCTATCACCT GCCGCCCGTG CGCCGCTTCT TCCTCGACAA TGCGCTCTAC
TGGCTGACCG AGTTCCGCTT CGACGGGCTG CGCATCGACG CGGCCGACCA TATCCGCGAT
CCGGACTCGG ACCCCGAGGT GCTGGTGGAT CTGGCCCGCG CCATAAGGCA GCGCATCCCC
GACCGGCCGA TCCATCTGAC CACCGAGGAC AACCGCAACA TCACGCGGTT GCACGAGCGG
GGACCCGAGG GACAGGTGGT GCTGCACACG GCCGAATGGA ACGACGACCT GCACAATGTG
GCCCATGTTC TGCTGACGGG CGAGACCGAG GGCTATTACT GCGACTTCGT GAAGGACCAC
TGGCGCAAAT ATGCCCGCGC GCTGGCAGAA GGCTTCGTCT ATCAGGGCGA GCATTCCGAG
CATGAGGGCG AGCCCCGCGG AAAGCCGTCG GGCCATCTGC CGCCGCTCGC CTTCGTCGAT
TTCCTCCAGA ACCACGACCA GATCGGCAAC CGCGCCTTCG GCGAGCGGCT GACGACGCTC
GCGCCCGAGG CGCGGCTGCG CGCGATGATG GCGGTCCTGC TGCTGTCGCC GCATGTGCCG
CTGCTCTTCA TGGGCGAGGA ATGGGGCGAG TCGCGGCCCT TCACCTTCTT CACCGACTTT
CACGGAGAGC TCGCCGATGC CGTCCGCAAC GGGCGGCGCA AGGAGTTCGC GCATTTCTCC
GCCTTCCAGG GGCTGGATCT CGACCGGACG GTGCCCGATC CGAATGCCGA GGGCACGTTC
CTCTCCTCGA AGCTCGACTG GCTGCACCGC GAGACCGAGC GCGGCCGGTC CTGGATGGCC
TTCGTCAAGG ATCTGCTCGC CACCCGTGCC CGCGAGATCG CGCCACGGCT CGAGCGCGCA
CCCGGAAACG GCGGGCGCAT CGTGGCGGTC TCGGACGATC TGGTGGCGGT CGACTGGCAG
CTCGACGGGG CGGTCCTGCG CCTCCGCGCC AATTTCGACG ACCGGCCGCA GGACTTGCCG
GAGGCCGGGG GCCGTGTGAT CCATGCCTCT CCCGGCACCG AGGCTGGCGG TCCGCTGCCG
CCCTGTTCGG TGCTGGTCAC GCTCGAGGAG ACCGCATGA
 
Protein sequence
MNDWAWGPRI EDGLGRFRLW APSQERLVLR LDGTDHPMTR SDDGWFELQV PAEAGMDYGF 
VLESGQVVPD PAARAQAGDV HGLSRLVAPS FDWRHDWTGR PWAETVVMEL HIGTFTEEGT
FRAAIEHLPH LAEIGITMIE LMPVAQFGGN RGWGYDGVLL YAPHPAYGTP EDLKALVDAA
HGLGMSVVLD VVYNHFGPDG NYLGAYAADF FDPERHTPWG SAIAYHLPPV RRFFLDNALY
WLTEFRFDGL RIDAADHIRD PDSDPEVLVD LARAIRQRIP DRPIHLTTED NRNITRLHER
GPEGQVVLHT AEWNDDLHNV AHVLLTGETE GYYCDFVKDH WRKYARALAE GFVYQGEHSE
HEGEPRGKPS GHLPPLAFVD FLQNHDQIGN RAFGERLTTL APEARLRAMM AVLLLSPHVP
LLFMGEEWGE SRPFTFFTDF HGELADAVRN GRRKEFAHFS AFQGLDLDRT VPDPNAEGTF
LSSKLDWLHR ETERGRSWMA FVKDLLATRA REIAPRLERA PGNGGRIVAV SDDLVAVDWQ
LDGAVLRLRA NFDDRPQDLP EAGGRVIHAS PGTEAGGPLP PCSVLVTLEE TA