Gene Rsph17025_2978 details


Gene Information

Locus tag: Rsph17025_2978
Symbol:
ID: 5085181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 3042784
End bp: 3044430
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 66%
IMG OID: 640484549
Product: alpha amylase, catalytic region
Protein accession: YP_001169169
Protein GI: 146279010
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
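
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3042784
end_bp = 3044430
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one untranslated stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1647 0 548
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.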


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.533119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0820165
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATCA GCCGCACGAG TGATGTCTGG TGGAAGAACG CGGTCTTCTA CTGCCTTGAT 
GTCGAGACAT TCCAGGACAG CAACGGTGAC GGAGTCGGCG ACTTTGCCGG CTTGACCCGG
CGGCTCGACC ATCTGGACCG CCTTGGCGTG AGCTGTATCT GGCTGATGCC CTTCTACCCG
AGCCCCAACC ACGACGACGG CTACGATGTG ACCGACTATT ACGCCGTCGA TCCGCGGCTC
GGCACGATGG GCGATCTCGT GGAGTTTCTT CGGAGCGCAC GGGATCGCGG GATGCGAGTC
ATCGCCGACC TTGTCGTGAA CCACACCTCG CGCGAGCATC CCTGGTTTCA GGCGGCGCGC
TCGGATCCGG CCTCGCCCTA CCACGACTGG TATGTCTGGC GCGACGAGAA GCCCGAGGAG
GACGCCTCGT CTCTGATCTT CCCGGGCGAG GAAGACAGCA GCTGGAGCTG GGACCCGAAG
GCCAGGAAAT ACTACCTGCA CCGCTTCTAC AGCCACCAGC CCGACCTCAA TGTCGAGAAC
CCCGCGGTGC GGGACGAGAT CCGCCGCATC GTCGGCTTCT GGCTTCAGCT CGGCCTGTCG
GGCTTTCGGG TCGATGCCGT GCCCTTCCTG CTCGAGCAGA TGTCCACCGA GGACAAGGGC
TTTGCGCCCC ACCGCTGGCT GCGCGACCTG CGCGCCTTCA TCAACCGGCG CTCGGGCGAG
GCGGTGCTGC TGGGCGAGGT CAACCTCGAC TACCCCGACG TGCGGCGCTT CTTCGGCGAC
GAGGACGGCG ATGAACTGCA CATGTGCCTC GACTTCAACC TCAATCAGGC CATGGCGCTG
GCGCTGGCGC GCGAAGATGC GGGGCCGATC GTGCACGGGC TGCGCCACAT GCCCGAGCTG
GCGCCGGATG ACGGCTGGGC GCATTTCCTG CGCAACCATG ACGAATGGTC GCTCGACAAG
CTGACCGAGG CCGAGCGTCA GGAGGTCTTT GCCAGCTTCG GGCCGGATCC CGACATGCAG
CTCTTCGGGC GCGGGCTGCG CCGGCGGATG CCGACGATGC TCAAGGGTAA CGAGGCGCGG
ATGCGGATGG CCTATGCGCT GATGACCTCG ATGCCGGGGG CGCCGGTGCT CTTCTACGGG
GAAGAGATCG GCATGGCCGA GAACCTCGAC ATCCCCGGCC GTCTCGCGGT GCGGGCGCCG
ATGCAGTGGG AGAGCGGGCA CAACGGCGGC TTCTCGCCCG CGCCGGCCGA CAGGCTGTTG
CGCCCGGTGG TGTCCGAGGG CCGCTGGTCC CCTGCCCGCG TCAACGTCGC CGAGCAGCAG
GACCGGCCGG ATTCGTTCTT CAAGTTCATG GAGCAGCTCA TGCGGCGGCG GCGGGAATGT
CCCGAGATCG CCTTCGGCAC CCATGCCGTC CTGCCGATGG CCCAGGCCCC CGTCTTCGCC
ATCCGCCATG ACTGGCAGGA TCGGACACTG ATCGCGCTCG CGAACCTCGG CAGCCAGGAG
CAGACCGTGA CCTGTTCCCT CTCCGACATC GACTCGATCG GCACGCTCCG CCCGATCCTC
GGCAGCGGCA AGATCTCGGT GAAGAAGACC GAACTGACGG TCGAACTCGA GGGCTACGGC
CTCCGCTGGG TCCGGTTCGA GGTCTGA
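
The gene sequence above is printed in 10-base groups, so the spaces and line breaks have to be removed before computing a figure such as the GC content listed in the record. The following is a minimal sketch of that calculation. The function name `gc_percent` is hypothetical, and the database may compute or round its value differently.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    # split() with no arguments drops spaces and line breaks between groups.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, exactly as printed.
first_line = "ATGGATATCA GCCGCACGAG TGATGTCTGG TGGAAGAACG CGGTCTTCTA CTGCCTTGAT"
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence into the function in place of `first_line` gives a value to compare with the GC content field.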
 
Protein sequence
MDISRTSDVW WKNAVFYCLD VETFQDSNGD GVGDFAGLTR RLDHLDRLGV SCIWLMPFYP 
SPNHDDGYDV TDYYAVDPRL GTMGDLVEFL RSARDRGMRV IADLVVNHTS REHPWFQAAR
SDPASPYHDW YVWRDEKPEE DASSLIFPGE EDSSWSWDPK ARKYYLHRFY SHQPDLNVEN
PAVRDEIRRI VGFWLQLGLS GFRVDAVPFL LEQMSTEDKG FAPHRWLRDL RAFINRRSGE
AVLLGEVNLD YPDVRRFFGD EDGDELHMCL DFNLNQAMAL ALAREDAGPI VHGLRHMPEL
APDDGWAHFL RNHDEWSLDK LTEAERQEVF ASFGPDPDMQ LFGRGLRRRM PTMLKGNEAR
MRMAYALMTS MPGAPVLFYG EEIGMAENLD IPGRLAVRAP MQWESGHNGG FSPAPADRLL
RPVVSEGRWS PARVNVAEQQ DRPDSFFKFM EQLMRRRREC PEIAFGTHAV LPMAQAPVFA
IRHDWQDRTL IALANLGSQE QTVTCSLSDI DSIGTLRPIL GSGKISVKKT ELTVELEGYG
LRWVRFEV
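
The protein sequence can be compared against a translation of the gene sequence. The sketch below is a simplified illustration, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for amino acids, and it does not model table 11's alternative start codons (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are made up for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last printed lines of the gene sequence; both begin on a codon
# boundary because each full line holds 60 bases.
print(translate("ATGGATATCA GCCGCACGAG TGATGTCTGG TGGAAGAACG CGGTCTTCTA CTGCCTTGAT"))
# prints: MDISRTSDVWWKNAVFYCLD
print(translate("CTCCGCTGGG TCCGGTTCGA GGTCTGA"))
# prints: LRWVRFEV
```

Both results match the first 20 and the last 8 residues of the protein sequence shown above.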