Gene Rsph17029_1437 details

Gene Information

Locus tag: Rsph17029_1437
Symbol:
ID: 4896444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1490987
End bp: 1492798
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 69%
IMG OID: 640112025
Product: extracellular solute-binding protein
Protein accession: YP_001043319
Protein GI: 126462205
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
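
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1490987
END_BP = 1492798
GENE_LENGTH_BP = 1812
PROTEIN_LENGTH_AA = 603


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1492798 - 1490987 + 1 gives 1812 bp, and 1812 / 3 gives 604 codons, one of which is the stop codon, leaving 603 residues.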


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0112279
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACAGG AATTCCGGTC CCGCCTGAGG GGCGCCGCGA TGGCTCTCGG GCTTGCCGTT 
CTGGGCACGA CGCTGGCCGC CGAACCCCGG CACGGCATAG CTATGTATGG CGAACCGGCG
CTTCCACCGG ATTTTGTGTC TCTGCCATAT GCAAATCCCG ACGCGCCGAA GGGCGGCCGG
ATCGTGCTGG GCGAGACGGG CGGATTCGAT TCCCTCAACC CCTACATCGT GAAGGGTCGC
GCCCCCTATT CGCTCGCGCC GCTGACGGTC GAGACGCTGC TCGGCCGGTC GCTCGACGAG
CCCTTCACGC TCTACGGGCT TCTGGCCGAA TCGGTCGAGA CGGACCCCGC CCGGACCTGG
GTCGAATTCA CGCTGCGCGA GGGCGCGCGG TTCTCGGACG GCACGCCCGT GACGGTCGGG
GATGTGCTCT GGTCGTTCGA GATTCTCGGC ACGAAAGGGA TGCCGCGCTA CTGGGGCGCG
TGGCAGAAGA TCGCGAGCGC CGAGCAGACG GGCCCGCGCT CGGTCCGCTT CACCTTCTCC
GAGCAGGACC GCGAACTGCC GCTGGTGCTG GGTCTGCGCC CGATCCTGAA GAAGGCGCAG
TGGGAGGGGC GCGACTTCGG CTCCTCGGGC TTCGAGGCGC CGATCGGCTC GGGCCCTTAC
ATCCTCGAGA GCTTCGAGCC GGGCCGGGTG CTGCGCTACC GCCGGAACCC CGACTGGTGG
GGGCGCGACC TGCCCTTCAA CCGCGGCCTG CACAATCTCG ACGAGGTGGT GGTCGAGTAT
TTCGGCGATG CGAGCGTGGC CTTCGAGGCG TTCAAGGCCG GCGCCCTCTC GGTCTATCGC
GAGACGAGCG CCGCCCGCTG GGCCACTCAC TACGGCTTCC CGGCCGTGCA GAGCGGCGCG
ATGGTCAAGT CCGAGATCCC GCATGGCCGC CCGTCCGGAA TGGAGGGGCT GGTGATGAAC
ACGCGCCGCG CCCCTTTCGC GGACTGGCGC GTGCGCGAGG CGATGCTCTT GGCCTTCGAC
TTCGACCTCA TCAACCGCAC GCTGACCGGC GGCGCCGAGC CGCGGATCGC CTCCTACTTC
TCGAACTCCG CGCTCGGGAT GGAGGCGGGG GCGCCTGCCA CGGGGCGCGA GCGGGCGCTG
CTCGAGCCCT TTGCGGCAGA TCTGCTGCCC GGCACGCTCG ACGGCTATGC CCTGCCCGCC
ACGCACGGCG CCTCGAACCG CGGCAACCTC CGCAAGGCCG CCCGGCTTCT CTCGGAGGCC
GGCTGGCGGA TCGAGGACGG GATGCTGGAA GGCCCCGGCG GCGAGCCCTT CGCCTTCGAG
ATCCTGCTGC CGCAGGGCGC GGATGCGATG ATCGCGGCCG CGATCATCTA CCGGCAGGCC
CTGACGCGGC TCGGGATCTC GGCCCGCATC ACGACCGTCG ATCCGGCGCA GTTCAAGCAG
CGGGTCGACA ATCAGGATTT CGACATGACG AGCTTCCTGC GCTCGCTCAC CCTCTCGCCC
GGCAACGAGC AGCTCCTCTA CTGGTCGGCC GAGGACAAGG ATCTGCCCGG CTCGCGCAAC
CTGATGGGGA TGGAGAGCCC CGCCGCCGAG GCCGTGATCC GGCACATGCT GGCCACCGAC
GATGCCGAGG AGTTTCAAGC CTCCGTGAGG GCGCTCGACC GGGTCCTGAC CGCCGGTAGA
TATGTCATTC CGATGTGGTA TTCTCGGGTC TCCCGGCTGG CGCACGACAG GCACCTGCGC
TATCCGGCAA AAACACCCAT CTATGGCGAC TGGCCGGGCT TCCTGCCGGA CGTCTGGTGG
CAAGAAAAGT GA
 
Protein sequence
MIQEFRSRLR GAAMALGLAV LGTTLAAEPR HGIAMYGEPA LPPDFVSLPY ANPDAPKGGR 
IVLGETGGFD SLNPYIVKGR APYSLAPLTV ETLLGRSLDE PFTLYGLLAE SVETDPARTW
VEFTLREGAR FSDGTPVTVG DVLWSFEILG TKGMPRYWGA WQKIASAEQT GPRSVRFTFS
EQDRELPLVL GLRPILKKAQ WEGRDFGSSG FEAPIGSGPY ILESFEPGRV LRYRRNPDWW
GRDLPFNRGL HNLDEVVVEY FGDASVAFEA FKAGALSVYR ETSAARWATH YGFPAVQSGA
MVKSEIPHGR PSGMEGLVMN TRRAPFADWR VREAMLLAFD FDLINRTLTG GAEPRIASYF
SNSALGMEAG APATGRERAL LEPFAADLLP GTLDGYALPA THGASNRGNL RKAARLLSEA
GWRIEDGMLE GPGGEPFAFE ILLPQGADAM IAAAIIYRQA LTRLGISARI TTVDPAQFKQ
RVDNQDFDMT SFLRSLTLSP GNEQLLYWSA EDKDLPGSRN LMGMESPAAE AVIRHMLATD
DAEEFQASVR ALDRVLTAGR YVIPMWYSRV SRLAHDRHLR YPAKTPIYGD WPGFLPDVWW
QEK
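
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated for comparison with the protein sequence. It is not the method used by the source database. The helper names are hypothetical, and the codon table is the standard assignment string, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons, which this sketch does not model).

```python
# Standard codon assignments, ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(text: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean_sequence(text)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First and last fragments of the gene sequence listed above.
    print(translate("ATGATACAGG AATTCCGGTC CCGCCTGAGG"))  # MIQEFRSRLR
    print(translate("CAAGAAAAGT GA"))  # QEK*
```

The two printed fragments match the start (MIQEFRSRLR) and end (QEK) of the protein sequence listed above, with the final TGA read as the stop codon. Passing the full cleaned gene sequence to `gc_fraction` would give a value that can be compared against the GC content field.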