Gene Rsph17029_1360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1360 
Symbol 
ID4896840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1414095 
End bp1415726 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content67% 
IMG OID640111947 
Productextracellular solute-binding protein 
Protein accessionYP_001043242 
Protein GI126462128 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0151381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG ACCGACTTCA TCCCGCGGCG CTCATGCACA GGACCGAAGT GGCGAGGGGC 
CGCATGAGCC GACGCGAGTT CCTGACGCGG ACCACGGCGC TCGGTGTCTC GGCGGGCGCG
GCCTATGCGC TTCTGGGCCT CGCGCAGCCT GTCCGCGCGC AGGAGACACC CCGCAAGGGC
GGCACGCTCC GGATGGAGAT GGAGACGCGC GCGCTGAAGG ATCCGCGCAC CGCCGACTGG
TCGCAGATCT CGAACGTCAC GCGCGGCTGG CTCGAATATC TCGTGGAGTA TGAGGCCGAC
GGCACCTTCC GGCCGATGCT TCTCGAAAGC TGGGAGGCCA ACGACGATGC CACCGAATAT
CTGCTGAAGG TCCGCCCCGG CGTCACCTGG TCGAACGGCG ATCCCTTCAC CGCCGAGGAT
GTGCGGCACA ATTTCGAGCG CTGGTGCGAT GCCTCGGTCG AGGGCAATGC CATGGCGGCG
CAGATGACCG CCCTTCAGGC CGAGGGCAAG CTGCGCACCG ATGCCATCGA GCTGGTGGAC
GACACGACCC TGCGGCTGAA ACTCTCGCAG CCCGACATCG CGCTGATTGC GAACCTCGCC
GACTATCCGG CCGCCGTGGT TCACAAAAGC TACGAAGGCG GCGACCCCGC GGCCAATCCG
GTGGGGACCG GCCCCTACCT TCCCGAAACG ATCGAGGTCG GCATCCGCAT GGTGCTGGTG
CGCAATGAGG CCCATCCCTG GTGGGGCACG GAGGTCTATG GCGGCCCTTG GCTCGACCGG
ATCGAATATC TCGATTTCGG CTCCGATCCT TCCGCCGCGG TAGCCGCCGC GGGGTCGGGC
GAGATCGACG CGACCTATCA GAGCGTGGGT GAATTCATCG ACGTGCTCGA CTCGCTCGGC
TGGGACAAGT CCGAGGCCCG CACCGCCACG ACGCTGGCCA TCCGTTTCAA CCAGCAGGCC
GAGGAGTACA AGGACGTCCG CGTTCGCCGC GCCCTGCAGA TGGCGGTCGA CAATGAGGTG
GTTCTGGAGC TCGGCTATTC CGGGCACGGG CGGGTGGCCG AGAACCACCA TGTCTGTCCG
ATCCATCCCG AATATGCCGA GCTGCCGCCC CTGACCGTGG ACCGCGCCGC GGCGCTGGCG
CAACTGAAGG AGGCCGGCAT GGCCGAGCAC GAGTTCGAGC TCGTCTCGCT CGACGATGCC
TGGCAGGCGG CCTCCTGCGA TGCGGTGGCG GCACAGCTGC GCGATGCCGG CATCGCCATC
CGCCGCACGG TGCTTCCCGG CGCAACCTAC TGGAACGACT GGCTGAAGTT TCCCTTCTCG
GCCACGGAAT GGAACATGCG CCCGCTGGGC GTGCAGGTGC TGGCGCTCGC CTATCGCTCG
GGTGTGCCTT GGAACGAATC CGCCTTCTCG AACAAGGCGT TCGACGCGAA GCTCGATGAG
GCCATGTCGC TCGTCGACCC GGACCGGCGC CGGGTGCTGA TGGCCGATCT CGAGCGGATC
CTGCAGGAGG AGGGCGTGCT GATCCAGCCC TACTGGCGCT CGATCTTCCG CCATGTCGAT
CCCAAGGTGA AGGGAGCCGA GGCGCATCCG ACCTTCGAGC ATCACCATTA CAAATGGTGG
ATCGACGCCT GA
 
Protein sequence
MTTDRLHPAA LMHRTEVARG RMSRREFLTR TTALGVSAGA AYALLGLAQP VRAQETPRKG 
GTLRMEMETR ALKDPRTADW SQISNVTRGW LEYLVEYEAD GTFRPMLLES WEANDDATEY
LLKVRPGVTW SNGDPFTAED VRHNFERWCD ASVEGNAMAA QMTALQAEGK LRTDAIELVD
DTTLRLKLSQ PDIALIANLA DYPAAVVHKS YEGGDPAANP VGTGPYLPET IEVGIRMVLV
RNEAHPWWGT EVYGGPWLDR IEYLDFGSDP SAAVAAAGSG EIDATYQSVG EFIDVLDSLG
WDKSEARTAT TLAIRFNQQA EEYKDVRVRR ALQMAVDNEV VLELGYSGHG RVAENHHVCP
IHPEYAELPP LTVDRAAALA QLKEAGMAEH EFELVSLDDA WQAASCDAVA AQLRDAGIAI
RRTVLPGATY WNDWLKFPFS ATEWNMRPLG VQVLALAYRS GVPWNESAFS NKAFDAKLDE
AMSLVDPDRR RVLMADLERI LQEEGVLIQP YWRSIFRHVD PKVKGAEAHP TFEHHHYKWW
IDA