Gene Rsph17029_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3767 
Symbol 
ID4898861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp888799 
End bp889956 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID640114372 
Productextracellular solute-binding protein 
Protein accessionYP_001045620 
Protein GI126464507 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGC ACCGTGACCA CATCCCGGCG AAAGAGTTTC TTCAGCGGCT GGAAAGCTAC 
CGCAAGGGCT CGATCTCGCG CCGGCACTTC CTGAACGTGA CCGGCCTCGG CGCCGCGACC
ATGGCAATGG CGGGTGCGAT GCCGGGCTTC GCCCGCCGCG CGCAGGCCCA GGGCGCGATC
GGCGACCGCG TCGTCATCGC CACCTGGCCG AACTACCACG ACCCGGCCGA CCTCGACGCC
TTCCGGGCCG CCACCGGCGC CGCGGTCGAC GTCAACGTCT TCGGCTCCAA CGAGGAGATG
CTCGCGAAGC TGCAGGCGGG CGGCACGGGT TGGGACGTGG TGGTGGCCAC GAACTACACG
ATCTCGACCT ATGTCGAGGC GGGAATCATC GAGGAGCTCG ACCTGTCGCG CATCCCGAAT
TTCGACAGGG CTTCGACCGA CGCGCGCTTC GCCGATCCGG GCGTCATCGA CGGCAAGACC
TACGCCATCC CGCGCAACAT CGGCACGACC GGCTATTGCA TCAATACCGC CGAGATCGAC
GGCGAGACGC CGACCACCTG GAAGGAATTC TGGGATCTGG CGCGCGACCG GCTGTCGGGA
CGCGGCATGG TGCATGACTA TCAGCTGACC GCCATCGGCA ACGCACTGAA ATACTACGGC
TATTCCTTCA ACTCGGTCGA TCCGGCGGAA CTCGCGAAGG CGGAGGAGCT GCTGATCGAC
GCCAAGCCGC ATCTCTTCGC GATTACCTCG GACTACCAGC CCTCGATGCG TTCCGGCGAT
GCGGCGCTGT CGATGTGCTG GACCGGCGAC GCGGTGCAGC TGCAGCGCGA CATCCCCGAG
ATTGCCTACG TGCTCGGCCG CGAGGGCGGC GAGCTCTGGT CGGACTTCTT CACCATCCCC
GCCTCGGCCC CGCACAAGGA TGCGGCCTAT GCGCTGATCG ACTTCCTGCT CGAGCCGAAG
ATGGCCGCGC AGGAGGCCAT GTTCCACGGC TATCCGACCG GAGACGCCCG GGTCGACGCG
ATGCTGCCCG CCGAGATGCG CGACAGCCCG ATCCTGTTCC CGGCTGCGGA TCTCCTGAAT
GCGCTCGAGT TCGGCGCCGC CGTCACCCTG ACCAACCCGG ACCGCGCCGA GGTCATGGCG
CGCTTCAAAT CGGCATAA
 
Protein sequence
MTQHRDHIPA KEFLQRLESY RKGSISRRHF LNVTGLGAAT MAMAGAMPGF ARRAQAQGAI 
GDRVVIATWP NYHDPADLDA FRAATGAAVD VNVFGSNEEM LAKLQAGGTG WDVVVATNYT
ISTYVEAGII EELDLSRIPN FDRASTDARF ADPGVIDGKT YAIPRNIGTT GYCINTAEID
GETPTTWKEF WDLARDRLSG RGMVHDYQLT AIGNALKYYG YSFNSVDPAE LAKAEELLID
AKPHLFAITS DYQPSMRSGD AALSMCWTGD AVQLQRDIPE IAYVLGREGG ELWSDFFTIP
ASAPHKDAAY ALIDFLLEPK MAAQEAMFHG YPTGDARVDA MLPAEMRDSP ILFPAADLLN
ALEFGAAVTL TNPDRAEVMA RFKSA