Gene RSP_0704 details


Gene Information

Locus tag: RSP_0704
Symbol:
ID: 3718182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 2448662
End bp: 2450599
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 66%
IMG OID: 640071920
Product: ABC peptide transporter, substrate binding protein
Protein accession: YP_353781
Protein GI: 77464277
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
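
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 2448662
END_BP = 2450599


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an interval with inclusive end points."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")  # prints "1938 bp, 645 aa"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.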


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.49782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGAAG TCACGGCGCG CACGGCGCAG GGCAGGGTCG CAGTCTCGAG ACTTCCCGAC 
GTGCGGTCAT GGCTTCTGGG GGGGCTCGGC CTGCTGGCCG CAGCCGCGGC GGCGCTGCCC
GCCCATGCGC AGGACGCGCC GAAGATCATC AAGGCGCACG GCATCTCGAC CTTCGGCGAT
CTGAAATATC CGGCCGATTT CACCCACCTC GAGTACGTCA ATCCCGACGC ACCCAAGGGC
GGCGAGATCT CGGAATGGAC CTTCGGCGGC TTCGATTCGA TGAACCCCTA TTCGGTGAAG
GGCCGGGCTG CGGCCCTCTC GTCGATCATG TATGAATCGA TCCTCGCGGG CACGGCCGAC
GAGATCGGCG CGGCCTACTG CCTGCTCTGC GAGACGCTCG AATATCCCGA GGACCGCAGC
TGGGTGATCT TCAACCTGCG CCCCGAGGCG AAATTCTCGG ACGGCACCCC CGTCACCGCA
GAGGATGTGG TCTTTTCCTA CGAGACCTTC GTGGCCAAGG GGCTCACCGA TTTCCGCACC
ATCTTCGCCC AGCAGGTCGA GGGGGCCGAG GCGCTCGACA CGCATCGGGT GAAGTTCACC
TTCAAGAAGG GCATCCCCAC CCGCGATCTG CCGCAGGACG TGGGCGGGCT GCCGGTCCTG
TCCAAGGCGC AGTATGAGCG CGAGGGGCTC GACCTCGAGG AGGGAAGCCT GAAGCCCTTC
CTCGGTTCGG GCGCCTATGT GCTCGACGAG AGCCGGATGA AGGTGGGCCA GACCGTCGTC
TACCGCCGCA ATCCCGACTA CTGGGGCAAG GACCTGCCGC TCATGCGCGG CACCGGAAAT
TTCGACGCGA TCCGCATCGA ATATTACGCC GACTACAATG CGGCCTTCGA GGGCTTCAAG
GGCGGCAGCT ACACCTTCCG CAACGAGGCC TCCTCGATCC TCTGGGCCAC GGGCTACGAC
TTCCCCGCCG TCCAGACCGG CCATGTGGTG AAGGTCGAGC TGCCCTCGGG CGCCAAGGCC
ACGGGGCAGG GCTGGATGCT GAACCTCCGG CGCGAGAAGT TCCAGGACCC GAAGGTGCGC
GAGGCGCTGA ACCTCATGTT CAACTTCGAA TGGTCGAACC AGACGCTGTT CTACGGCCTC
TATACCCGCG TCGATTCCTT CTGGGAAAAC AGCTACCTCG AGGCGGAGGG CGCGCCCTCC
GAGGCCGAGG CGGCGCTTCT GAAGCCGCTC GTCGACGAGG GCCTGCTGCC GGCCTCGATC
CTCACCGAGC CCCCGGTCAG CCCGCCCGTC TCTGGCGAAC GGCAGCTCGA CCGCAGGAAC
CTCCGGGCGG CGAGCAAGCT CTTGGACGAG GCGGGCTGGA CCGTGGGCTC GGACGGGATG
CGCCGCAACG CCAAGGGCGA GGTGCTGCGC GTCGAATTCC TCAACGACAG CCAGACCTTC
GACAGGGTCA TCAGCCCCTT CGTCGAGAAC CTGCGCGCGC TGGGCGTGGA TGCGCTGATG
ACGCGCGTGG ACAATGCCCA GATGGAAAGC CGCACCCGGC CGCCGAGCTA CGATTTCGAC
ATCACCACCG GCAATGCGCG CACGAACTAC ATCTCGGGCG CCGAGCTGAA GCAGTATTAC
GGGTCGGAGA CCGCCGACAT CTCGGCCTTC AACATCATGG GCCTGAAGGA CAAGGCTGTG
GACCGCATGA TCGAGGTGGT TCTGGCCGCC AAGACCTCCG AGGAGCTCGA AGTGGCGACC
AAGGCGCTCG ACCGGGTGCT GCGGCTGCAG CGGTTCTGGG TGCCGCAATG GTACAAGGCC
AGCAACACCG TCGCCTATTA CGACATGTTC GAGCATCCCG AGACCCTGCC GCCCTATGCG
CTGGGCGAGC TGGACTTCTG GTGGTTCAAC CCCGACAAGG CCCAGGCGCT GCGTGACGCG
GGCGCCTTGA GACAGTAA
 
Protein sequence
MGEVTARTAQ GRVAVSRLPD VRSWLLGGLG LLAAAAAALP AHAQDAPKII KAHGISTFGD 
LKYPADFTHL EYVNPDAPKG GEISEWTFGG FDSMNPYSVK GRAAALSSIM YESILAGTAD
EIGAAYCLLC ETLEYPEDRS WVIFNLRPEA KFSDGTPVTA EDVVFSYETF VAKGLTDFRT
IFAQQVEGAE ALDTHRVKFT FKKGIPTRDL PQDVGGLPVL SKAQYEREGL DLEEGSLKPF
LGSGAYVLDE SRMKVGQTVV YRRNPDYWGK DLPLMRGTGN FDAIRIEYYA DYNAAFEGFK
GGSYTFRNEA SSILWATGYD FPAVQTGHVV KVELPSGAKA TGQGWMLNLR REKFQDPKVR
EALNLMFNFE WSNQTLFYGL YTRVDSFWEN SYLEAEGAPS EAEAALLKPL VDEGLLPASI
LTEPPVSPPV SGERQLDRRN LRAASKLLDE AGWTVGSDGM RRNAKGEVLR VEFLNDSQTF
DRVISPFVEN LRALGVDALM TRVDNAQMES RTRPPSYDFD ITTGNARTNY ISGAELKQYY
GSETADISAF NIMGLKDKAV DRMIEVVLAA KTSEELEVAT KALDRVLRLQ RFWVPQWYKA
SNTVAYYDMF EHPETLPPYA LGELDFWWFN PDKAQALRDA GALRQ
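
The correspondence between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. It uses only the Python standard library. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), so the standard assignments are used here. The helper names are illustrative, and only the first and last lines of the gene sequence are translated rather than the full record.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_gene_line = "ATGGGTGAAG TCACGGCGCG CACGGCGCAG GGCAGGGTCG CAGTCTCGAG ACTTCCCGAC"
last_gene_line = "GGCGCCTTGA GACAGTAA"

print(translate(first_gene_line))  # MGEVTARTAQGRVAVSRLPD
print(translate(last_gene_line))   # GALRQ
```

The two outputs match the first 20 and last 5 residues of the protein sequence. The last line can be translated on its own because the 1920 bases preceding it are a multiple of three, so it starts in frame.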