Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2359 |
Symbol | |
ID | 4897098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2495499 |
End bp | 2497436 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640112955 |
Product | extracellular solute-binding protein |
Protein accession | YP_001044233 |
Protein GI | 126463119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0844875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.182874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGAAG TCACGGCGCG CACGGCGCAG GGCAGGGTCG CAGTCTCGAG ACTTCCCGAC GTGCGGTCAT GGCTTCTGGG GGGGCTCGGC CTGCTGGCCG CCGCCGCCGC GGTGCTGCCC GCCCACGCGC AGGACGCGCC GAAGATCATC AAGGCGCACG GCATCTCGAC CTTCGGCGAT CTGAAATATC CGGCCGACTT CACCCATCTC GATTACGTCA ATCCCGACGC ACCCAAGGGC GGCGAGATCT CGGAATGGAC CTTCGGCGGC TTCGATTCGA TGAACCCCTA TTCGGTGAAG GGCCGGGCCG CGGCCCTCTC GTCGATCATG TATGAATCGA TCCTCGCGGG CACGGCCGAC GAGATCGGCG CGGCCTACTG CCTGCTCTGC GAGACGCTCG AATATCCCGA GGACCGCAGC TGGGTGATCT TCAACCTGCG TCCCGAGGCG AAATTCTCGG ACGGCACCCC CGTCACCGCA GAGGACGTGG TCTTTTCCTA CGAGACCTTC GTGGCCAAGG GTCTCACCGA TTTCCGCACC ATCTTCGCCC AGCAGGTCGA GGGGGCCGAG GCGCTCGACA CGCATCGGGT GAAGTTCACC TTCAAGAAGG GCATCCCCAC CCGCGATCTG CCGCAGGACG TGGGCGGGCT GCCGGTCCTG TCCAAGGCGC AGTATGAGCG TGAGGGGCTC GACCTCGAGG AGGGAAGCCT GAAGCCCTTC CTCGGCTCGG GCGCCTATGT GCTCGACGAG AGCCGGATGA AGGTGGGCCA GACGGTCGTC TACCGCCGCA ATCCCGACTA CTGGGGCAAG GACCTGCCGC TCATGCGCGG CACCGGAAAT TTCGACGCGA TCCGCATCGA ATATTACGCC GACTACAATG CGGCCTTCGA GGGCTTCAAG GGCGGCAGCT ACACCTTCCG CAACGAGGCC TCCTCGATCC TCTGGGCCAC GGGCTACGAC TTCCCGGCCG TGCAGACCGG CCATGTGGTG AAGGTCGAGC TGCCCTCGGG CGCCAAGGCC ACGGGGCAGG GCTGGATGCT GAACCTCCGG CGCGAGAAGT TCCAGGACCC GAAGGTGCGC GAGGCGCTGA ACCTCATGTT CAACTTCGAA TGGTCGAACC AGACGCTGTT CTACGGCCTC TATACCCGCG TCGATTCCTT CTGGGAAAAC AGCTACCTCG AGGCGGAGGG CGCGCCCTCC GAGGCCGAGG CGGCGCTTCT GAAGCCGCTC GTCGATGAGG GCCTGCTTCC GGCCTCGATC CTCACCGAGC CCCCGGTCAG CCCGCCCGTC TCTGGCGAAC GGCAGCTCGA CCGCAGGAAC CTGCGGGCGG CCAGCAAGCT CTTGGACGAG GCGGGCTGGA CCGTGGGCTC GGACGGGATG CGCCGCAACG CCAAGGGCGA GGTGCTGCGC GTCGAATTCC TCAACGACAG CCAGACCTTC GACCGGGTTA TCAGCCCCTT CGTCGAGAAC CTGCGCGCGC TGGGCGTGGA TGCGCTGATG ACGCGCGTGG ACAATGCCCA GATGGAAAGC CGCACCCGGC CGCCGAGCTA CGATTTCGAC ATCACCACCG GCAATGCGCG CACCAACTAC ATCTCGGGCG CCGAGTTGAA GCAGTATTAC GGGTCGGAGA CCGCCGACAT CTCGGCCTTC AACATCATGG GCCTGAAGGA CAAGGCGGTG GACCGGATGA TCGAGGTGGT TCTGGCCGCC AAGACCTCCG AGGAGCTCGA GGTGGCGACG AAGGCGCTCG ACCGGGTGCT GCGGCTGCAG CGGTTCTGGG TGCCGCAATG GTACAAGGCC AGCAACACCG TCGCCTATTA CGACATGTTC GAGCATCCCG AGACCCTGCC GCCCTATGCG CTGGGCGAGC TGGACTTCTG GTGGTTCAAC CCCGACAAGG CCCAGGCGCT GCGTGACGCG GGCGCCTTGA GACAGTAA
|
Protein sequence | MGEVTARTAQ GRVAVSRLPD VRSWLLGGLG LLAAAAAVLP AHAQDAPKII KAHGISTFGD LKYPADFTHL DYVNPDAPKG GEISEWTFGG FDSMNPYSVK GRAAALSSIM YESILAGTAD EIGAAYCLLC ETLEYPEDRS WVIFNLRPEA KFSDGTPVTA EDVVFSYETF VAKGLTDFRT IFAQQVEGAE ALDTHRVKFT FKKGIPTRDL PQDVGGLPVL SKAQYEREGL DLEEGSLKPF LGSGAYVLDE SRMKVGQTVV YRRNPDYWGK DLPLMRGTGN FDAIRIEYYA DYNAAFEGFK GGSYTFRNEA SSILWATGYD FPAVQTGHVV KVELPSGAKA TGQGWMLNLR REKFQDPKVR EALNLMFNFE WSNQTLFYGL YTRVDSFWEN SYLEAEGAPS EAEAALLKPL VDEGLLPASI LTEPPVSPPV SGERQLDRRN LRAASKLLDE AGWTVGSDGM RRNAKGEVLR VEFLNDSQTF DRVISPFVEN LRALGVDALM TRVDNAQMES RTRPPSYDFD ITTGNARTNY ISGAELKQYY GSETADISAF NIMGLKDKAV DRMIEVVLAA KTSEELEVAT KALDRVLRLQ RFWVPQWYKA SNTVAYYDMF EHPETLPPYA LGELDFWWFN PDKAQALRDA GALRQ
|
| |