Gene Information |
Locus tag | Rsph17029_3169 |
Symbol | |
ID | 4898881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 194533 |
End bp | 196140 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640113771 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045041 |
Protein GI | 126463928 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.116403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAT CGCACCACCT GCTGATGGAC GATCTGGTCA CGCGGCTGCG GCGCGGACAG CTGTCGCGTC GCGAGTTTCT GGCCCGCAGT TCGGCGCTGC TGGCGGCCGG GGCCATGAGC GGGCTGCCCG GTGCGGCGCT TGCCCAGCAG GCGGCGCCGA AGGCCGGCGG CTTCATGCGG CTCGGCCTGC ACAATGCCTC GCAGAACGAC AACCTCGATC CCGGCAGCTG GTCGACGAGC TGGACCGGCG CCTCGTTCAA CGGCGGCGTC TACAACAACC TGGTCGAGAT CATGCCCGAC GGCTCCGTCG CGGGCGATCT GGCCGAGAGC TGGGAGGCGG AGCCCGGCGC GAAGGTCTGG CGCTTCAAGC TGCGCTCGGG CGTGACCTTC CACAACGGCA AGAGCCTCGA GGCGGAAGAC GTGCGCCAGT CGCTCGAGCA TCACATGAAG CCGGACTCGA CCTCGGGCGC GCGCGCCATC GTCGAGCAGA TCGAGACCAT CGACATCGAA GGGTCCGACA CCGTCCGCAT CACCCTGTCC GAGGGCAATG CCGACCTGCC CTACCTCTTG TCGGATTATC ACCTCTCGAT CTATCCGGCG CTGGAGGGCG GCGGGATCGA CATGGAGAGC GCCAACGGCA CCGGCGCCTT CCTCCTCGAG AGCTTCGAGC CGGGCATCGC CACCCGCCTC AAGCGGAACC CGAACTATCA CAAGAACAAC AAGCCCTATC TCGACGAGGT CGAGTTCATC AACATCACCG ACGCCACGGC GCGGCTGAAC GCGCTGCTGA CCGGCGAGGT CGATTTCATC CAGGATCTCG ACATCCGCAA CGTGGCGATG GTCGAGCGCA GCGGCGATTT CTCGGTTCAG CGTGTGCCGA GCCTGCGCCA CTTCACCTTC GACATGGACA CCCGCGTGGC GCCCTTCGAC AATCCCGACG TGCGGCTGGC GCTGAAATAT GCGCTCGACC GGGACGACGT GATCGAGAAG GTGTTCCTCG GCGAGGCCAC GAAGGGGAAC GACAACCCGG TCGCCTCGAT CCAGAAGTTC TACCACGACA TGCCCGCGCG CGAATACAGC ATCGCGAAGG CCAAGGAGCA TCTGGCCAAG GCCGGGCTCG ATCAGGTGAC TGTCGATCTC TCGGTGGCCG AGAATGCCTT CGCGGGCGCC ATCGAGGCGG CGACCCTCTA CCAGCGCCAT GCGGCCGAGG CCGGCATCAA CATCAACATC GTGCAGGAGG CGGCCGACGG CTACTGGGAG AACGTCTGGC GCAAGAAGCC CTTCTGCGCG GTGGACTATT TCGGCCGCGC CACCGTCGAC TGGCTGTTCT CGACGAGCTA TGTCACCGGC GCGCCGTGGA ATTCGGGCTG GTCGAACGCG CGGTTCGACG AGCTGCACCA GACGGCACGG GCCGAGACCG ACGAGGCCAA GCGCGCCGCC TGCTACGCCG AGATGCAGGA GATCCTGCGC GACGACGGCA ACGTCATCAC CGTGGCCTTC GTGAGCTGGC GCAACGCCGT CTCGAACCGC ATCGGCTTCG GCGAGGTCGG CGGGCTGATG CCGCTCGACA ACATGCGGAT GTGCGAGCGC TGGTGGGTCA AGGACTGA
|
Protein sequence | MNKSHHLLMD DLVTRLRRGQ LSRREFLARS SALLAAGAMS GLPGAALAQQ AAPKAGGFMR LGLHNASQND NLDPGSWSTS WTGASFNGGV YNNLVEIMPD GSVAGDLAES WEAEPGAKVW RFKLRSGVTF HNGKSLEAED VRQSLEHHMK PDSTSGARAI VEQIETIDIE GSDTVRITLS EGNADLPYLL SDYHLSIYPA LEGGGIDMES ANGTGAFLLE SFEPGIATRL KRNPNYHKNN KPYLDEVEFI NITDATARLN ALLTGEVDFI QDLDIRNVAM VERSGDFSVQ RVPSLRHFTF DMDTRVAPFD NPDVRLALKY ALDRDDVIEK VFLGEATKGN DNPVASIQKF YHDMPAREYS IAKAKEHLAK AGLDQVTVDL SVAENAFAGA IEAATLYQRH AAEAGININI VQEAADGYWE NVWRKKPFCA VDYFGRATVD WLFSTSYVTG APWNSGWSNA RFDELHQTAR AETDEAKRAA CYAEMQEILR DDGNVITVAF VSWRNAVSNR IGFGEVGGLM PLDNMRMCER WWVKD
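The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions agree with the values listed above. The function names are hypothetical helpers introduced for illustration.

```python
def clean_sequence(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
print(gene_length(194533, 196140))  # 1608
print(protein_length(1608))         # 535
```

Passing the full Gene sequence string to gc_percent gives a value that can be compared with the GC content field.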
|
| |