Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1321 |
Symbol | |
ID | 4896664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1368489 |
End bp | 1370213 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640111908 |
Product | extracellular solute-binding protein |
Protein accession | YP_001043203 |
Protein GI | 126462089 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00160153 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATACC GTACGACAGC CACCGCGATG GCCCTGTGCC TGATGGCCGT GGCCGCGCCC GCCTGGGCCG ACATGGAGGC CGCGAAGAAG TTTCTCGATG CCGAGATCGG GGATCTCTCC TCGCTCTCGC GCGCCGAGCA GGAAGCCGAG ATGCAATGGT TCATCGACGC GGCCCAGCCC TTCGCCGGGA TGGAGATCAA GGTCGTCTCC GAGACCATCA CCACCCACGA ATATGAGAGC AAGGTCCTCG CCCCCGCCTT CTCCGCCATC ACCGGGATCA AGATCAGCCA CGACCTGATC GGCGAGGGCG ACGTGGTCGA GAAGCTGCAG ACCCAGATGC AGTCGGGCGA GAACATCTAC GACGCCTATA TCAACGACAG CGACCTGATC GGCACCCACT GGCGCTACAA GCAGGCGCGC AGCCTCACCG ACTGGATGGC CAATGCGGGC AAGGACGTCA CCAACCCCGG CCTCGACCTG CAGGATTACA TCGGGCTGAA ATTCACCACC GCCCCCGACG GCGAGCTCTA CCAGCTGCCC GACCAGCAGT TCGCCAACCT CTACTGGTTC CGCGCCGACT GGTTCGACGA TGCCGACACC AAGGCCGCGT TCAAGGAGAA GTACGGCTAC GAGCTGGGCG TGCCGCTGAA CTGGTCGGCC TATGAGGACA TCGCCGAATT CTTCACCGGC CGCGACATGA GCGCGCTCGG CGGGCCGAAG AGCGCCTTCG GCAGCATGGA TTACGGCAAG AAGGACCCGA GCCTCGGCTG GCGCTACACC GACGCCTGGA TGTCGATGGC GGGCATGGGC GACAAGGGCG ATCCGAACGG GCTGCCCGTG GACGAATGGG GCATCCGGGT GGACGAGAAC TCGCGTCCCG TGGGCTCCTG CGTGGCGCGC GGCGGGGCCA CCAACGATGC GGCCGCGGTC TATGCAATCA CCAAATCGAT CGAATGGCTG CAGAAATACG CCCCGCCGCA GGCGGCCGGC ATGACCTTCT CGGAATCGGG CCCGGTGCCG GCGCAGGGCG AGGTGGCCCA GCAGATATTC TGGTACACCG CTTTCACCGC CGACATGGTC AAGGAGGGCC TGCCGGTGAT GAACGAGGAC GGCACGCCCA AGTGGCGCAT GGCCCCCTCG CCGCACGGCG CCTACTGGTC GGAGGGCACC AAGGTCGGCT ACCAGGACGT GGGTTCCTGG ACGCTGCTGA AATCCACCCC CGACGAGCGG GCCAAGGCCG CCTGGCTCTA TGCGCAGTTC GTCTCCTCGA AGACGGTCGA CGTGAAGAAG AGCCATGTGG GCCTGACCTT CGTGCGCGAA TCTACCATCC AGCACCAGAG CTTCACCGAC CGTGCGCCGA AGCTCGGCGG ACTCGTGGAA TTCTACCGCT CGCCCGCCCG GGTGCAGTGG TCGCCCACGG GCACGAACGT GCCGGACTAT CCCAAGCTCG CCCAGCTCTG GTGGCAGAAC ATCGGCGACG CCATGTCGGG CGCCAAGTCG CCGCAGGAGG CGCTCGACGC GCTCTGCGCC GAGCAGGAGA AGGTGATGGC CCGGCTCGAG CGCGCGGGCG TGCAGGGCGA TCTCGGCCCG AAGCTGAACG AGGAGAAGGA CCCGCAGGAA TGGCTCGACG CCCCCGGAGC CCCGGTGGCC AAGCTCGAGA ACGAGAAGCC ACAGGGCGAG ACCATCTCCT ATGACGAGCT CATCAAGTCC TGGCAGAAGG GCTGA
|
Protein sequence | MRYRTTATAM ALCLMAVAAP AWADMEAAKK FLDAEIGDLS SLSRAEQEAE MQWFIDAAQP FAGMEIKVVS ETITTHEYES KVLAPAFSAI TGIKISHDLI GEGDVVEKLQ TQMQSGENIY DAYINDSDLI GTHWRYKQAR SLTDWMANAG KDVTNPGLDL QDYIGLKFTT APDGELYQLP DQQFANLYWF RADWFDDADT KAAFKEKYGY ELGVPLNWSA YEDIAEFFTG RDMSALGGPK SAFGSMDYGK KDPSLGWRYT DAWMSMAGMG DKGDPNGLPV DEWGIRVDEN SRPVGSCVAR GGATNDAAAV YAITKSIEWL QKYAPPQAAG MTFSESGPVP AQGEVAQQIF WYTAFTADMV KEGLPVMNED GTPKWRMAPS PHGAYWSEGT KVGYQDVGSW TLLKSTPDER AKAAWLYAQF VSSKTVDVKK SHVGLTFVRE STIQHQSFTD RAPKLGGLVE FYRSPARVQW SPTGTNVPDY PKLAQLWWQN IGDAMSGAKS PQEALDALCA EQEKVMARLE RAGVQGDLGP KLNEEKDPQE WLDAPGAPVA KLENEKPQGE TISYDELIKS WQKG
|
| |