Gene Information |
Locus tag | RPB_4664 |
Symbol | |
ID | 3912482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5275273 |
End bp | 5276619 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637886569 |
Product | extracellular solute-binding protein |
Protein accession | YP_488258 |
Protein GI | 86751762 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.593429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCG GGAGTGATCG GCGATGCGGA CGAGCCAGGA TCCTACTGAT ATCCGCGGTT GTTCTGCTTG CAATCGCCGT GGGCAACGCC CCCGCCCAGG CGGCGACCGA GATCGCATGG TGGCACGCGA TGTCCGGCCA ACTCGGCCGG GAGCTCGAAA AGCTCGCCGC GGACTTCAAC ACGTCGCAAT CCGACTACCG TGTGGTGCCC ACCTACAAGG GCAACTACAC CGAGGCGGTG ACTGCCGCGA TTTTCGCCTT CCGCTCGTCG AGCCAGCCGG CGATCGTGCA GGTCAACGAG ATCGCGACCG CCACGATGAT GGCGGCGAAA GGCGCGGTCT ATCCGGTCTA CGAATTGATG CGCGACGAAA AGGAAGCGTT CTCTCCGTCG GACTACCTTC CCGCGGTCGC CGGCTATTAT ACCGATCTGG CCGGCAACAT GCTGTCGTTT CCGTTCAACG CCTCGACCCC GATGCTGTAC TACAACAAGT CGATGTTCAG AAAGGTCGGC CTCGACCCCG AGACGCCGCC GGCGACATGG CCTGACGTCG GCGCCGCGGC GAAGCGGCTG GTCGCCGCCG GGGTGCCGTG CGGACTCACC ACGTCGTGGC CGTCCTGGGT CAATGTCGAG AATTTCTCCG CCTATCACAA CCTCCCGCTC GCGACCCGGG CGAACGGCCT CGGCGGGATG GATGCAGTAC TGGTCTTCAA CAATCCCGTC CTGGTTCGGC ACATCGCCGA ACTGGCGGAA TGGCAGAAGA CCAGGGTATT CGACTATGGT GGCCGCGCCA CCGCCACGGA GCCGCGATTC CAGCGGGGCG ATTGCGGTAT CTTCGTCGGC TCCTCGGCGA CCCGCGCCGA TATCATCGCC AATTCCAAAT TCGAGGTCGG TTACGGCCGG CTGCCGTTCT GGCCGGACGT CGCCGGCGCG CCGCAAAACA CCATTATCGG CGGCGCGACG CTGTGGGTGC TGCGCGGCCG GCCGGCCGAC GAATACAAAG GCGTCGCCAA GTTCTTCGCC TATCTGTCGC GCGCCGACGT GCAGGCCGCC TGGCATCAAA ACACGGGCTA TCTGCCGGTG ACGCGCGCCG CCTACGAACT GACGCGCGCG CAGGGATTCT ACGAACGCAA TCCCGGCACG GCGATCTCGA TCGAGCAGAT GACCCTGAAG CCGCCGACCG ACAATTCGCG CGGATTGCGA CTGGGCTCCT TCGTCCTGAT CCGCGACGTC ATTGACGACG AGCTCGAACA GGCGTTCAGC GGCCGAAAGC CGGCGCAGGC GGCAATGGAT TCCGCGGTCG AGCGCGGCAA CAAGCTGCTG CGTCAGTTCG AACGGACCCA ACCATGA
|
Protein sequence | MAIGSDRRCG RARILLISAV VLLAIAVGNA PAQAATEIAW WHAMSGQLGR ELEKLAADFN TSQSDYRVVP TYKGNYTEAV TAAIFAFRSS SQPAIVQVNE IATATMMAAK GAVYPVYELM RDEKEAFSPS DYLPAVAGYY TDLAGNMLSF PFNASTPMLY YNKSMFRKVG LDPETPPATW PDVGAAAKRL VAAGVPCGLT TSWPSWVNVE NFSAYHNLPL ATRANGLGGM DAVLVFNNPV LVRHIAELAE WQKTRVFDYG GRATATEPRF QRGDCGIFVG SSATRADIIA NSKFEVGYGR LPFWPDVAGA PQNTIIGGAT LWVLRGRPAD EYKGVAKFFA YLSRADVQAA WHQNTGYLPV TRAAYELTRA QGFYERNPGT AISIEQMTLK PPTDNSRGLR LGSFVLIRDV IDDELEQAFS GRKPAQAAMD SAVERGNKLL RQFERTQP
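The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal example, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 5275273
END_BP = 5276619
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1347
    print(expected_protein_length(GENE_LENGTH_BP))  # 448
```

Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.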