Gene Information |
Locus tag | Rpal_2902 |
Symbol | |
ID | 6410571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3167488 |
End bp | 3168516 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642712782 |
Product | ABC nitrate/sulfonate/bicarbonate family transporter, periplasmic ligand binding protein |
Protein accession | YP_001991885 |
Protein GI | 192291280 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
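The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(3167488, 3168516)
print(length)                  # 1029
print(protein_length(length))  # 342
```

Under these assumptions the results agree with the Gene Length (1029 bp) and Protein Length (342 aa) rows.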
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAAT CCAAAAAGGT TGGCGGTCTT TCCCGGCGTA ATTTGCTGAA GACGACCGGC GCGGCGGGGT TGGTCGTCTC GGCCGGTGCG TTGGCTGGAA AGATCTATTC GCCTGCGGTC GCCGCACCCG CGCCCAAGAT CCGCCTGGCC TGGACGGAAG TGGCAGCCTG CCATTCGCCA CTCGGTTTTG GTGTGGCCAA GGGGCTGTAT GCGAAGCACA ATGTCGACGT CGAGCTGTTC TATCAGGGGG CCAGCGGCCA GACCCTGATC CAGGCTCTCG CGACCGGCAA GGCCGATGTC GGAGCGGGAC TGATCGGCGA TTGGCTCAAG CCTCTGGAGC AGGGCTTCGA CGTCAAGCTG TTCGTCGGCT CGCATGGCGG CTGCCAGCGT CTGCTGGCCT CGCCGGCGTC GGGCATCAAG GATATTGCCG GCGTCAAGGG CAAGACGATT GCCAGCTACG ACGTCGTGTC GCCGCCGAAG GTCGCGTTCC AGGTCACGCT CGCGAAGGCT GGCATCGACC CGGAGAACGA CGTCACCTGG AAGGTCGTTC CATTCGATTT GGTCGGTGAG GCCGTCAACC GTGGTGACGC CGACATTGCG GCTCATCTCG ACCCGTGGGC GTTCTCGATC GAGAAAAAGT TCGGGCTCAC CAAGATCGCA GACACCCAGA CAGGCGTGTA CGAAGGGCAT ACCTGTTGCG TGCTGGGCGC CAATGGCGCC TTCCTCAAGG CTAACAAAGA CGCACTCCGC CGGTTGGCGA TTGCGAACAT CGAAGTGCAC GACTACGTCG CGGATCATGC CGACGAAGCC GCGCAATGGT ATTTGGACGC ATTGAAGCCT GCGGGGCTGA CGCATGCGGA ACTGACCGAG ATCCTCGGTT CGTTCGTCCT GCACAATCAC CCGATCGGAC AGCCGTTGGT CGATCAGATC CAGAAGAGTT CGGAGGATTT GAAGCTCGTC AAAGTCCTCG ATTCCAGCAC GGATCCGAAA GCGTTTGCCG AGCGTGTGAC CGTCAATCTG CTGGCTTGA
|
Protein sequence | MAQSKKVGGL SRRNLLKTTG AAGLVVSAGA LAGKIYSPAV AAPAPKIRLA WTEVAACHSP LGFGVAKGLY AKHNVDVELF YQGASGQTLI QALATGKADV GAGLIGDWLK PLEQGFDVKL FVGSHGGCQR LLASPASGIK DIAGVKGKTI ASYDVVSPPK VAFQVTLAKA GIDPENDVTW KVVPFDLVGE AVNRGDADIA AHLDPWAFSI EKKFGLTKIA DTQTGVYEGH TCCVLGANGA FLKANKDALR RLAIANIEVH DYVADHADEA AQWYLDALKP AGLTHAELTE ILGSFVLHNH PIGQPLVDQI QKSSEDLKLV KVLDSSTDPK AFAERVTVNL LA
|
| |
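The gene and protein sequences above are related by translation table 11, and the GC content row can be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequences are pasted with the spaces shown above, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons; this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGGCGCAAT CCAAAAAGGT"))  # MAQSKK
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row once its spaces are removed, and `gc_fraction` on the same input can be compared with the GC content row.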