Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2403 |
Symbol | |
ID | 6410065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2592412 |
End bp | 2593800 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642712282 |
Product | ABC transporter nitrate-binding protein |
Protein accession | YP_001991392 |
Protein GI | 192290787 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0310148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACGT TCGACAATCC GTTCGATCCC AATCGCCGGC TGCACACCAC GGGTTGTAGT TGTGGTCGTC ACGCTACCGA GGCTGAGCAC GCTGCCGAGC AGGCCGCCGC GTTGCAGGGC ACCGTGATGC AGGGCGAAGA GAAGCGGTTC GAAGGCGTCG TCGCGTCCGC GGTGATGCGC GCAATGTTTC CGCAGGATGC CTCGCGGCGC GCCTTTCTGA AGTCGGTCGG CGCTGCCACG GCACTCGCCG CGGTGTCGCA GTTTTTCCCC CTGCAGACCG CAACCGATGT GTTCGCCTCG GGCGGTCCGC TGGAAAAGAC CGACCTCAAG GTCGGTTTCA TTCCGATCAC CTGCGCTACG CCAATCATCA TGGCGCATCC GATGGGCTTC TATGCGAAGT ACGGCCTCAA CGTCGAAGTG ATCAAGACCG CAGGCTGGGC GGTGATCCGC GACAAGACGC TGAACAAGGA ATACGACGCC GCGCATATGC TGTCGCCGAT GCCGCTGGCG ATCACGATGG GCGTCGGCTC CAATCCGATC CCGTACACCA TGCCGGCGGT CGAGAACATC AACGGCCAGG CGATCACCCT GGCGATGAAG CACAAGGATC GCCGCAATCC GAAGGACTGG AAGGGCTTCA AATTCGCCGT TCCATTCGAC TATTCGATGC ACAATTACCT GCTCCGCTAC TATTTGGCCG AGCATGGTCT CGACCCCGAC GTCGACGTCC AGATCCGCGC CGTGCCGCCG CCGGAAATGG TCGCCAACCT GCGTGCCGAC AATATCGACG GCTATCTCGC GCCCGATCCG ATGAACCAGC GTGCGGTCTA TGACGGAGTC GGCTTCATCC ACATCCTCAC CAAGGAAATC TGGGACGGCC ACCCGTGCTG CGCCTTCGCC GCGTCGAAGG AATTCGTCAC CTCGATGCCG AACACCTACG GCGCGCTCCT GAAGTCGATC ATTGAGGCGA CCGCCTACGC GCACAAGCCG GAGAACCGCA AGGAAATCGC CCAGGCGATC TCGCCGGCGA ACTACCTGAA CCAGCCGGCG ATCGTACTCG AACAGATACT CACCGGCACC TATGCGGACG GCCTCGGCAA CGTCGTCAAG CAGCCGAACC GGATCGATTT CGATCCGTTC CCGTGGCAGT CGTTCGCGAT CTGGATCATG ACCCAGATGA AGCGCTGGGG GCAGATCAAG GGCGACGTCG ACTACAAGAC GATCGCCGAG CAGGTCTATC TGGCGACCGA CACGGCGAAG CTGATGAAGG AAGCAGGCCT CGCGGCCCCC GACACCACGT CGCGATCGTT CTCGGTGATG GGCAAGCCGT TCGATGGCTC CAACCCGGAT CAGTATCTCG CCAGCTTCAA GATCAAGAAG GCCTCGTAA
|
Protein sequence | MSTFDNPFDP NRRLHTTGCS CGRHATEAEH AAEQAAALQG TVMQGEEKRF EGVVASAVMR AMFPQDASRR AFLKSVGAAT ALAAVSQFFP LQTATDVFAS GGPLEKTDLK VGFIPITCAT PIIMAHPMGF YAKYGLNVEV IKTAGWAVIR DKTLNKEYDA AHMLSPMPLA ITMGVGSNPI PYTMPAVENI NGQAITLAMK HKDRRNPKDW KGFKFAVPFD YSMHNYLLRY YLAEHGLDPD VDVQIRAVPP PEMVANLRAD NIDGYLAPDP MNQRAVYDGV GFIHILTKEI WDGHPCCAFA ASKEFVTSMP NTYGALLKSI IEATAYAHKP ENRKEIAQAI SPANYLNQPA IVLEQILTGT YADGLGNVVK QPNRIDFDPF PWQSFAIWIM TQMKRWGQIK GDVDYKTIAE QVYLATDTAK LMKEAGLAAP DTTSRSFSVM GKPFDGSNPD QYLASFKIKK AS
|
| |