Gene Information |
Locus tag | Rpal_1941 |
Symbol | |
ID | 6409601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2093716 |
End bp | 2094966 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642711827 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_001990939 |
Protein GI | 192290334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
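
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Rpal_1941.
gene_len, protein_len = cds_lengths(2093716, 2094966)
print(gene_len, protein_len)  # 1251 416
```

Under these assumptions the result agrees with the listed Gene Length (1251 bp) and Protein Length (416 aa). The strand does not affect either length.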
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTCCAG CCCCCTCGAA TGTCCGTTCG TGGTCGATGC GCGCTGTGGT CGCCGCGACT GCGATCGGAT TCGGTGCTTC GTCGGCGCTC GCTGCCGACC CGATCAAGAT CGGCGTGATC GCTGAAGCGC AGGCTATCGC CGGCGCTTCC ATTCCGCAGG CTGCGCAGCT CGCCGCTGAA GAAATCAACG CGAAAGGCGG CATCGACGGC CGCAAGATCG AGATCGTCAG CTACGACAAC CACTCCTCGT CGGCCGATTC CGTGCGAGCG TTCCAGCGCG CCGTGAATGA AGACAAGGTC AACGCGGTGA TCGCCAGCTA CATCTCGGAA GTTGTGCTGG CGCTGATGCC GTGGGCCTCG CGGCTGAAAA CGCCGTTCGT CACGCCGGGC GCGGCGTCCA ACGAGATCAC CAAGGCGATC AACAAGGACT ACGAGAAGAA CAAATACACC TTCCACGGCT ATCTCACCTC CGGCGAGCTT GCCCAGTCGG TGTGCGATGC AGCGAAGGAC CTGTTGGTCG ACGCCCGCCA GATGAAGAGC GCGGTGATCA TGAGCGAGGA CGCGGCCTGG ACCAAGCCGC TCGACGTCGG CTACCAGGAG TGCCTGCCGA AGGTTGGGCT GAAGGTGCTC GACCACATCC GGTTCTCGCC CGACACCACT GATTTCACGC CGATCTTCAA TAAGATCGAA GGCTCAAAGC CGGACGTGAT CATCACCGGC ATCTCGCATG TCGGCGTCCA GCCGACGGTG CAGTGGAAGA ACCAGCAGGT GCCGATCCCG ATGTTCGGCA TCGCCTCCCA GGCGACCAAC GAGACCTTCG GCAAGGACAC CAACAACGCC TCCGACGGCG TGCTGTACCA GGGCGTGTCG GGCCCCGGCG TCGCAGTGAC CTCGAAGTCG GTGCCGTTCG CCGAGAATTT CAAGAAGAAG TACGGCAACT ATCCGTCTTA CGCCGGCTAC ACCGCCTATG ACGAGGTCTA TTACATCGCC GAAGCGGTGA AGCGCGCCGG CTCCACCGAC GGCGAGAAGC TGGTCGAAGC GCTGGAAAAG ACCGACTACG AAGGCACCAT CGGCCGCGTC CAGTTCTACG GCAAGGACGA GCCGTTCACC CACGGCCTGA AATACGGCAA GGGCCTGCTG ACCGGTCTGA TGCTGCAATG GCAGGACGGC AAGCAGGTCG CGGTGTGGCC GCCGGAAGTG GCCAAAGCCA AGATCAAGTT CCCGGCGTTC ATCAAGGCCG CCGCGAACTG A
Protein sequence | MSPAPSNVRS WSMRAVVAAT AIGFGASSAL AADPIKIGVI AEAQAIAGAS IPQAAQLAAE EINAKGGIDG RKIEIVSYDN HSSSADSVRA FQRAVNEDKV NAVIASYISE VVLALMPWAS RLKTPFVTPG AASNEITKAI NKDYEKNKYT FHGYLTSGEL AQSVCDAAKD LLVDARQMKS AVIMSEDAAW TKPLDVGYQE CLPKVGLKVL DHIRFSPDTT DFTPIFNKIE GSKPDVIITG ISHVGVQPTV QWKNQQVPIP MFGIASQATN ETFGKDTNNA SDGVLYQGVS GPGVAVTSKS VPFAENFKKK YGNYPSYAGY TAYDEVYYIA EAVKRAGSTD GEKLVEALEK TDYEGTIGRV QFYGKDEPFT HGLKYGKGLL TGLMLQWQDG KQVAVWPPEV AKAKIKFPAF IKAAAN
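
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence so it can be compared with the protein sequence. It assumes the gene sequence is given in coding orientation (5' to 3', starting at the start codon), which is consistent with it beginning with ATG and ending with TGA. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon table is the amino-acid assignment that NCBI translation table 11 shares with the standard code; table 11 differs only in which codons may act as start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCTCCAG CCCCCTCGAA TGTCCGTTCG"))  # MSPAPSNVRS
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field checks that the two fields agree; `gc_percent` on the same input can be compared with the listed GC content.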