Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1991 |
Symbol | |
ID | 6409651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2152522 |
End bp | 2153679 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642711877 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_001990989 |
Protein GI | 192290384 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.179885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCAAGT TCAAGCTATC CGCCACGGCG ATCGCCGTGG CCCTCGCGTT GCCGGGGCTT TCCGGGGCCG CGCTTGCCGA AACTAACGAA ATCACCATCG GTATCACCGT CACCACCACC GGTCCGGCGG CGGCACTCGG CATTCCGGAG CGCAATGCTC TAGAATTCGT GGCTAAGGAA ATCGGCGGTC ATCCGCTCAA GTTGATCGTG CTCGACGACG GCGGCGATCC CACCGCGGCC ACCACCAATG CGCGGCGTTT CGTCACGGAG TCGAAGGCCG ACGTGATCAT GGGCTCGTCG GTGACGCCGC CAACCGTGGC GGTCTCCAAC GTCGCCAACG AAGCGCAGGT GCCGCACATC GCGTTGGCGC CACTGCCGAT CACGCCGGAG CGCGCCAAGT GGTCGGTGGC GATGCCGCAG CCGATCCCGA TCATGGGCAA GGTGCTCTAC GAGCACATGA AGAAAAACAA CATCAAGACC GTCGGCTACA TCGGCTATTC GGATTCCTAC GGCGATCTGT GGTTCAACGA CCTGAAGAAG CAGGGCGAGG CTATGGGTTT GAAGATCGTC GCCGAAGAGC GCTTCGCGCG GCCGGACACG TCGGTGGCAG GTCAGGTGCT GAAGCTGGTC GCCGCCAATC CGGATGCGAT CCTGGTCGGT GCGTCCGGCA CCGCGGCAGC GCTGCCGCAG ACCAGTCTGC GCGAGCGCGG TTACAAGGGC CTGATCTATC AGACCCATGG CGCCGCCTCG ATGGACTTCA TCCGTATCGC CGGCAAGTCG GCCGAGGGCG TGCTGATGGC GTCGGGCCCG GTGATGGATC CGGAAGGTCA GGACGACAGC GCGTTGACCA AGAAGCCTGG CCTCGAACTC AACACCGCCT ATGAAGCCAA GTACGGCCCG AACAGCCGCA GCCAGTTCGC CGCGCATTCG TTCGACGCCT TCAAGGTGCT GGAGCGGGTG GTGCCGGTGG CGCTGAAGAC CGCCAAGCCG GGCACGCAGG AATTCCGCGA GGCGATCCGC AAGGCGCTGG TCAGCGAAAA GGACATCGCG GCGAGCCAGG GCGTCTACAG CTTCACTGAA ACCGATCGCT ACGGCCTCGA CGACCGTTCG CGCATCCTGC TGACGGTGAA GGATGGCAAA TACGTGATGG TGAAGTAA
|
Protein sequence | MPKFKLSATA IAVALALPGL SGAALAETNE ITIGITVTTT GPAAALGIPE RNALEFVAKE IGGHPLKLIV LDDGGDPTAA TTNARRFVTE SKADVIMGSS VTPPTVAVSN VANEAQVPHI ALAPLPITPE RAKWSVAMPQ PIPIMGKVLY EHMKKNNIKT VGYIGYSDSY GDLWFNDLKK QGEAMGLKIV AEERFARPDT SVAGQVLKLV AANPDAILVG ASGTAAALPQ TSLRERGYKG LIYQTHGAAS MDFIRIAGKS AEGVLMASGP VMDPEGQDDS ALTKKPGLEL NTAYEAKYGP NSRSQFAAHS FDAFKVLERV VPVALKTAKP GTQEFREAIR KALVSEKDIA ASQGVYSFTE TDRYGLDDRS RILLTVKDGK YVMVK
|
| |