Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5300 |
Symbol | |
ID | 6413001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5719391 |
End bp | 5720584 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642715189 |
Product | putative periplasmic substrate binding protein |
Protein accession | YP_001994261 |
Protein GI | 192293656 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.401613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA AACTTCTTTC CGCCGCGCTT GCGGCTACTG CCCTCGTTGC CATCACGCCG GCTTCGGCGC AGGGCGTCAA GATCGGCATC CTCAACGACC AGTCCGGCGT GTACGCGGAC TACGGCGGCA AGTGGTCGCT CGAAGCCGCC AAGATGGCGG TGGAGGATTT CGGCGGCGAA GTGCTGGGGC AGAAGATCGA ACTCATCTCG GCCGATCATC AGAACAAGCC CGACAACGCG GTCGCGATCG CGCGCAAGTG GTACGACGTC GACGGCGTCG ACATGATTAC CGAACTCACC ACCTCGTCGG TGGCGCTGGC GATCCAGGAT CTGTCGAAGG AAAAGAAGAA GATCGATATC GTGGTCGGCG CCGCGACCTC GCGGATCACC GGCGATGCCT GCCAGCCTTA CGGGTTCCAT TGGGCGTTCG ATACTCATGC CCTCGCGGTC GGAACCGGCG GCGCCCTGGT GAAGGCCGGC GGCGACACCT GGTTCTTCCT GACCGCGGAC TACGCGTTCG GCTACGCGCT GGAGAAGGAC ACCAGTGAGG TCGTGCTAGC CAGCGGCGGC AAGGTGCTCG GCTCGGTACG GCATCCGCTG AATTCGTCGG ACTTCTCCTC GTTCCTGCTG CAGGCGCAGG CCTCGAAGGC CAAGGTCGTC GGCCTTGCCA ATGCCGGCCT CGACACCGCC AACTCGATCA AGCAGGCCTC TGAATTCGGC ATCGTCGCCG GCGGCCAGAA GCTTGCGGGC CTCTTGATGA CGCTGGCCGA AGTGAACGGT CTCGGCCTCA AGGCGGCGCA GGGCTTGGTG CTGACCGAAG CCTATTATTG GGATCGCGAC GACAAATCGC GTGAGTTCGC CGAGCGTTTC TTCAAGCGCA CCAGCCGGAT GCCGAGCATG ATCCAGGCCG GGACCTATTC GGCGACGCTG TCCTATTTGA AGGCCGTCAA GGCCGCCGGA ACCAAGGACA CCGATGCGGT GGCCAAGAAG CTGAAGGAGC TGCCAGTCGA CGACGCGTTT GCATCCGGCA AGGTGCTGGC CAACGGCCGC TTCGTCCACG ACATGTATCT GTTCGAGGTG AAGAAGCCGG AAGAATCGAA GAAGCCGTGG GATTACTACA AGCTGCTCGC CACCGTGCCG GGCGACAAGG CGTTCCCGAC TGCGGCAGAG AGCGGCTGCC CGCTGACCAA GTAA
|
Protein sequence | MKLKLLSAAL AATALVAITP ASAQGVKIGI LNDQSGVYAD YGGKWSLEAA KMAVEDFGGE VLGQKIELIS ADHQNKPDNA VAIARKWYDV DGVDMITELT TSSVALAIQD LSKEKKKIDI VVGAATSRIT GDACQPYGFH WAFDTHALAV GTGGALVKAG GDTWFFLTAD YAFGYALEKD TSEVVLASGG KVLGSVRHPL NSSDFSSFLL QAQASKAKVV GLANAGLDTA NSIKQASEFG IVAGGQKLAG LLMTLAEVNG LGLKAAQGLV LTEAYYWDRD DKSREFAERF FKRTSRMPSM IQAGTYSATL SYLKAVKAAG TKDTDAVAKK LKELPVDDAF ASGKVLANGR FVHDMYLFEV KKPEESKKPW DYYKLLATVP GDKAFPTAAE SGCPLTK
|
| |