Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4213 |
Symbol | |
ID | 6411897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4517121 |
End bp | 4518986 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714095 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001993184 |
Protein GI | 192292579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.238618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAGC TTAACCGCCG CAACGTGCTC GCTCTCGGCG TTGGCGCGCT GGCTGCCACG CATCTCCGGG GCACCGCCGC CGCGGCCGAA GGCGAGACGA TCGCCCACGG CATGTCGGCT TTCGGTGACC TGAAGTACCC GGCCGACTTC GCGCATTTCG ACTATGTCGA TCCGCGAACT CCAAAAGGCG GACTGTTCTC CACGATCCCG TCGGTTCGGG CGTTCAACCA GTCGTTTCAG ACGTTCAACT CGCTCAACGC CTATATTCTC AAAGGGGACG GCGCCCAGGG CATGGGGCTG ACCTTCGCGA CGCTGATGGC GCGTGCCGGC GACGAGCCCG ACGCGATGTA CGGCTTCGCG GCCTCCAAAG TGGCGATCTC TGCCGACGGC CTGGCCTATC GCTTCACGAT GCGGCCGGAA GCACGTTTCC ACGACGGCAG CAAACTGACG GCGCGCGACG CCGCTTTCTC GCTGAACATC TTGAAGGCGA AGGGCCACCC GATCGTCACG CAACAAATGC GCGACTTCAT CGAGGCTGTG GCGACCGATG ACGCGACGCT GGTGGTGACC TTCAAGCCGA AGCGCGGCCG CGACGTGCCG CTGTTCGTCG CCGGCCTGCC GCTGTTCTCG GAAACTTACT ATTCGAAACA GCCGTTCGAT GAATCCACCA TGGATGTGCC GCTCGGGAGC GGGCCCTACA AGGTCGGACG GCTCGAATCC GGTCGCTACA TCGAGTTCGA TCGGGTCAAG GATTGGTGGG GCGCGAAGCT GCCGGTGAAT GTCGGGGCTT ACAATTTCGA CATCGTTCGG TTCGAGTTCT ATCGCGATCG CGACGTTGCG TTCGAAGGCT TCACCGGGCG CAGCTATCTG TTTCGCGAGG AGTTCACCTC GCGGATCTGG AACACCCGCT ACGATTTCCC CGCGATCCAT GACGGCCGCG TCAAGCGCGA GATCCTGCCG GACGACACCC CGTCGGGCGC ACAGGGCTGG TTCATCAACA CCCGCCGCGA CAAGTTCAAG GATCCGCGCG TCCGCGAGGC GCTCGGCTGC GCGTTCGATT TCGAGTGGAC CAACAAGACC ATCATGTACG GCACCTATGC GCGCACGGTG TCGCCATTCC AGAATTCCGA CATGATGGCG GTGGGCGCGC CGTCGCCCGA AGAGTTGGCG CTGCTCGAAC CGTTCCGCGG CAAGGTGCCC GACGAAGTGT TCGGGACACC GTTCATACCG CCCGCATCTG ACGGCTCTGG ACAGGACCGG GCGCTGCTGC GCCGGGGCGG GCAGCTGTTG AACGAGGCTG GCTTTCCGAT CAAGAACGGC AAACGTCTGA CGCCTCAGGG GGAGCCGTTC CGGGTCGAAT TCCTGCTCGA AGAGCCGGCA TTCCAGCCGC ACCATATGCC GTTCATCAAG AACCTCGGCA CGCTCGGCAT CGACGCCACG TTGAGGCTGG TCGATCCGGT GCAACTGCGG GCGCGCCGTG ACGATTTTGA TTTCGATCTG ACGATCGAGC GCTACAGCTT TTCGACCGTG CCGGGCGACG CGCTGCGCAA CTTCTTCTCG TCGCAGGCGG CAGCCACCAA GGGCTCGAAC AATCTCGCCG GCATTTCCGA TCCGGCCATC GACGCGATGA TCGATCAGGT GATCGCGGCC GACACCCGCA CCAAACTGGT TGTTGCGGCG CGCGCGCTTG ATCGACTGAT CCGGGCTGGC CGTTATTGGG TGCCGCAATG GTACTCGGCC TCGCACCGGC TGGCCTATTG GGACGTGTTC TCCCATCCGC CGAGTCTGCC GAAATACGCC GGCGTCGGCG TGCCGGAGCT GTGGTGGGCG ACCGCCCCTG CGGCACCTGC CGGCCAAGGG AAATAG
|
Protein sequence | MAQLNRRNVL ALGVGALAAT HLRGTAAAAE GETIAHGMSA FGDLKYPADF AHFDYVDPRT PKGGLFSTIP SVRAFNQSFQ TFNSLNAYIL KGDGAQGMGL TFATLMARAG DEPDAMYGFA ASKVAISADG LAYRFTMRPE ARFHDGSKLT ARDAAFSLNI LKAKGHPIVT QQMRDFIEAV ATDDATLVVT FKPKRGRDVP LFVAGLPLFS ETYYSKQPFD ESTMDVPLGS GPYKVGRLES GRYIEFDRVK DWWGAKLPVN VGAYNFDIVR FEFYRDRDVA FEGFTGRSYL FREEFTSRIW NTRYDFPAIH DGRVKREILP DDTPSGAQGW FINTRRDKFK DPRVREALGC AFDFEWTNKT IMYGTYARTV SPFQNSDMMA VGAPSPEELA LLEPFRGKVP DEVFGTPFIP PASDGSGQDR ALLRRGGQLL NEAGFPIKNG KRLTPQGEPF RVEFLLEEPA FQPHHMPFIK NLGTLGIDAT LRLVDPVQLR ARRDDFDFDL TIERYSFSTV PGDALRNFFS SQAAATKGSN NLAGISDPAI DAMIDQVIAA DTRTKLVVAA RALDRLIRAG RYWVPQWYSA SHRLAYWDVF SHPPSLPKYA GVGVPELWWA TAPAAPAGQG K
|
| |