Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2695 |
Symbol | |
ID | 3910488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3079239 |
End bp | 3081374 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637884595 |
Product | TonB-dependent receptor |
Protein accession | YP_486308 |
Protein GI | 86749812 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.316002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGATC GGACAACCGC GAGCCGGCAC GCGCTCTCGA TGGCCGGCGG ATGGGCTGCG GCGGTCGGAA TGGGCGGCGC GAGCGCGCTC ACGATTCTGT CGGTGGTCGC CGCGACGCCC GCATTCGGAC AGACCGCCAC CGCCTCGCCC ACGGCCGAGC GCGCGATCTC GCTCGATGTC ATCACGGTCA CCGGCGAGAA GATCGAGCGC GACCAGAAGA ACACGGCGTC GTCGGTTTCG GTGATCACCG GCAAGGACAT CGAAAAGGAA AAGACCGGCG ACGCCACGGT CAGCGAAGTG GTCGCCGGGG TGCCCAACGT GGTCTACACC GACACGGTCA GCGCACCGGT GATCCGCGGC CAGGATACGC AGGGTCCCCT CACCGGCCAG TATTCGTTCT GGGGTGGCAC CGTTCCGCGC GCGACCATCA ATCTCGACGG GCACTATCTC AACTACAGCG AGTACTATTT CGGCGCCACC TCCGTCTGGG ACGTGAAGAG CGTCGAGGTC TTCCGCGGGC CGCAGACCAC GTCGCAGGGC GCCAACGCCA TCGCCGGCGC CATCATCGTC AATACCAAGG ATCCGACCTT CAAGCCGGAA GGCGCCTATC AGGCGGAGAT CGGCAGCTAC AACACGCGGC GTACTTCGGT GATGGTGTCG GGGCCGTTGG TGGAAAACCA GTTGGCTGCG CGCCTCGCCA TCGACTACTC CGCCCGTGAC ACCTTCATCG ACTACGTCAG TCCGAATTTC CAGAAGGGGC AAACGGACCA GGACTTCCGG GCGCTCAACG CGCGGTTCAA GCTGCTGTGG CTGCCGACGG AGATCTCCGG CCTGGAGGCG AAGCTCACTT ACGCGCACAA TGCCAGCAAC CGGCCGGGCC AGGAAGCGGC GTCCGCGCCC TACGAGAACC TGCAGCACGT CACGACGACG ATGCCGACCT GGAATCAGAC CACCGACACC GGAATCTTCG ATCTCAAATA CGACTTCTAC AACGGCTTCA AGCTGTTCAA TCAGACGCAA TACACCAACT CGTCGGTTCA CCGCGTTGCT CCCGTCGCCT CGAACGGCAA TGCCGACATC GAACAGCGCA ATGTTTCGAA CGAGACGCGC GTCACCTTCG GCGACCAGAG CACCGTCCTC AGCGGACTTG CCGGCGTCTA TTACGGCTAC ACCAAGACCG ACGAAGTGCT TTATCTCAGC GGGCTTTCCA CCTTCGACGA TCGCAAGAGC AATCTCGGCG TGTTCGGTGA ACTGACCTAC AAGCTGACGG ATCGCTGGAC GTTGAACGGC GGCTTGCGCA CCCAGCACGA TCATATCCAG CGTCTCGGCG TTTCGGTCTA CGCCCCCGGC CCTGTCGACT ACGACAACAC CTTCACGGTA TTGTTGCCGA AAATGTCGCT CGCTTATGCG CTGACGCCGG GCTGGACCGT CGGCGCGCTG GTCAGCCGCG GCTACAATCC CGGCGGCGTC TCGCTCAACC TCAACTCCCG GCAATGGATG TCGTTCGAGG ACGAGTCGCT GTGGAACTAC GAGCTGTTCA CGCGCGCGAA CCTGCTGAAC GACACGCTCA CGTTGAACAG CAACGTGTTC TACATGGATT TCAAGAACGC CCAGTACAAC ATCCCGGTCG TCATCTCGAC CGGGGTCGCG CAGACCTACG TCATCAACGC GGAGAAGGCG CACGCCTACG GCCTGGAGGT CGGCATCGAC TATCGGGTGC TGCAAGGCCT GACGCTCAAG GCGGGAGCCG GCGTGCTGCA GACCAAGATC GACCAGATCT CCAGCAACGT CAGCTACGAG GGCAATGAAT TCGCGAAATC GCCTGGCTAC ATGCTCAGTT TCGGCGCCAG CTGGGACGCG ACCGACAAGC TCAACGTGTC CGGCGAGGTG CGCCATCTCG ACAGATACTA TTCCGACGTC GCGAACACCG TAAAATACTC GATTGATCCC TATACCATCG CCGATGTGCG CGTCAGCTAT CAGTTTCACC AGCTCGCGCA GGTCTACGGC TACGTCAAGA ACGTCTTCGA CGAGCGCGCT CCGACCTATA TGCAGGAGAA CCGCGGCATC GGCGGCACCG AGGCCAGCAT GACTGCGCCC CGAATGTTCG GCATCGGCCT GCGGGGGACG TTTTGA
|
Protein sequence | MQDRTTASRH ALSMAGGWAA AVGMGGASAL TILSVVAATP AFGQTATASP TAERAISLDV ITVTGEKIER DQKNTASSVS VITGKDIEKE KTGDATVSEV VAGVPNVVYT DTVSAPVIRG QDTQGPLTGQ YSFWGGTVPR ATINLDGHYL NYSEYYFGAT SVWDVKSVEV FRGPQTTSQG ANAIAGAIIV NTKDPTFKPE GAYQAEIGSY NTRRTSVMVS GPLVENQLAA RLAIDYSARD TFIDYVSPNF QKGQTDQDFR ALNARFKLLW LPTEISGLEA KLTYAHNASN RPGQEAASAP YENLQHVTTT MPTWNQTTDT GIFDLKYDFY NGFKLFNQTQ YTNSSVHRVA PVASNGNADI EQRNVSNETR VTFGDQSTVL SGLAGVYYGY TKTDEVLYLS GLSTFDDRKS NLGVFGELTY KLTDRWTLNG GLRTQHDHIQ RLGVSVYAPG PVDYDNTFTV LLPKMSLAYA LTPGWTVGAL VSRGYNPGGV SLNLNSRQWM SFEDESLWNY ELFTRANLLN DTLTLNSNVF YMDFKNAQYN IPVVISTGVA QTYVINAEKA HAYGLEVGID YRVLQGLTLK AGAGVLQTKI DQISSNVSYE GNEFAKSPGY MLSFGASWDA TDKLNVSGEV RHLDRYYSDV ANTVKYSIDP YTIADVRVSY QFHQLAQVYG YVKNVFDERA PTYMQENRGI GGTEASMTAP RMFGIGLRGT F
|
| |