Gene Information |
Locus tag | RPB_4403 |
Symbol | |
ID | 3912218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4989077 |
End bp | 4991398 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637886308 |
Product | TonB-dependent siderophore receptor |
Protein accession | YP_488000 |
Protein GI | 86751504 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4774] Outer membrane receptor for monomeric catechols |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor; [TIGR03304] outer membrane insertion C-terminal signal |
|
|
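The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of the record.
START_BP = 4989077
END_BP = 4991398
GENE_LENGTH_BP = 2322
PROTEIN_LENGTH_AA = 773


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 4991398 - 4989077 + 1 = 2322 bp,
# and 2322 / 3 - 1 = 773 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assumption does not hold for a given record (for example 0-based coordinates, or a spliced or partial gene), the checks would need to be adjusted.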
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.109366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT CAGTTGGTCC GGCTGCTCTG TTTGTCTCGA GCCTCAGCGC GCTGTCGCTT TCCGAAAGCG CACTGGCACA GGCGCCGTCT TCGCCGGCCG CCACGACGCT CGCACCGGTC GAAATCATCG CCCCGCAGGT GCGGCCGCGG CCCCCGGGCC GCGTTCGCGC TTCGCAGAAT CCCCGTCGCG GAGCGGCGAC CCGGCCACGC CCGCGTGAGG TTGCAACCCC GACGTCCGCT CCTCCCGTGC CCGCCGTCCC CCCGCAGACC GCAACCGTCG GGCAGCCCCC GGTGCCCTAT GTGGGCGGAC AGGTCGGAAC CGGCGCGCGG CTTGGTTTCC TGGGAAACAC CTCCGTCTTC ACGGCGCCGT TCAGCGTCAC GGGGTACACA TCGAAGCTCA TGGAGGATCA GCAGGCGCGC AGTGTCGCGG ACGTCGTCCT CAACGATCCC TCGGTGCGTA ACGACGCGCC GCCGTTCAGC GAACGCGACT CGTTCTTTAT CCGCGGTTTT TCGGTGACCA ATCTCGACAC CGCCTATGAC GGGCTGTTCT ACCTCGCGAA TCCGCGCCGC GCCTTCCTCG AAGGAATCGA GCGTGTCGAA ATCCTCAAGG GCCCGAGCGC ATTGCTCAGC GGCGGCACCG GGCGCGTCGG CGGAACCATC AATCTGATTC CGAAGCGCGC CACCGACGAA CCGCTGACGC GGCTGACGAC CAGCTACACC AGCAACTCGC AAATCTGGAA CCACCTCGAT CTCGGTCGTC GCTTCGGAGA CAACAAGGAG TGGGGCGTCC GCTTCAACGG CTCCTACCGC AACGGCGACA CGCCGCTGGA TCTCAATTCG GCCGAGGTCG GTGTCGCCGC CCTCGGTCTC GACTATCGCA GCGAACGCTT CCGGGCGTCG CTGGACCTCA ACGGCTCGAT TCAGAACATC ACGGCACCGA CGTCGCTGTT CAATTCCGCG GCCGCGAACA TCGTCGTCCC ACCCGCGCCG AACGGCCGCA TCAATACGTC GAGCCGCGAC GAATTCATCG ACAGCCGCTA CAAGATGATC GCCGGACGCG CCGAATACGA TCTCTTGCCG GACACCACCA TGTACCTGGC CGGCGGTGGC AGCCAATACA ACGAGGACTT CCTCACGTCG TCCTACCGAA TCACCAATTC GAACGGCACG GCCACCAACA CGCTCGCGGT TCAGCCCCAG AAGCTCGAAG GATACACCGG CGAGATCGGC GTGCGTTCGA AATTCCGGAC CGGCGTCGTC GGTCATCAGT TGAACGTCTC GGCGGTCGAA GCGAACAACG AGCTCTACCG CGGCGGCACT CTGGGCTTCA CCTCCTTCAG CTACGTGACC AACATCTACG ATCCGGTCCG CCTGCCGCAG GGCAGGTTCC AGACCAGCGG TTTCGCCACA TCCGACGACA GGCCCTTGCT GTCGCGGCTC ACCGCTCGCA GCGCCGCGAT ATCCGACACG CTGTCGCTGC TCGACGACCG GCTGCTTGTG ACGCTCGGCG GTCGCTGGCA GGACATCCTG CTGCGGGGAT TTGTGACGGC GTCGGGCCCC ACCCTCGGCA CGGAATCGTC GCGCTATCAG GAGGCTCGTT TCAGCCCGGC CGTGGGCGCG GTGATCCGCG CGACCGATCA GCTGTCGTTC TACGGAAACT ACATCGAGTC GCTCGAATCG GGACCGACGG CGCCGGCCCT CGCGAACAAT CGCAATACGG TGTTTCCGCC GGTGGTCAGC AAGCAGCAGG AGGTCGGCGC CAAATACGAT CTCGGAATCG TCGGGCTGAC GGCGTCGCTG 
TTCCAGATCG AACAGCCGAA CGCCTTCACC GATCCGACCA CCAACATCTT CTCCGTCAGC GGTCTGCAGC GCAACCGCGG CATCGAGCTG AGCGTTTTCG GCGAACCGGT CAAGGGCGTC CGTCTGCTCG GCGGCGTCAC CCTCATGGAC GCCAAGCTCG TTTCCACGAT CGGCGGCCGC TACGACGGCA ACGACGCGCC CGGCGTTCCG GTCACCGCGC TGAACCTCTA TGGCGAATAC GATCTGCCCC ATTGGCTGGC GCAGGGCGTG ACGGTGACCG GCCGGGCGAT CTACACCGGC GACGTGTTCT ACGATCAGGC GAACACGCAG ACCGTCTCCG ACTGGACACG GTTCGACATC GGCGCGCGGT ATGCGTTCAC GGGGCCTTCG GGCAAGCCGG CCGTGCTGCG GGCCACCATC GAGAACGTGG CCGATACGGC CTACTATCTC TCCGCCGCGC GTGGCTATCT CGCAGTCGGT GCACCGAGGA CCTACATGGT GTCGGCGACG TTCAATTTCT GA
|
Protein sequence | MKKSVGPAAL FVSSLSALSL SESALAQAPS SPAATTLAPV EIIAPQVRPR PPGRVRASQN PRRGAATRPR PREVATPTSA PPVPAVPPQT ATVGQPPVPY VGGQVGTGAR LGFLGNTSVF TAPFSVTGYT SKLMEDQQAR SVADVVLNDP SVRNDAPPFS ERDSFFIRGF SVTNLDTAYD GLFYLANPRR AFLEGIERVE ILKGPSALLS GGTGRVGGTI NLIPKRATDE PLTRLTTSYT SNSQIWNHLD LGRRFGDNKE WGVRFNGSYR NGDTPLDLNS AEVGVAALGL DYRSERFRAS LDLNGSIQNI TAPTSLFNSA AANIVVPPAP NGRINTSSRD EFIDSRYKMI AGRAEYDLLP DTTMYLAGGG SQYNEDFLTS SYRITNSNGT ATNTLAVQPQ KLEGYTGEIG VRSKFRTGVV GHQLNVSAVE ANNELYRGGT LGFTSFSYVT NIYDPVRLPQ GRFQTSGFAT SDDRPLLSRL TARSAAISDT LSLLDDRLLV TLGGRWQDIL LRGFVTASGP TLGTESSRYQ EARFSPAVGA VIRATDQLSF YGNYIESLES GPTAPALANN RNTVFPPVVS KQQEVGAKYD LGIVGLTASL FQIEQPNAFT DPTTNIFSVS GLQRNRGIEL SVFGEPVKGV RLLGGVTLMD AKLVSTIGGR YDGNDAPGVP VTALNLYGEY DLPHWLAQGV TVTGRAIYTG DVFYDQANTQ TVSDWTRFDI GARYAFTGPS GKPAVLRATI ENVADTAYYL SAARGYLAVG APRTYMVSAT FNF
|
| |
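The sequence fields above are printed in space-separated blocks of 10 characters, so they need to be cleaned before fields such as GC content or gene length can be recomputed from them. The following is a minimal sketch of that step. The helper names are hypothetical, and the set of start codons is an assumption (three initiators commonly seen in bacterial genes), not the full list allowed by translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumption: common bacterial start codons only
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the sequence has a start and a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Toy input in the same blocked layout as the record (not the real gene).
toy = clean_sequence("ATGGCC TGA")
toy_gc = gc_percent(toy)
toy_ok = looks_like_complete_cds(toy)
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field, and `len()` of the cleaned string can be compared against Gene Length.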