Gene Information |
Locus tag | RPD_2354 |
Symbol | |
ID | 4022843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2628950 |
End bp | 2631286 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962547 |
Product | TonB-dependent haem/haemoglobin receptor |
Protein accession | YP_569487 |
Protein GI | 91976828 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein; [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein |
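
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention, although its values are consistent with both. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values copied from the record for RPD_2354.
start_bp, end_bp = 2628950, 2631286

length = gene_length_bp(start_bp, end_bp)
print(length)                     # compare with the "Gene Length" field
print(protein_length_aa(length))  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (2337 bp) and Protein Length (778 aa) fields of the record.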
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.548378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGGGC TGAACTCGCG CATCTGTGCG CTGTTGCTAT CCGTATCCGT TATCGCGCTG GCGGCGTCGC CGAGCGCGGC GCAAACCGCG GTGCTGTCGC CGCAATCCAA AAAGCAGAAG CCGGTCACGC TCGACCAAGT AGCAAAGCCG GCGCTACAGA TGCCGGCGAC TGACCCGCTC GACGCCTATG CGCAAGCTGG TCCATCGACC CAGTCGCTCG ATGCGATCAC CGTGGTTTCC ACCAAGAACG AAGAGCGCGC GATCGACGCG CTGGCGCCGG CGAGCGCGAT CACTGTCGAC CAGATCCAGC GGCTTCAGCC GAACCGGCTG CAGGACATCT TCGTCGCCAC GCCGGGCGTA TCGTTCCAGG ATCGCGGCGA CGATCCGTCG ACCGCGATCA ACATCCGCGG TCTGCAGGAT TTCGGCCGGG TCGGCGTCGT GGTCGACGGT GCGCGGCAGA ACTATCAGCG CTCCGGCCAC AATGCGCAGG GCTCGCTCTT CCTCGATCCG GAAATGATCG GCGGCGTCGA TGTCGTGCGC GGTCCGAGCG CCAACATCTA CGGCTCGGGC GCGATCGGTG GCGTGGTCTC GTTTCGGACC AAGGACATCG ACGACGTATT GCGGGCCGGC GAACGCTGGG GCGTCGACAT GACGGGCTCC TATGGCAGCA ACAATTCCCG TGGGCTCGGC TCGGTGTTCG GCGGCATCCG GGTCGATCCG ACCGTCGACG TGTTCGGCGG CGCGCTGTAC CGCACGCAGG GCAACTACAA GGACGGCGCC GGCACCGAGA TCGGCAATAC CGGCAACGAC CTCGCCGGCG GGTTGCTCAA GCTCACGGTG CGGCCGGCCG AGGGCCACGA GGTCAAGATA GGCGGCCTGT TCCAGGACTA CAATTACAAC ATCGGCCAGT TCAACCGCGG ACCGGTGCTG ACCGCGGCGC AGCGCGCACT TTACCAAGGC TCGTCGGTCT ACGACTCCAA CGTGCGGAAT TCGACCGGAA CGTTGAGCTG GAAGTACTCA CGGCCGGACG ACATGCTGTT CGACTGGAAC ATCAGCCTGT ACGGCAACCG CACCGACAAC GACCAGACCA AGACCTATCA CAATTCCACC AGCGGTTCGG CCTATTGCGG CACCGGAAAC TACGGCAACA ACATCTCGGG TTGCATCGGC GACAAGCGCG GCTACCGGCT CGATACGATC GGCATCGACG CCAACAACAC CACGCGGTTC GACTACGGCG ACTGGCGCAC CGCGGTGACC TACGGCTTCG ACGCCTTCAA CGACAAGGTC ACGACCTCGG ACTCGCGAGG CAACTCCAAC ATCACGACCC CGAGCGGCGA GCGCACGGTG TCCGGCGGAT TCGTCCAGCT CAAGAACAAC TATGCGAGCT GGCTCGAGGT GATCAGCGCA GCGCGGTTCG ATCATTACGA GCTCAATTCG CAGACCAACT CCGCGAGCGG CAGCCGGCTG TCGCCCAAGA TCACGGTCGG CGTTACCCCG CTGGCGGGCC TCACGCCCTA TCTCAGCTAT GCCGAAGGCT ACCGCGCGCC GTCGATCACC GAGACGCTGA TCGCGGGCTC GCACGCCACC GGCGGCGGGC CGGCGCTGTT CGCCTGTGCG GACGGCGCGA CGGGTCTGTT CTGCCTGATC CCGAACACCG GGCTGCGGCC CGAAGTCGGA AAGAACAAGG AAGTCGGCAT CAACCTCAAA TACAACGACG TGTTCATCGC GGGTGACAGC TTCCGGGGCA AGATCAACGC CTTCCGCAAT GATATCGACA ACTACATCGA CCTGGTCGGG 
TCGCCGCCGC AGGCGTCGCG GCTGGGGGCT GCTTACGGCC TCTATAGCAA GAATTACCAG TACCAGAATA TCCCGCACGC GCGGATCGAC GGCGTCGAAC TCGAGACGTC CTACGATGCC GGGCTATGGT TCGTCGGCGT CAGCGCTTCT GCGCTGCGCG GCACCAACCC CGATACCGGA ATCGGTCTCG CCGCGGTTCC GTCGCGAAAG GTCGTCACCT CGGGCGGCGT CCGCTTGCTC GACCGTCAAT TGACGATCGC GGCGCAATGG GCATCTTATG CGGGCAACTC CAATCTTCCG ACCGGCTATC TGCCGGCGAC ATCCTATGAT CTGGTGAATC TCAACGTGTC GTACCGGCCG ACGTCGGACG TCACCGTGAA CTTCTCGATC GATAATCTGC TGAACAATTA CTATCGTCCC TATGCGATCC CGGGATCGTC GTCGGACGGA ACCACGCAGA ACGACGTACT GTTCAGCAGT CCCGGGCCGG GCATCGTGTA CAAGGGCGGG ATCAAGGTGC ACTTCGGAGG TGCATAG
|
Protein sequence | MVGLNSRICA LLLSVSVIAL AASPSAAQTA VLSPQSKKQK PVTLDQVAKP ALQMPATDPL DAYAQAGPST QSLDAITVVS TKNEERAIDA LAPASAITVD QIQRLQPNRL QDIFVATPGV SFQDRGDDPS TAINIRGLQD FGRVGVVVDG ARQNYQRSGH NAQGSLFLDP EMIGGVDVVR GPSANIYGSG AIGGVVSFRT KDIDDVLRAG ERWGVDMTGS YGSNNSRGLG SVFGGIRVDP TVDVFGGALY RTQGNYKDGA GTEIGNTGND LAGGLLKLTV RPAEGHEVKI GGLFQDYNYN IGQFNRGPVL TAAQRALYQG SSVYDSNVRN STGTLSWKYS RPDDMLFDWN ISLYGNRTDN DQTKTYHNST SGSAYCGTGN YGNNISGCIG DKRGYRLDTI GIDANNTTRF DYGDWRTAVT YGFDAFNDKV TTSDSRGNSN ITTPSGERTV SGGFVQLKNN YASWLEVISA ARFDHYELNS QTNSASGSRL SPKITVGVTP LAGLTPYLSY AEGYRAPSIT ETLIAGSHAT GGGPALFACA DGATGLFCLI PNTGLRPEVG KNKEVGINLK YNDVFIAGDS FRGKINAFRN DIDNYIDLVG SPPQASRLGA AYGLYSKNYQ YQNIPHARID GVELETSYDA GLWFVGVSAS ALRGTNPDTG IGLAAVPSRK VVTSGGVRLL DRQLTIAAQW ASYAGNSNLP TGYLPATSYD LVNLNVSYRP TSDVTVNFSI DNLLNNYYRP YAIPGSSSDG TTQNDVLFSS PGPGIVYKGG IKVHFGGA