Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4046 |
Symbol | |
ID | 4024563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4495439 |
End bp | 4497808 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637964249 |
Product | TonB-dependent receptor |
Protein accession | YP_571166 |
Protein GI | 91978507 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.236751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCC GCTTCAAGCG CGCCTTCCTG GTCGGAAGCG CAGCCTTCGC GGCCTCCGAA GGCTTGCCGG CATCGCAGGC CCTCGCACAG CAGGCCGCCA CCGCATTGCC GGAAGTCGTC GTCACCGCGC CGAGCCCGAT CGTGCGACGC CATTCGGCCC CGCCGTCACG ACCCGCGACG CGGGTCGCCG CCCCCGCGCG CCAGCGCGGC ACCGCTCCGG CCGAGGCGCC GCCCGTCGTC GCCCAGCCGG CGCCGCCGCC CCCCGGCGCG CTGCCGATCT TCACCGATCA GTTCGCGACC GTTACCGTGG TCCCGAACGA GGAGATCCGG CGCAATGGCG GCGGCACGCT CGGCGACCTT CTGAACAACA AGCCGGGCAT TACCGGCTCC GGCTACGCGC CGGGGGCCTC CAGCCGGCCG ATCATCCGCG GCCTCGACGT CAACCGAGTC GGCATCGTCG AGAACGGCAT CGGCAGCAAC GGCGCGTCCG ATCTCGGCGA AGACCATTTC GTCCCGATCG ATCCGCTCGC GACCAACCAG GTCGAAGTGA TCCGCGGCCC GGCGACGCTG CGCTACGGCT CGACGGCGAT CGGCGGCGTG GTCAGCGCCA GCAACAACCG CATCCCCGAC GCGCTGCCGC CCTGCGCGAC GCCGTTCCAG AGCTACGGCC TGCCGGTGAA GGCGCCGGCT GCGATCGGCG GCGCGGCGGG CTGCATGAAC GCCGAGGTCC GCAGTGCGAT GAGCTCGGTC GATCGCGGCG TCGAAAGCGC GGTGCTGCTG GATGCCGGCG GCAACAACGT CGCGGTCCAT GCCGACGTGT TCGGCCGCAA TGCCGGCGAC TATAACGTGC CGAGCTATCC GTATCAGGCG CCGGGACTCC CCTTCAACGG ACGCCAGCCC AACTCCGCGA CGCAGGCGAC CGGCGCCTCG ATCGGCGGCT CCTATCTGTT CGACGGCGGC TTCATCGGCG CGGCGATCAC GCAGAACAAT TCGGTCTATC GGATTCCCGG GACCGAGGGC GCCGAATTCG GGACGCGGAT CGATGCGAAG CAGACCAAGG TCACCGCCAA GGGCGAGTAT CGCCCGGATG CGGCGGCGAT CGAGGCGATC CGGTTCTGGG TCGGCGCCAC CGACTACAAG CACAATGAGA TCGGCCTCGC CGTCGCAGGC GACCCGGCGT CCGATGGCGT GCGGCAGAGC TTCACCAACA AGGAGCAAGA GGGCCGCCTC GAGGCGCAAC TGACGCCGTT CAATGCCCGC TTCGCCACGG TCACGACGGC GGTGGGCGTG CAGGCCAGCC ACCAGGAACT GACAGCGCCC AGCCCCGACG ATCCGACCAG CCCGCTCAAC GGGCTGTTCG ATCCCAATAA GAACACCAAG CTCGCCGGCT ACGTCTTCAA CGAGCTGCGC TTCACCGAGA GCACCAAGGC GCAGGTGGCC GGACGAATCG AGCACGTCGA CCTGTCCGGA ACGACGCCGG CTTTCGTACC CGGCCTGTTC GACCTCTCCA CCGACCCCGG CGCGATCGGC CCTGCGACGT CGCGCAACCT GTCGTTCACG CCGAAGAGCT TCAGCCTCGG CCTGATCCAG GCGTTGCCGT GGGGGCTTTC GGCCAGCATC ACCGGGCAAT ATGTCGAGCG TGCGCCGAAG CCGGCGGAGC TGTTCTCGCG CGGCGGTCAC GACGCCACCA CCACCTTCGA CATCGGCAAC CCCAATCTGG GGATCGAGAC GGCAAAATCG GTCGAGGTCG GTTTGCGTCG GGCGGACGGC CCGTTCCGCT TCGAGATCAC CGCTTACTAC ACGCAGTTCA GCGGCTTCAT CTATCGGCGG CTGACCGGCA ATAGCTGCGA CGACGTCTCT TGCGTCGATC CGGCGACCGG CACGCTGGAA TTGAACCAGG CGATCTACGC GCAGCGCGAC GCCACCTTCA GAGGCGGCGA ATTCCAGAGC CAGCTCGACG TGGCGCAAAT CTATGGCGGA ACCTGGGGCA TCGAGAACCA GTTCGATGTC GTGCGGGCCA CCTTCGCAGA CGGCACCAAT GTGCCGCGGA TCCCGCCGCT GCGGGTCGGC GGTGGATTGT TCTGGCGTGA CGCCAACTGG CTGACCCGGA TCAACCTGTT GCACGCCTTC GCCCAGAACG ACGTCGCGCC GATCGCCGAG ACAACCACCT CGGGCTACAA TCTGCTCAAG GCGGAGATCA GCTACCGGAC CAAGCTCGAT CCGAACGCAT GGGGCGCCCG CGAGATGCTG GTCGGCCTTG TCGGCAACAA TCTGCTCAAC GAGAACATCC GCAACGCGGT GTCCTACAGC AAGGACAACG TGCTGATGCC CGGCATCGGC GTGCGGGCGT TCGCGAATCT GAAGTTCTGA
|
Protein sequence | MSLRFKRAFL VGSAAFAASE GLPASQALAQ QAATALPEVV VTAPSPIVRR HSAPPSRPAT RVAAPARQRG TAPAEAPPVV AQPAPPPPGA LPIFTDQFAT VTVVPNEEIR RNGGGTLGDL LNNKPGITGS GYAPGASSRP IIRGLDVNRV GIVENGIGSN GASDLGEDHF VPIDPLATNQ VEVIRGPATL RYGSTAIGGV VSASNNRIPD ALPPCATPFQ SYGLPVKAPA AIGGAAGCMN AEVRSAMSSV DRGVESAVLL DAGGNNVAVH ADVFGRNAGD YNVPSYPYQA PGLPFNGRQP NSATQATGAS IGGSYLFDGG FIGAAITQNN SVYRIPGTEG AEFGTRIDAK QTKVTAKGEY RPDAAAIEAI RFWVGATDYK HNEIGLAVAG DPASDGVRQS FTNKEQEGRL EAQLTPFNAR FATVTTAVGV QASHQELTAP SPDDPTSPLN GLFDPNKNTK LAGYVFNELR FTESTKAQVA GRIEHVDLSG TTPAFVPGLF DLSTDPGAIG PATSRNLSFT PKSFSLGLIQ ALPWGLSASI TGQYVERAPK PAELFSRGGH DATTTFDIGN PNLGIETAKS VEVGLRRADG PFRFEITAYY TQFSGFIYRR LTGNSCDDVS CVDPATGTLE LNQAIYAQRD ATFRGGEFQS QLDVAQIYGG TWGIENQFDV VRATFADGTN VPRIPPLRVG GGLFWRDANW LTRINLLHAF AQNDVAPIAE TTTSGYNLLK AEISYRTKLD PNAWGAREML VGLVGNNLLN ENIRNAVSYS KDNVLMPGIG VRAFANLKF
|
| |