Gene Information |
Locus tag | RPB_4192 |
Symbol | |
ID | 3912000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4763062 |
End bp | 4765416 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886096 |
Product | TonB-dependent receptor |
Protein accession | YP_487795 |
Protein GI | 86751299 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
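The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for locus tag RPB_4192.
start_bp, end_bp = 4763062, 4765416
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 2355
print(protein_length(length_bp))  # 784
```

Under these assumptions the computed values agree with the Gene Length (2355 bp) and Protein Length (784 aa) fields of the record.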
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0530533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCC CGTTCAAACG CGCGCTCCTG CTCGGAAGCG CTGCCGTCGT CGCCTTCGAA AGTCTGCCTG CCCCACAGGC CTTCGCGCAA ACCACCCTGC CCGAGGTGAC CGTCACCGCG CCGAGCCCGA TCGTGCGCCG CCACACCACC CCGCCGGCCC GCCCCGCGAC CCGCGTGGCC GCGCCCGCGC GGCAGCGCGG AGCCGCTCCG GCCGAAACGC AGCCCGTCGT CGCCCAGCCG GCTCCGGCGC TGCCCGGTAC GCTGCCGATC GTCACCGACC AGTTCGCCAC CGTCACCGTG GTGCCGAACG AGGAGATCCG GCGCAATGGC GGCGGCACGC TCGGCGATCT GCTGAACAAC AAGCCGGGCA TCACCGGCTC GAGCTACGCA CCGGGCGGCG CCAGCCGGCC GATCATCCGC GGTCTCGACG TCAATCGCGT CAACATCATC GAGAACGGCA TCGGCAGCAA CGGCGCCTCC GATCTCGGCG AAGACCATTT CGTGCCGATC GACCCGCTCG CGACCAACCA GGTCGAGGTG ATCCGCGGCC CGGCGACGCT GCGCTACGGC TCGACCGCGA TCGGCGGCGT GGTCAGCGCC ACCAACAACC GGATTCCCGA CGCGTTGCCG CCCTGCGCGC AACCGTTCCA GAGCTACGGC CTGCCGGTGA ACGCGCCCGC GGCGCTCGGC GGCTCGGCCG GCTGCATGAA CGCCGAGGTC CGCAGCGCCG TGAGTTCGGT CGATCGCGGC GTCGAAGGCG CCGTGCTGCT GGACGCCGCC GGCAACAATG TCGCGGTCCA TGCCGACGTC TACGGCCGCA ACACCCGCGA CTACAACGTA CCGAGCTACC CGTATGCCGA TGCCGGCATT CCGTTCAACG GGCGCCAGAC CAACTCGGCC TCGCAGGCGA GCGGGGCGTC GATCGGCGGC TCGTATCTGT TCCACGGCGG CTTCATCGGT GCGTCGGTCA CGCAGAACAA CTCGATCTAC CACATCCCCG GCCCCGAGGG AGTGGAACTG GGCACAAAGA TCGACGCCAA GCAGACCAAG TTCAACGCCA AGGGCGAGTA TCGTCCCGAC GCCGCCGCGA TCGACGCGAT CCGGTTCTGG GTCGGCGCCA CCGACTACAA GCACAACGAG ATCGGCCTCG CCGATGCCGC CGACCCGACC AGCGGCGGTG TGCGTCAGAC CTTCACCAAC CGCGAGCAGG AAGGCCGGCT CGAAGTTCAG CTGACGCCGT TCAACGCCGG CTTCGCGGCG GTGACCACGG CGGTCGGCGT CCAGGCCAGC CATCAGGAAC TGACTGCGCC CAGCCCCGAC GATCCGACCA GCCCGCTGAA CGGACTGTTC GATCCCAACA AGAACACCAA GGTCGCCGGC TACGTCTTCA ACGAACTGCA GTTCACCAAT ACCACCAAGG CGCAGGTCGC CGGCCGGATC GAGCACGTCG AACTGTCGGG ATCATCACCC TCCTCGGTGC CGGAGATCTT CGACCTCAAC ACCGATCCCA ATGCGATCGG CGCCGCCACC TCGCGCAACC TGTCCTTCAC GCCGAAGAGC TTCAGCCTCG GCCTGATCCA GGCGCTGCCA TGGGGCCTGT CGGCCAGCAT CACCGGGCAA TATGTCGAGC GCGCCCCGAA GCCCGCCGAA TTGTTCTCGC GCGGCGGCCA CGACGCCACC GCGACCTTCG ACATCGGCAA TCCCAATCTG AAGATGGAGA CGGCGAAGTC GGTCGAAGTC GGCCTGCGCC GGGCGGACGG CCCGTTCCGG TTCGAGATCA CCGGCTACTA CACCCAGTTC 
AGCGGCTTCA TCTATCGCCG GCTGACCGGC AACACGTGCG AGGACGGCGC GTGTATCGTC GGCACTGGCC TCGAACTGAA CCAGGCGATC TATTCGCAGC GCGACGCCAC CTTCAAGGGC GGTGAATTCC AGAGCCAGCT CGACGTCGCG CAGTTCTACG GCGGCACCTG GGGCATCGAG AACCAGGTCG ACGTGGTACG CGCCACCTTC GCCGACGGCA CCAACGTGCC GCGGATTCCC CCGGTGCGCC TCGGCGGCGG CCTGTTCTGG CGCGACGCCA ACTGGCTGAT GCGGGTCAAC CTGCTGCACG CCTTCGCGCA GAACAACGTC GCCGACATCG CCGAGACGAC GACGCCCGGC TACAATCTGC TGAAGGCCGA GATCAGCTAC CGCACCAAGC TCAACCCCAA CGTCTGGGGC GCACAGGAAA TGCTGGTCGG CCTGGTCGGC AACAATCTGC TCAACGAGGA CATCCGCAAC TCGGTGTCCT ACAGCAAGGA CAACGTGCTG ATGCCCGGTA TCGGCGTGCG CGCGTTCGCG AATCTGAAGT TCTGA
|
Protein sequence | MSLPFKRALL LGSAAVVAFE SLPAPQAFAQ TTLPEVTVTA PSPIVRRHTT PPARPATRVA APARQRGAAP AETQPVVAQP APALPGTLPI VTDQFATVTV VPNEEIRRNG GGTLGDLLNN KPGITGSSYA PGGASRPIIR GLDVNRVNII ENGIGSNGAS DLGEDHFVPI DPLATNQVEV IRGPATLRYG STAIGGVVSA TNNRIPDALP PCAQPFQSYG LPVNAPAALG GSAGCMNAEV RSAVSSVDRG VEGAVLLDAA GNNVAVHADV YGRNTRDYNV PSYPYADAGI PFNGRQTNSA SQASGASIGG SYLFHGGFIG ASVTQNNSIY HIPGPEGVEL GTKIDAKQTK FNAKGEYRPD AAAIDAIRFW VGATDYKHNE IGLADAADPT SGGVRQTFTN REQEGRLEVQ LTPFNAGFAA VTTAVGVQAS HQELTAPSPD DPTSPLNGLF DPNKNTKVAG YVFNELQFTN TTKAQVAGRI EHVELSGSSP SSVPEIFDLN TDPNAIGAAT SRNLSFTPKS FSLGLIQALP WGLSASITGQ YVERAPKPAE LFSRGGHDAT ATFDIGNPNL KMETAKSVEV GLRRADGPFR FEITGYYTQF SGFIYRRLTG NTCEDGACIV GTGLELNQAI YSQRDATFKG GEFQSQLDVA QFYGGTWGIE NQVDVVRATF ADGTNVPRIP PVRLGGGLFW RDANWLMRVN LLHAFAQNNV ADIAETTTPG YNLLKAEISY RTKLNPNVWG AQEMLVGLVG NNLLNEDIRN SVSYSKDNVL MPGIGVRAFA NLKF
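The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is an illustration under stated assumptions, not the pipeline that produced this record: translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard codon layout is used here, and alternative start codons are not handled because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon layout in NCBI base order T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCACTCC CGTTCAAACG CGCGCTCCTG"))  # MSLPFKRALL
# Last 15 bases, ending in the TGA stop codon.
print(translate("AATCTGAAGT TCTGA"))                  # NLKF
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence in the same way should yield the complete 784-residue protein.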