Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3699 |
Symbol | |
ID | 6411375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3955930 |
End bp | 3959490 |
Gene Length | 3561 bp |
Protein Length | 1186 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642713579 |
Product | TonB-dependent heme/hemoglobin receptor family protein |
Protein accession | YP_001992674 |
Protein GI | 192292069 |
COG category | [M] Cell wall/membrane/envelope biogenesis [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport [COG3637] Opacity protein and related surface antigens |
TIGRFAM ID | [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0724462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGG ATACAAAGTG CGTATGCTCT AACGGGCGAC TGGCCGTTCT GTTGATGGCG GCGACGTTCT TGTCGACGGC CGCGACGGTC GAACAATCCT ACGCACAAGC TCCGCAACCA TCACGCGCAG GCGCCGGTTC TCGCGAGCTC GACATCCAGG CTCAGCCGCT CGGGAGCGCG CTGATCGCAT TCTCCCGTCA GTCGCGGCTC CAGATCTTCG TCGATCAAGC TCTGGTGGCG GGAAAGTCCG CACCGGCGGT CCGCGGCGCG CTGACGCCGT CTGCTGCGCT CGATCGGCTT CTGGCCGGGT CCGGGCTCTC CTATCGGTTC AAGCGTGGAG GCACTGTCAC GATCGTCGGT CCGGTCAGCG CCAGCATGAA CCAGATGCCG GCATCCGATG GATCGCTGAT GCTCGATACG ATCGACATCT CGGGTGGTCG CGGAACTCCG GCCGAGGTGC CGTACCGCAC GCCCGCGCCC GTGACCCAGA TTACGCAGGA GAGCATCGAG CGCTTTCGGG GCAGCAGCCC GGCCGACATC TTCCGCGGTA CGCCGGGTGT GATGTCAGGC GAAGCCAGGA ACGGCGGCGG TTCGATCGAC GTCAATATCC GGGGCATGCA GGGCATGGGC CGTGTCGCCG TCACGGTTGA TGGCGCTGAG AACGGACTGA CCGTCTATCA GGGCTATCAG GGCATCTCGA ACCGGACCTA CGTCGATCCT GATCTCCTGG CCGGCGTCGA CATCACCAAG GGCGCGGACG TCTCGTCACG GGGCATAGCC GGCACGGTGG CGATGCGCAC GGTCGGCGCC GACGATGTCG TGAAGGCGGG TGAAAAATGG GGCGTGCTGG TGAAGAGCAG CTTCGGCACC AACACCGCCA ACCCGCGGCC GGGCGATCTC GGTGGTTATA CCTGGCCGCG GCCCTATGTC AGCGATCCGC AGCCCTATCC GGTGCCGACG GCGCAGGCTT CCGGCCTCGA CCGCCCCGGA CTGTTCACTC CGACCAGTGG CTCGTTCAGC ACCGTCGCGG CGATCAAGGA GGCGAATTAC GATCTGCTGT GGGGCTACGC GTACCGGAAG CAGGGCAACT ATTTCGCCGG AAAGAACGGG CCGGGCGCGG GCGTCCTCGA CACCGGGCCG CAGCCGCTCT GCTATTCGAG TGGCTTCTGT TATTATCCGC CGAGCCCGAT CTCCTATGCG CATGTCTACC AAAACACCGG TCTGACCAGC TATCGCGCCG GCGAGCAGGT TCTCAACACG CAGCTCGAGA CCGAATCCTA TCTCGCCAAG GGCACCGTGC GATCCGACGA CGGCCAGAGT CTGCAGGTTG GCTACAACGG CTATCGCAGC AAGGCTGGCG ACGTCATCGC GTCTTTGTTC TCAAGCGCCC AAAACCAGGC CGTGCAACAG GTCCAGGAGG ACGTGACCAA GGTCGACACC GGCACCGTCC GCTATCGTTG GAAGCCGGAG GACAACGGTC TCGTCGACCT CAAAGCCAAC ATGTGGATGA CCAACCTGAC GTTGCTGACA CCGCCCCGGA CGAGCATCGC GACCAACGTC AAGCCTGAGG ATTTCGGTTT TCCGTACAAC TACCTGCCGG GCAATCAAAC CACGATGGGC GGCGGCGACC TCAGCAACAG GTCCCATCTG TCATTGGAGC GCTTCGGTTC GCTCGACGTC GAGTACGGTG CGTCATACCT CCAGGAGGCG ACCAGGCCGA CGCCGTTTAC GAACGAGCTC AATGCCGGCA TCCCGTCCCG TGAAGGCGCC AGGCAGGAGG TCGCCGGTTT CGGCAAAGTC GCGTACAAGC CGGTCGATTG GCTCACGCTG AACGGCGGTC TCCGGTATTC GCATTACTGG TCTCAGGATC GGAGCACGAT CACCAACGCC TCACAGGTGA ATCCGCAGCC GTCGCGTGAC ACCGGCGGAT ACAGTCCGTC CGCCGGAATC GTGGTCGAGC CGATCAAGGA CGCACAGTTC TACGTCAACT ACTCCAGCGC CCTGCGCATG CCGACCCTGT TCGAATCGGT TGCAGGCTAC AGTCTGATCC CCAACGCTGG TCTGCTTCCC GAGCGCTCCA ACAATTGGGA GGCTGGGGTG AACCTCTTCA GAGAGGGGGT GTTCGCGCCC GCCGACAAGG CGATGATGAA GTTCGGCTAC TTCAACTGGA ATGTGTCGAA CTACGTCGCC CGGATGAGCA AGTCGTTTGT GGATCCGACC TACGGCTACA CCTACTCGGC CCTTCAGGTT GTCAATATCG ATCGTGCGCG CTTCGAAGGA CTCGAATGGT CTGGCCGCTA TCAGAGCGGC GGATTTACCG CAGAGGTCGC AGCGAATTAC TATCTCAACG TCGAATATTG TCCGACTGCG GCCAATTGCG CCAATTCGAC TCTGTCCGGC GACTATTCGG CCAATCAGGT GCCTCCGAAA TACTCGGCGA ACATCACGTT ATCGCAGAAG CTGCTCGACG ATGATCTGAC GGTCGGTGGT CGTGCGAGTT ATATCGGGCC GCGCTCGATC GGATACGGGG CGGTGCAGTA TGGCGCGGCC GCGATCATCG CGCCGATTAT CTGGCAGCCG TATTGGCTGG TCGACGTGTT CGCCGAGTAT AAGCTCACCA AGGATATCAC GCTGCGTGCG ACCGTCGAGA ACTTGACGGA TCAGTATTAT GTCGACCCGC TCAGTCTGGT TCAGCAGCCG GGGCCGGGGC GCACGTTCCG CCTCGGAATG AGCGGGAAGT TCGGCGGCAG CGAGACGGTC ACACCCGGAT CTCTGACGCG CCTGTTCGCG CCGAGCACAG CGGTTGCGGA CTGGAGCGGT TTCCATGCCG GCGTCAACAC AGCCTACAAC TCCGCGAAAT TCGGTGGGGC CATGACCGCG TTGGACGGAT CGGCCGACAG TCACGCGGCG AACGAAGCGC CAAATCAGCA GGTCGGCGCG CTCTCGTTCG GCTTGCACGC AGGCTACGAC TATCAGTTCG CGAACCGGTT CATCGTCGGC ATCGAGGCCG ATATCGCCAA GACTGAGATC GGTGCGCCCC AGGTGACCTT CGCGGCCGAA GGCGTCGCCT GCAACTGCGC GAACAACCTG GCCAGGATGC GGCAGTTCGA AGCCGTGCAG CAGAGCGAGA TCAAATGGCT CTCCACCGTG CGCGGACGGC TCGGCTACGC CGTCAACGAC AGGTTGATGC TGTTTGCCAG TGGTGGTGTT GCCTTCATGA GGCAAGACGA ATCCCGGACG CAGTACCGTG CCGTCAATTC CGGAAACTTC TCGTGGCCGA CGACGACCGT GGCGGCTTTC ACCGAAGACA GCTCTCGTAC CCGAGTGGGA TATGTGATCG GTGGGGGCGG CGAATTCGCT GTGGGCGGTG CGTGGTCGAT CAAGGCTGAG TATCTGCTGG CCCGGTTTAC CGGCGAGGAT TTCACCTTCG CCGACGCAAG AGCAGGCGTA CTGCCAGGCA ATTTCGTGAC GCCGGCGACC TACGCGACGA GCAACGGACG CAAGCTCAGC AGCAACGTGG ATATCCCCAT GGTCCGCGTC GGAGTCAACT ATCGGTTTTA G
|
Protein sequence | MSLDTKCVCS NGRLAVLLMA ATFLSTAATV EQSYAQAPQP SRAGAGSREL DIQAQPLGSA LIAFSRQSRL QIFVDQALVA GKSAPAVRGA LTPSAALDRL LAGSGLSYRF KRGGTVTIVG PVSASMNQMP ASDGSLMLDT IDISGGRGTP AEVPYRTPAP VTQITQESIE RFRGSSPADI FRGTPGVMSG EARNGGGSID VNIRGMQGMG RVAVTVDGAE NGLTVYQGYQ GISNRTYVDP DLLAGVDITK GADVSSRGIA GTVAMRTVGA DDVVKAGEKW GVLVKSSFGT NTANPRPGDL GGYTWPRPYV SDPQPYPVPT AQASGLDRPG LFTPTSGSFS TVAAIKEANY DLLWGYAYRK QGNYFAGKNG PGAGVLDTGP QPLCYSSGFC YYPPSPISYA HVYQNTGLTS YRAGEQVLNT QLETESYLAK GTVRSDDGQS LQVGYNGYRS KAGDVIASLF SSAQNQAVQQ VQEDVTKVDT GTVRYRWKPE DNGLVDLKAN MWMTNLTLLT PPRTSIATNV KPEDFGFPYN YLPGNQTTMG GGDLSNRSHL SLERFGSLDV EYGASYLQEA TRPTPFTNEL NAGIPSREGA RQEVAGFGKV AYKPVDWLTL NGGLRYSHYW SQDRSTITNA SQVNPQPSRD TGGYSPSAGI VVEPIKDAQF YVNYSSALRM PTLFESVAGY SLIPNAGLLP ERSNNWEAGV NLFREGVFAP ADKAMMKFGY FNWNVSNYVA RMSKSFVDPT YGYTYSALQV VNIDRARFEG LEWSGRYQSG GFTAEVAANY YLNVEYCPTA ANCANSTLSG DYSANQVPPK YSANITLSQK LLDDDLTVGG RASYIGPRSI GYGAVQYGAA AIIAPIIWQP YWLVDVFAEY KLTKDITLRA TVENLTDQYY VDPLSLVQQP GPGRTFRLGM SGKFGGSETV TPGSLTRLFA PSTAVADWSG FHAGVNTAYN SAKFGGAMTA LDGSADSHAA NEAPNQQVGA LSFGLHAGYD YQFANRFIVG IEADIAKTEI GAPQVTFAAE GVACNCANNL ARMRQFEAVQ QSEIKWLSTV RGRLGYAVND RLMLFASGGV AFMRQDESRT QYRAVNSGNF SWPTTTVAAF TEDSSRTRVG YVIGGGGEFA VGGAWSIKAE YLLARFTGED FTFADARAGV LPGNFVTPAT YATSNGRKLS SNVDIPMVRV GVNYRF
|
| |