Gene Information |
Locus tag | RPB_3038 |
Symbol | |
ID | 3910838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3463303 |
End bp | 3465525 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884945 |
Product | TonB-dependent receptor |
Protein accession | YP_486651 |
Protein GI | 86750155 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
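The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (RPB_3038).
length = gene_length(3463303, 3465525)
print(length)                  # 2223
print(protein_length(length))  # 740
```

Both results agree with the Gene Length (2223 bp) and Protein Length (740 aa) fields of this record.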
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGGTA TTGATCTCCG GCAGATGGCT GGACGCGGCA TCGCGGTCGG CTGTGTGATT TTCAGCAGCG GCTTATCATC GGCAACGGCG CAGGAGCAAA CACCCCGGGC GCCGGTCGAA ACCTTGCCGC AGATCGCGGT CGATGCCCCG CCGCGCGTGG CGCGCAGCGC GGTACGACGT CCGGTACGCC CGTCCGCGAC ATTCCCGGAA CGTCGGGTGC CCGCACCGGC GCCAGCCGCG ATCGCGGATG CGGCGCTCGC CGCCGCCAGC TCCGGGCCGG CGCCGCTCAG CGCCCCACAG AACCAGGCCG CCAGCGCGCG CGTGATCAGC GGTGCCGAAA TCAACGCGAT GCCGGTCGCA CGGGTCGGTG AGGTGCTGGA GGCGGTGCCC GGGCTGGTCG TCACCCAGCA CTCCGGCGAG GGCAAAGCGA ACCAGTATTT CCTCCGCGGC TTCAATCTCG ACCACGGCAC CGATCTGGCG ATCACCGTCG ACGGCATGCC GGTCAACATG CCGACGCACG GCCACGGCCA GGGCTATGCC GATATCAATT TCATGATTCC GGAGTTGATC AGCGCCCTGA CGCTACGCAA GGGACCCTAT TTCGCCGATG TCGGCGATTT TGGTTCGGCA GGCGCCGTGG GTATCGATTA TTTTCGCGCG ATGCCGAAGA CCATCGCCGA AGTCACGATG GGCAGCTTCG GCTATCGACG GCTGCTCGGC ACCGGCTCGA CGAAGGCGGG CGAGGGGACG GTGCTCGCCG CCTTCGAAGC ACAAACCTAT AACGGCCCCT GGGACGTGCC CGACAACGTC CGCAAGCTGA ACGGCGTGCT GCGCTACAGC CAGGGCACGG TAACGGACGG CTTCTCGCTG ACCGGCATGG CCTATGCCAA CCGCTGGACC TCGACCGATC AGGTCGCGCA GCGCGCGATC GATCAGGGCG TGATCGGCCG CTACGGCTCG CTCGATCCGA CCGACGGCGG CAATTCCAGC CGATTCAGCC TGTCCGGACG GTTCGCGCGC TCGAGCGACA TCGGCCAGAC CGATCTCAAT GCCTATGTGA TCCGGTCGTC GATGCAGCTC TACAACAATT TCACTTACTA TCTCGACGAT CCGGTCGACG GTGATCAGTT CAACCAGTAC GACCGCCGCA TGGTGATCGG GCTGAACGGC ACCCAGCGCT TCGACTATCG GTTCGCAGGG CTACCGGTGG AAACGCGCGT CGGGCTTCAG AGCCGCGCCG ACAGTATCGA TCTGGGCCTC ACCAAGACGT TGCAACGCAA TTGGCTGTCG ACGGTGCGCG CCGACGACGT CACCGAGCAG TCGCTCGGAC TGTGGACCGA CACCACGGTG CGCTGGACCG ACTGGCTGCG CACCACGGCG GGTGTCCGCG AAGATTATGT CGGCGGCCGC GTGCGCAGCG ACACGCCGGC GAATTCCGGC TCGGCCTCCG CGACGATGAC CAGTCCCAAG GTCGGGATCG TGCTCGGCCC GTGGCTCGCG ACCGAGTTCT TCGGCAATGC CGGCACCGGC CTGCACAGCA ACGACATTCG CGGTGCGACC ATCACTGTCG ATCCGACCGA CAAGATCACG CCGGCAGACC GTGTGCCGCT GCTGGTGCGC TCGAAAGGCG CCGAACTCGG CGTCCGCAAT CGCCTGGTCC CTGGGTTGAC CACGTCGCTG GCGGTGTTCG TGCTCGATTT CGATTCCGAA CTGCTGTTCG TCGGTGACGC CGGCACCACC GAGGCGAGCC GGCCGAGCCG GCGCGTCGGC GTCGAGTGGA CCAGCCAGTA CAGGCCGCTG 
CCGTGGCTGG GGTTCGATTT CGACGTCGCC TACACGCGGG CTCGCTTCAC CGATGTCGAT CCGGCCGGCG ATCTGATTCC CGGCGCGCCC GCCTGGGTGG CGAGCACCGG GCTGACGTTC GGACGGGAAA CCGGCTGGTT CGGCGCGCTG AAAGGCCGCT ATTTCGGACC CCGGCCGCTG ATCGAGGACG GCAGCGAGCG CTCGCTGGCG TCACTGATTT TCAATGCCCG TGCGGGCTAT CGCTTCGAAA ACGGGCTGCG GTTTCAGCTC GACGTCTTGA ACCTGTTCAA CGCCAAGACC AACCAGATCG AGTACTACTA TCTTTCCCGC CTGCCGGGCG AGCCGCTCGA TGGCGTCGGC GACCGCCACG TCCATCCGGC GGAACCGCTG GCCGTGCGGC TGACATTGGC CGGCGCATTC TGA
|
Protein sequence | MLGIDLRQMA GRGIAVGCVI FSSGLSSATA QEQTPRAPVE TLPQIAVDAP PRVARSAVRR PVRPSATFPE RRVPAPAPAA IADAALAAAS SGPAPLSAPQ NQAASARVIS GAEINAMPVA RVGEVLEAVP GLVVTQHSGE GKANQYFLRG FNLDHGTDLA ITVDGMPVNM PTHGHGQGYA DINFMIPELI SALTLRKGPY FADVGDFGSA GAVGIDYFRA MPKTIAEVTM GSFGYRRLLG TGSTKAGEGT VLAAFEAQTY NGPWDVPDNV RKLNGVLRYS QGTVTDGFSL TGMAYANRWT STDQVAQRAI DQGVIGRYGS LDPTDGGNSS RFSLSGRFAR SSDIGQTDLN AYVIRSSMQL YNNFTYYLDD PVDGDQFNQY DRRMVIGLNG TQRFDYRFAG LPVETRVGLQ SRADSIDLGL TKTLQRNWLS TVRADDVTEQ SLGLWTDTTV RWTDWLRTTA GVREDYVGGR VRSDTPANSG SASATMTSPK VGIVLGPWLA TEFFGNAGTG LHSNDIRGAT ITVDPTDKIT PADRVPLLVR SKGAELGVRN RLVPGLTTSL AVFVLDFDSE LLFVGDAGTT EASRPSRRVG VEWTSQYRPL PWLGFDFDVA YTRARFTDVD PAGDLIPGAP AWVASTGLTF GRETGWFGAL KGRYFGPRPL IEDGSERSLA SLIFNARAGY RFENGLRFQL DVLNLFNAKT NQIEYYYLSR LPGEPLDGVG DRHVHPAEPL AVRLTLAGAF
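The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing and compute a GC fraction, which could then be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display; uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
fragment = clean_sequence("GTGCTTGGTA TTGATCTCCG")
print(len(fragment))          # 20
print(fragment[:3])           # GTG
print(gc_fraction(fragment))  # 0.5
```

The fragment begins with GTG, which is an accepted alternative start codon under translation table 11 and is read as methionine at the start position, consistent with the protein sequence beginning with M.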