Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3552 |
Symbol | |
ID | 3911354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4065589 |
End bp | 4067802 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885454 |
Product | TonB-dependent receptor |
Protein accession | YP_487158 |
Protein GI | 86750662 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0127436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCG CTCGTCCGCG GTCGCGCAGC CTCGTGCTGC TGCAGGCCAC TGCGCTCGTT TCCACTTGTC TTGTTTCGCT GCCGGCGATG GCACAGAGCC CTCGCGACGG CACTTTGCCG CCGGTGACGG TGCAGGCGCC GGATCAGGCC CGCCCGGCCG TGCCGCGCGC GCAGCGCAGC CCGTCGCGGG CGACGCGAGC GGTGCGCGCC ACCAGCCCGG CGCCGTTGCC GGCCGCCGCA CCGGTCGACA GCGCGCAGGC GGCGAGAGGC CCGGCGTTGA CCGTGCTCAC GGTGCAGCAG GCCTTGAACG ACATTCAGCA GACGCCCGGC GGCGTCGCGC TGGTGCCCGC CAACGCCTAT CGCAACTCGA CGGTGTCCAA CACCATCAAG GACATTCTCG ACTACGTGCC GGGCGTATTC GCGCAGCCGA AATGGGGCGA CGACACCAGG CTGTCGATCC GCGGCTCGGG GCTGTCGCGC AACTTCCATC TGCGCGGCGT GCAGCTCTAC ATGGACGGCA TTCCGATCAA CACCGCGGAC GGCTATGGTG ATTTTCAGGA GATCGACCCC ACCGCGTACA AATACGTCGC GGTCTACAAG GGCGCCAACG CGCTGCAGTT CGGCGCCAAT TCGCTCGGCG GCGCGATCAA TTTCGTCACC GCGACCGGCC GCGATCCGTT CCCGAACGGC GTGAGCGTCG ACGCCGGTGC GTTCGGCTAT CGCCGGCTGC AGGCCAATGC CGGTGGCGTC AACGGCCCGT GGGACGGCTA TGTCACCGCC TCGACGCAGG CGGCTGAAGG TTTCCGCAAC CACAGCGACG GCGAGGCCCA CCGGCTCAGC GCCAATATCG GCTACCAGAT CACGCCGGAC ATCGAGACCC GGTTCTATCT CAACGCCAAT CACGTCCGGC AGCGGATCCC CGGCAGCGTC ACCAAGACCT CGGCGCTGAC CTCGCCGGAA ACAGCCGCAG CCAACAACGT CGCGCTCGAC CAGCAGCGCA ACATCGACAC GGTGCGGCTC GCCAACAAGA CCACGATCCG GTTCGACAAC ACGGTGGTCG ATTTCGGCGC CTTCGGCGTC GATCGCCACC TGATGCATCC GATCTTCCAG TGGCTGGACT ATCGCTATCA GGACTATGGC GGCTTCGCCA AAGTCACCGA CGATCGCTTC ATCGGCGGCT ATCGCAACCG TCTGGTCGCC GGCGTCAACC TGCTCAACGG TCGGATCGAC AACAAGCAAT ACGTCAACCT CGCCGGGCAG AAGGGTGCGC TGGCGTTCTC GTCGATCGAC AAATCCACCA ACACGTCGTT CTACGTCGAG GACTCGTTCT ATTTCCTGCG CAACGTTGCG CTGGTCGCCG GCACGCAGTT CCTGCACGCC ACCCGTGACC GGGTCGTCCG ATTCACGTCG AACAACGATC CGTCGGGGAC GACGGAGTTC AACTTATGGA GTCCCAAGGC TGGGTTGTTG TGGGACATCG ATCCGACCTG GCAGGCGTTC GGGAACATCT CGCGGAGCGC CGAAGTGCCG AGCTTCGGCG AGAGCTCTGC AGCGCAGTCG ATTCCCTTCA CCAGCATCCG GCCGCAGCGT GCGACGACTT ACGAGATCGG CACCCGTGGC CGTCGGCCCG ATCTGGTCTG GGAGCTGACC GGCTATCGCG CCGAGATCAA GGACGAGCTG CTCTGCATCT ACAGTTCGTT AGGCAATTGC AACGTGACCA ACGCCGATCG CAGCGTGCAT CAGGGCATCG AGGCCGGGCT CGGCGTCGCG CTGTTCAAGA ACCTGTTCGA GCGAGGACAC GCGCCCGACC GGCTGTGGCT GAACATGGCC TACACGCTGA ACGACTTCCG CTTCGACAAC GATGCGACCT TCGGCAACAA TCTGCTGCCC GGCGCGCCGC GGCACTATCT GCGCGCCGAA TTGCTCTACA AGAATCCGAA CGGGTTCTAT GCCGGCCCGA ACGTGGAGTG GGTGCCGCAG GCCTATTTCG TCGACAGCGC CAACACGCTG AAGACCGAGC CCTATGCGCT GCTCGGCCTC AAGGCGGGGA TCGACAATGG CGGGCCGTAC TCGATCTATA TCGAGGGCCG CAACCTCACC AACAAGGCCT ACATCGCCTC CGCCAGCATC ATCGACAAGG CCACCGCGAC CTCGCCTCTG TTCGAACCGG GCACCGGACG CGCGGTCTAT GCCGGCTTCA AGGTGCGCTG GTGA
|
Protein sequence | MSSARPRSRS LVLLQATALV STCLVSLPAM AQSPRDGTLP PVTVQAPDQA RPAVPRAQRS PSRATRAVRA TSPAPLPAAA PVDSAQAARG PALTVLTVQQ ALNDIQQTPG GVALVPANAY RNSTVSNTIK DILDYVPGVF AQPKWGDDTR LSIRGSGLSR NFHLRGVQLY MDGIPINTAD GYGDFQEIDP TAYKYVAVYK GANALQFGAN SLGGAINFVT ATGRDPFPNG VSVDAGAFGY RRLQANAGGV NGPWDGYVTA STQAAEGFRN HSDGEAHRLS ANIGYQITPD IETRFYLNAN HVRQRIPGSV TKTSALTSPE TAAANNVALD QQRNIDTVRL ANKTTIRFDN TVVDFGAFGV DRHLMHPIFQ WLDYRYQDYG GFAKVTDDRF IGGYRNRLVA GVNLLNGRID NKQYVNLAGQ KGALAFSSID KSTNTSFYVE DSFYFLRNVA LVAGTQFLHA TRDRVVRFTS NNDPSGTTEF NLWSPKAGLL WDIDPTWQAF GNISRSAEVP SFGESSAAQS IPFTSIRPQR ATTYEIGTRG RRPDLVWELT GYRAEIKDEL LCIYSSLGNC NVTNADRSVH QGIEAGLGVA LFKNLFERGH APDRLWLNMA YTLNDFRFDN DATFGNNLLP GAPRHYLRAE LLYKNPNGFY AGPNVEWVPQ AYFVDSANTL KTEPYALLGL KAGIDNGGPY SIYIEGRNLT NKAYIASASI IDKATATSPL FEPGTGRAVY AGFKVRW
|
| |