Gene Rpal_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2082 
Symbol 
ID6409742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2252218 
End bp2254533 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content65% 
IMG OID642711968 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_001991080 
Protein GI192290475 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.340788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTGG GTCTTGCCGG CGTGAATGCC GGGCGCAGTT CTATGGTCAA CACGCTGCTG 
CTGACGAGCG CGCTGATCGC ACCGGCGGTG TGGCCGTCAT CGCAAGCCGC CGCACAGGAT
GCCAAACCGC AAGCGACCCA ATTGCCCGCC GTCGAAGTCG CCGCCCCGGA GGCGAGCCGG
CGTCAGACGC AGCCCGCGCG CAACCGGAAT CGTGCCGCCG CAGGCCCGTC ACGACGCTCC
GCGCAGCGCG CCGCGCAAGC GCCGCAGCCG GCACAACCGG TGCAGCGCCC CGTGTTCGAA
CGCGGCACCG ATCCGGTGCC GGGCTTCGTG CCGAGCGTGA GCGCCAGCGG CACCAAGACC
GACACCAAGC TGATCGAGAC GCCGCAATCG GTTTCCGTGA TCAGCCGCGA CAATCTCGAT
GCGCGCGGCA TCGACACCGT CGCCCAGGCG CTGCAATATA CCGCCGGAGT CGCGGTGCAG
ACGTTCGGCG GCGATCCCCG CTACGATCAG GCGCGTATCC GCGGCTTTGA AACCAACGGC
TTCTCCAATT TCCGCGACGG CCTGCGCGAC ACCGCGAATG GCTCGGCCTA TTTCTCGGTG
TTCCGCAACG AGCCCTACGG CGTGCAGCGC ATCGACGTCG TCAAGGGCCC GAGCTCGGTG
ATGTACGGTC AGAGCCCGCC CGGCGGCCTG ATCGACCTGA TCAGCAAGCG CCCGACCGAC
CACGCGTTCG GCGAAGTGGT CGGCCTCGTC GGCACCGCCG ACCGGCTGCA GGGTGCGTTC
GACGTCGGCG GTCCGATCGA CAAGGACAAG ACCGTGCTCT ATCGCCTCAC CGGCGTGCTG
CGGGATTCCG ACGCGCAGGT CGCGAAGTTC TCCGACAAGG TCAAGGACGA CCGCGCCTAT
ATCGCGCCGG CGGTGACCTG GCGGCCCACC AACGACACCA CGCTGACGAT CCTCAGCGAC
TATCAGCACG ACGTTACCGG CATCGCCAGC CCGCTGTCGG TCGCAACGGT CCGCGGCGGC
AAGGTCGTCA ACATGCGCGC GCTGCCGCTG TTTCTCGGCG ATCCGTCGTA CAACACCTTC
GACCAGACTC AGTACCGCGT CGGCTATCAG TTCGAGCACC GCTTCAGCGA CGATCTGATC
GTGCGCTCGC GGGCGCGCTA CGGCCACGTC GATCTGGAGT ATCGTTCCAT CACCCAGGCC
GGCACGCCGC TCGACACCCA AACCGTGTTT CCACGAAACG CGCGCCGCGT GCTCGAAAAC
AGCGACAGTT TCGGCATCGA CAACCACGTC ATCGCCAAGA CCTGGACCGG CCCGCTGCAG
CACACCGTTC TGATCGGCAC CGACTATCAG GCGTTCAAGC TCGACGGCGA GTCGTTCGGC
GGCTTGGCGC CGTCGATCGA CGTGCTCAAT CCGGTCTACG GCCAGCCGGT GGCGATGCCG
ACGCTCCGGC TGCAGAGCTA CAAGCAGAAC CTGAACCAGG CCGGCGTCTA TCTGCAGGAT
CAGATCAAGC TGCAGAACTG GATCCTGACG CTCGGCGGCC GCTACGACGC GGCCCAGCAG
ACCATCCTCA ACCGCCTCAC CGGCGTGCCG CAGCCGAACG ACGACACCGC CTTCACCAAG
CGCGCCGGCC TGACCTATCT GTTCGACAAC GGCCTCGCGC CCTATGTCAG CTATTCCGAA
TCGTTTCTGC CTACAGGGGG CGTCGATTTC AACTCCAACG CCTTCAAGCC CACCAAGGGC
AAGCAATACG AGGGCGGCAT CAAGTTCCAG CCGAACCGCG ATCTGCTGTT CACCGCGGCG
GTGTTCGACC TCACCCAGGA CAATGTGCTG ACCGCCGATC CGAACCATCT GAACTACAAT
ATCCAGACCG GCCAGGTGAA TTCGCGCGGC CTGGAGCTGG AGATGTTGGC CAAGCCGATG
CCGGGCCTCA ATCTTCTGGC GAGCTACACG CTGCAAAATC TCAAGAATAC CCAGAGCAAC
AACGGCGACG TCGGCAAGAT GCCGGTGCTG ATCCCCCGCC ACATGGCGTC GGCCTTCGCC
GACTACACGC TGCAGAGCGG ACCGCTGGCC GGCTGGGGCT TCGGCGCCGG CTTCCGCTAT
ATCGGCGAGT CCTACATGGA CATCCTCAAC ACGCTGACCA ACGACGCCTA TACGGTGTTC
GACGCCGGGC TGCATTATCG CCAGCCGAAG GGCATCAATC TGGCGCTCAA CGTCAAGAAC
ATCGTCAACA AGGACAACGC GATGTGCACC GCCACCGGCG GCTGCCAGTA CATCTCCCCG
CGGGTGATCA CCGCGACCGC GAGCTATCGC TGGTGA
 
Protein sequence
MKLGLAGVNA GRSSMVNTLL LTSALIAPAV WPSSQAAAQD AKPQATQLPA VEVAAPEASR 
RQTQPARNRN RAAAGPSRRS AQRAAQAPQP AQPVQRPVFE RGTDPVPGFV PSVSASGTKT
DTKLIETPQS VSVISRDNLD ARGIDTVAQA LQYTAGVAVQ TFGGDPRYDQ ARIRGFETNG
FSNFRDGLRD TANGSAYFSV FRNEPYGVQR IDVVKGPSSV MYGQSPPGGL IDLISKRPTD
HAFGEVVGLV GTADRLQGAF DVGGPIDKDK TVLYRLTGVL RDSDAQVAKF SDKVKDDRAY
IAPAVTWRPT NDTTLTILSD YQHDVTGIAS PLSVATVRGG KVVNMRALPL FLGDPSYNTF
DQTQYRVGYQ FEHRFSDDLI VRSRARYGHV DLEYRSITQA GTPLDTQTVF PRNARRVLEN
SDSFGIDNHV IAKTWTGPLQ HTVLIGTDYQ AFKLDGESFG GLAPSIDVLN PVYGQPVAMP
TLRLQSYKQN LNQAGVYLQD QIKLQNWILT LGGRYDAAQQ TILNRLTGVP QPNDDTAFTK
RAGLTYLFDN GLAPYVSYSE SFLPTGGVDF NSNAFKPTKG KQYEGGIKFQ PNRDLLFTAA
VFDLTQDNVL TADPNHLNYN IQTGQVNSRG LELEMLAKPM PGLNLLASYT LQNLKNTQSN
NGDVGKMPVL IPRHMASAFA DYTLQSGPLA GWGFGAGFRY IGESYMDILN TLTNDAYTVF
DAGLHYRQPK GINLALNVKN IVNKDNAMCT ATGGCQYISP RVITATASYR W