Gene RPD_2354 details


Gene Information

Locus tag: RPD_2354
Symbol:
ID: 4022843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2628950
End bp: 2631286
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 64%
IMG OID: 637962547
Product: TonB-dependent haem/haemoglobin receptor
Protein accession: YP_569487
Protein GI: 91976828
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
            [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
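
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2628950
end_bp = 2631286
gene_length_bp = 2337
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute an amino acid to the protein.
codons = span // 3
expected_protein_length = codons - 1

coordinates_consistent = (span == gene_length_bp) and (span % 3 == 0)
protein_consistent = expected_protein_length == protein_length_aa

print("span (bp):", span)
print("codons:", codons)
print("coordinates consistent:", coordinates_consistent)
print("protein length consistent:", protein_consistent)
```

Under these assumptions the span is 2337 bp, which is 779 codons, and removing the stop codon gives the 778 aa reported above.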


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.548378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGGGC TGAACTCGCG CATCTGTGCG CTGTTGCTAT CCGTATCCGT TATCGCGCTG 
GCGGCGTCGC CGAGCGCGGC GCAAACCGCG GTGCTGTCGC CGCAATCCAA AAAGCAGAAG
CCGGTCACGC TCGACCAAGT AGCAAAGCCG GCGCTACAGA TGCCGGCGAC TGACCCGCTC
GACGCCTATG CGCAAGCTGG TCCATCGACC CAGTCGCTCG ATGCGATCAC CGTGGTTTCC
ACCAAGAACG AAGAGCGCGC GATCGACGCG CTGGCGCCGG CGAGCGCGAT CACTGTCGAC
CAGATCCAGC GGCTTCAGCC GAACCGGCTG CAGGACATCT TCGTCGCCAC GCCGGGCGTA
TCGTTCCAGG ATCGCGGCGA CGATCCGTCG ACCGCGATCA ACATCCGCGG TCTGCAGGAT
TTCGGCCGGG TCGGCGTCGT GGTCGACGGT GCGCGGCAGA ACTATCAGCG CTCCGGCCAC
AATGCGCAGG GCTCGCTCTT CCTCGATCCG GAAATGATCG GCGGCGTCGA TGTCGTGCGC
GGTCCGAGCG CCAACATCTA CGGCTCGGGC GCGATCGGTG GCGTGGTCTC GTTTCGGACC
AAGGACATCG ACGACGTATT GCGGGCCGGC GAACGCTGGG GCGTCGACAT GACGGGCTCC
TATGGCAGCA ACAATTCCCG TGGGCTCGGC TCGGTGTTCG GCGGCATCCG GGTCGATCCG
ACCGTCGACG TGTTCGGCGG CGCGCTGTAC CGCACGCAGG GCAACTACAA GGACGGCGCC
GGCACCGAGA TCGGCAATAC CGGCAACGAC CTCGCCGGCG GGTTGCTCAA GCTCACGGTG
CGGCCGGCCG AGGGCCACGA GGTCAAGATA GGCGGCCTGT TCCAGGACTA CAATTACAAC
ATCGGCCAGT TCAACCGCGG ACCGGTGCTG ACCGCGGCGC AGCGCGCACT TTACCAAGGC
TCGTCGGTCT ACGACTCCAA CGTGCGGAAT TCGACCGGAA CGTTGAGCTG GAAGTACTCA
CGGCCGGACG ACATGCTGTT CGACTGGAAC ATCAGCCTGT ACGGCAACCG CACCGACAAC
GACCAGACCA AGACCTATCA CAATTCCACC AGCGGTTCGG CCTATTGCGG CACCGGAAAC
TACGGCAACA ACATCTCGGG TTGCATCGGC GACAAGCGCG GCTACCGGCT CGATACGATC
GGCATCGACG CCAACAACAC CACGCGGTTC GACTACGGCG ACTGGCGCAC CGCGGTGACC
TACGGCTTCG ACGCCTTCAA CGACAAGGTC ACGACCTCGG ACTCGCGAGG CAACTCCAAC
ATCACGACCC CGAGCGGCGA GCGCACGGTG TCCGGCGGAT TCGTCCAGCT CAAGAACAAC
TATGCGAGCT GGCTCGAGGT GATCAGCGCA GCGCGGTTCG ATCATTACGA GCTCAATTCG
CAGACCAACT CCGCGAGCGG CAGCCGGCTG TCGCCCAAGA TCACGGTCGG CGTTACCCCG
CTGGCGGGCC TCACGCCCTA TCTCAGCTAT GCCGAAGGCT ACCGCGCGCC GTCGATCACC
GAGACGCTGA TCGCGGGCTC GCACGCCACC GGCGGCGGGC CGGCGCTGTT CGCCTGTGCG
GACGGCGCGA CGGGTCTGTT CTGCCTGATC CCGAACACCG GGCTGCGGCC CGAAGTCGGA
AAGAACAAGG AAGTCGGCAT CAACCTCAAA TACAACGACG TGTTCATCGC GGGTGACAGC
TTCCGGGGCA AGATCAACGC CTTCCGCAAT GATATCGACA ACTACATCGA CCTGGTCGGG
TCGCCGCCGC AGGCGTCGCG GCTGGGGGCT GCTTACGGCC TCTATAGCAA GAATTACCAG
TACCAGAATA TCCCGCACGC GCGGATCGAC GGCGTCGAAC TCGAGACGTC CTACGATGCC
GGGCTATGGT TCGTCGGCGT CAGCGCTTCT GCGCTGCGCG GCACCAACCC CGATACCGGA
ATCGGTCTCG CCGCGGTTCC GTCGCGAAAG GTCGTCACCT CGGGCGGCGT CCGCTTGCTC
GACCGTCAAT TGACGATCGC GGCGCAATGG GCATCTTATG CGGGCAACTC CAATCTTCCG
ACCGGCTATC TGCCGGCGAC ATCCTATGAT CTGGTGAATC TCAACGTGTC GTACCGGCCG
ACGTCGGACG TCACCGTGAA CTTCTCGATC GATAATCTGC TGAACAATTA CTATCGTCCC
TATGCGATCC CGGGATCGTC GTCGGACGGA ACCACGCAGA ACGACGTACT GTTCAGCAGT
CCCGGGCCGG GCATCGTGTA CAAGGGCGGG ATCAAGGTGC ACTTCGGAGG TGCATAG
 
Protein sequence
MVGLNSRICA LLLSVSVIAL AASPSAAQTA VLSPQSKKQK PVTLDQVAKP ALQMPATDPL 
DAYAQAGPST QSLDAITVVS TKNEERAIDA LAPASAITVD QIQRLQPNRL QDIFVATPGV
SFQDRGDDPS TAINIRGLQD FGRVGVVVDG ARQNYQRSGH NAQGSLFLDP EMIGGVDVVR
GPSANIYGSG AIGGVVSFRT KDIDDVLRAG ERWGVDMTGS YGSNNSRGLG SVFGGIRVDP
TVDVFGGALY RTQGNYKDGA GTEIGNTGND LAGGLLKLTV RPAEGHEVKI GGLFQDYNYN
IGQFNRGPVL TAAQRALYQG SSVYDSNVRN STGTLSWKYS RPDDMLFDWN ISLYGNRTDN
DQTKTYHNST SGSAYCGTGN YGNNISGCIG DKRGYRLDTI GIDANNTTRF DYGDWRTAVT
YGFDAFNDKV TTSDSRGNSN ITTPSGERTV SGGFVQLKNN YASWLEVISA ARFDHYELNS
QTNSASGSRL SPKITVGVTP LAGLTPYLSY AEGYRAPSIT ETLIAGSHAT GGGPALFACA
DGATGLFCLI PNTGLRPEVG KNKEVGINLK YNDVFIAGDS FRGKINAFRN DIDNYIDLVG
SPPQASRLGA AYGLYSKNYQ YQNIPHARID GVELETSYDA GLWFVGVSAS ALRGTNPDTG
IGLAAVPSRK VVTSGGVRLL DRQLTIAAQW ASYAGNSNLP TGYLPATSYD LVNLNVSYRP
TSDVTVNFSI DNLLNNYYRP YAIPGSSSDG TTQNDVLFSS PGPGIVYKGG IKVHFGGA
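
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any further processing. The following is a minimal sketch of one way to do that and to compute a GC percentage that could be compared with the GC content field of this record. The function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced here for illustration, and the demonstration uses only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C characters in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGTGGGGC TGAACTCGCG CATCTGTGCG CTGTTGCTAT CCGTATCCGT TATCGCGCTG"
cleaned = clean_sequence(first_line)

print("bases in first line:", len(cleaned))
print("GC percent of first line:", round(gc_percent(cleaned), 1))
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it through the same two functions. The cleaned length should then match the Gene Length field.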