Gene RPC_1973 details


Gene Information

Locus tag: RPC_1973
Symbol:
ID: 3973646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2147026
End bp: 2149377
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 64%
IMG OID: 637925084
Product: TonB-dependent haem/haemoglobin receptor
Protein accession: YP_531849
Protein GI: 90423479
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
            [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
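
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2147026
end_bp = 2149377

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2352
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 783
```

Under these assumptions the result agrees with the listed Gene Length (2352 bp) and Protein Length (783 aa).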


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.41369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGGGC TGAGCACGCG CGCCTGCGCG TTACTTCTAT CTGTATCTGT TGTTTCGCTG 
GCCACCGCGC CAAGCTACGC ACAATCGGCG GCGCCGGAAG CGCAGGCGCC GCAACCGGCA
AAACCCAAGC CGGCCAAGCG CAAGCGTGCG GTGGTCGAGC CGGCACCTGC GGCGCTTCAG
GCGCGCGCGC AGATCGGCCA GCCGACCTAT CAGTCGCTCG ACGTCATCAC CGTCGCCGCC
AGCAAGACTG AAGAGAGGGC GATCGACGCG CTGGCCCCGG TCAGCGTGGT GACGCTGGAG
CAAATCCAGG GCATTCAGCC GAACCGGCTG GCCGACGTGT TCCGCGCCGT GCCCGGGGTG
TCGTTCCAGG ATCGCGGCGA CGATCCGGCG ACCGCGATCA ATATCCGCGG CCTGCAGGAT
TTCGGCCGGG TCGCGGTGGT GGTCGACGGC GCGCGGCAGA ACTATCAGCG CACCGGCCAT
AACGCCAACG GCTCGTTCTT CCTCGACCCG GAACTGATCG GCAGCATCGA CGTGGTGCGC
GGGCCGACCG CGAACATCTA CGGCTCAGGC GCGATCGGCG GCGTGGCGTC GTTTCGGACC
AAGGACATCA ACGACGTGCT GCGGCCGGGC GAGCGTTGGG GCGTCGACCT CAACGGCGCG
GTCGGCTCCA ACAACGGTCG CGGGCTCGGC TCGGTGTTCG GCGGTGTCCG CGTCGATCCC
ACCGTCGACG TGTTCGGCGG CGCGGTCTAC CGCACCCAGG GCAACTACAA GGACGGCGCG
GGCACCGAAA TCGGCAACAC CGGCAACGAG ATCGCCGCGG GGCTGCTCAA GCTCACGGTG
CGGCCCGCCG ACGGCCACGA GATCAAATTC GGCGGCGTGT TCCAGGACTA TCAATACAGC
ATCGGCCAGT TCAACCGCGG CGCGACCACT ACGGCGGCGC CGGCGAGTGC GATCGCCGGG
TCCTCGGTTT ACGATTCCGA CGCCAAGAAC TACACCGGCA CGCTGAACTG GAAATACTCC
AAGCCAGACG ACATGTGGTT CGACTGGAAT ATCAGCCTGT ACGGCAATCG CACCGACAAC
GATCAGGTCA AGACCTACAA CAACCGGATC TCAGCCGGCA GCGGCGTCTG CACGATTGCC
AATCCCGGCA ACAACATCTC CGGCTGCGTC GGCGACAGGC GCGGCTATCT GCTGGACACC
ATCGGGTTCG ACGTCAACAA CACCAGCCGT TTCGATCTCG GCGACTGGCG CAACGCCGTC
ACCTACGGCG TCGATGCCTT CCAGGACGAC GTTTCAACCT GGGACAGCCG CGGCAATTCC
AACATCACCA CGCCGAGCGG AACGCGCACC GTATCCGGCG GCTTCGTGCA GCTGAAGAAC
AACTATTCCT CGTGGCTCGA GGTGGTCAGC GCGCTGCGCT ACGATCATTA TGAACTCGCC
TCGGGCGCGA CGAGCGCCAG CGGCGAACGC CTGTCGCCAA AGATCTCCGT CGGCGTCACG
CCGCTGCAGG GTTTCACGCC CTATGTCAGC TACGCCGAGG GCTATCGCGC CCCCTCGATC
ACCGAGACGC TGATCTCGGG CGGGCACGCC ACCGGCGGCG GCCCGGTGTT GTTCAACTGT
CCGGATGGCA CCTCGGGGCT GTTCTGCTTT CTGCCCAACG CCGCCTTGCG GCCGGAAGTC
GGCAAGAACA AAGAGGTCGG CGTCAACCTG AAATACGACA GCGTGTTCAC GGCGGGCGAC
AGCTTCCGCG GCAAGTTCAA TCTGTTCCGC AACGATATCG ACGACTACAT CGATCTCGTC
GCATCGACCC CGGAACGATA TACGTCGATC TTCATGGGCT TCCCGATTTC CGGAACGTCG
AGCAAATATT ATCAGTACCA AAACACGCCG CACGCCCGGA TCGAGGGCTT CGAGGCCGAA
ACCATGTACG ACGCCGGACA ATGGTTCGCT GGCGTGTCGG CGACCGTGCA GCACGGCAAG
AACACGCAGA CCAATATCGG CCTGGCCACC GTGCAGCCGC GCAAGGTGAC GACCAGCGGG
GGAGTGCGGT TCCTGGAACG AAAGCTGACG ATTGCGGCGC TATGGTCGGC GGTCGCCGCC
AATACCGATA TCCCGGTTGG ATATCTGCCG TCGACCGCGT ACGATCTGGT CAACCTCTAT
GTGACCTACC AACCGACCGC CGACATCACG TTGAGCTTTT CGGTCGACAA CGTTCTGAAC
GAGTACTATC GGCCGTATGC AATTCCTAGC TCGGCAAGCG ACGGGACCAC GCAAAACGAT
GCCATTTGGA GCAGCCCCGG TCCCGGCATC GTCTACAAGG GAGGATTGAA AGTCCATTTC
GGAGGCGCTT AG
 
Protein sequence
MLGLSTRACA LLLSVSVVSL ATAPSYAQSA APEAQAPQPA KPKPAKRKRA VVEPAPAALQ 
ARAQIGQPTY QSLDVITVAA SKTEERAIDA LAPVSVVTLE QIQGIQPNRL ADVFRAVPGV
SFQDRGDDPA TAINIRGLQD FGRVAVVVDG ARQNYQRTGH NANGSFFLDP ELIGSIDVVR
GPTANIYGSG AIGGVASFRT KDINDVLRPG ERWGVDLNGA VGSNNGRGLG SVFGGVRVDP
TVDVFGGAVY RTQGNYKDGA GTEIGNTGNE IAAGLLKLTV RPADGHEIKF GGVFQDYQYS
IGQFNRGATT TAAPASAIAG SSVYDSDAKN YTGTLNWKYS KPDDMWFDWN ISLYGNRTDN
DQVKTYNNRI SAGSGVCTIA NPGNNISGCV GDRRGYLLDT IGFDVNNTSR FDLGDWRNAV
TYGVDAFQDD VSTWDSRGNS NITTPSGTRT VSGGFVQLKN NYSSWLEVVS ALRYDHYELA
SGATSASGER LSPKISVGVT PLQGFTPYVS YAEGYRAPSI TETLISGGHA TGGGPVLFNC
PDGTSGLFCF LPNAALRPEV GKNKEVGVNL KYDSVFTAGD SFRGKFNLFR NDIDDYIDLV
ASTPERYTSI FMGFPISGTS SKYYQYQNTP HARIEGFEAE TMYDAGQWFA GVSATVQHGK
NTQTNIGLAT VQPRKVTTSG GVRFLERKLT IAALWSAVAA NTDIPVGYLP STAYDLVNLY
VTYQPTADIT LSFSVDNVLN EYYRPYAIPS SASDGTTQND AIWSSPGPGI VYKGGLKVHF
GGA
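
The two sequence blocks above can be checked against each other by translating the gene sequence and comparing the result with the protein sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns amino acids exactly as the standard genetic code does (the tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same amino acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, dropping a trailing stop and any incomplete codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCTGGGGC TGAGCACGCG CGCCTGCGCG"
print(translate(first_blocks))  # MLGLSTRACA
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.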