Gene RPC_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4555 
Symbol 
ID3971835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5085283 
End bp5087640 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content70% 
IMG OID637927666 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_534396 
Protein GI90426026 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATAG ATCGCGGCGA CCGCGATGCG CCGCTGAGCG TGGGCGAACG CATCCGCGTC 
CGCGGCCTGG TGCAGGGAGT CGGCTTCCGC CCCTTCGTGC ACGCGCTGGC GCAGCGCCTG
GCGCTTCTGG GCTCGGTGCA CAACGACAGC GAAGGCGTGC TGATTCACGT CGCCGGCGAA
CGCGCCGCAA TCGAGGCGTT GGTCGCGGCG ATCAGCGATG AGGCGCCGGC ACTGGCCCAT
GTGGTCACGA TCGAGCGCGC GCCGTGGAGC GCGCCCTCTG GCCTCGTGAC ATTTGCGATC
GCGTCCTCGC CGGCCGAGCA CGGTGGTCGC AACACCGCGG GCGTGGTGCC GGATGCGCGG
ATCTGCGCCG CCTGCGCCGC CGAAATCGAC ACGCCCGGCG AGCGCCGCTA TCGCTATGCG
TTTGCCAGTT GCACCGCGTG CGGTCCGCGG TTCTCGATCA TCGACGCGAT CCCCTACGAC
CGACCCAACA CCACGATGCG CGACTTCGCG CTGTGCGCGC CATGCCGCAG CGAATACGAC
TCCCCGGCCG ACCGCCGCTT CCACGCCCAG CCGATCGCCT GCCCGGATTG CGGCCCGCAG
CTGTGGCTGG AGAACGCCGC CGGCGCGCGG GTGGCCGCCG ACGATCCACT CACCGCCGCC
GTCGACGCCT TGCGGCAGGG AAAAATCCTC GCGCTGAAAG GCCTCGGCGG ATTTCACTTG
GCCTGCGACG CCGGCAGCGA AGCCGCGGTC GACGCCTTGC GCGCGCGAAA ACGTCGCCCC
GCCAAACCCT TCGCGCTGAT GGCGGCGGAT CTCGAAGCTA TCCGGCGCTT CTGCCGGGCC
GATGCGCAGG AAGCCGCGCT GCTTTTGAGC CCGGCGGCGC CGATCGTGCT GCTGCCGTGG
CGACACCATG AGGGGCTGGC CGCCGCGGTG GCGCCGAACC AGTCGCTACT CGGCTTCATG
CTGCCGACCA CGCCGCTGCA TCATCTGCTG CTGAATGAAT TCGGCGGCGC GCTGGTGATG
ACCAGCGGCA ACGTCTCCGG CGAACCTCAA GTGATCGACA ATGCCGACGC GCAGCGCAAG
CTCGGCGGCT TCGCCGACTG TTTCCTGATG CACGACCGGC GGATCGCGCG GCGGCTCGAC
GATTCGGTGG CGCGCGTGGT CGGCGGCGAA ACGCGGCTGT TGCGCCACGC CCGCGGCTAC
GCGCCGGCGC CGCGCACGCT GCCGCCGGGG TTCGCTGCGG CGCCGCCGGT GCTGGCGCTG
GGCGGCGAAA TGAAAGGCGC GATTTGTCTC ACGCGCAACG ACGAGGCGCT GCTGTCGCAT
CACCTCGGCG ATCTCGAAGA GCCGTTGACC TATCGCGAAT TCGTCCGCGC CATCGACGAC
TACGCGCAGC TGTTCGACCA TCGCCCTTCT CTGCTCGCGG CCGATCTGCA CCCGGCCTAT
CGCAGCTCAG CCTGGGCCGA GCAAGCCGCC GCCGAGCGTG CTCTGCCGCT TGCGCGCGTG
CAGCATCATC ACGCGCATAT CGCGTCCGCG ATGGCGGAGC GGGGCTGGCC GCGCGACGGC
GGCCGCGTCG TCGGCATCGC GCTCGACGGC ATCGGCTATG GCAGCGACGG CACGGTGTGG
GGCGGCGAGA TTCTGCTGTG CGACTACATG GACTTCACCC GGATGGCGCA CTTGAAACCG
GTGCCGCTGC CGGGCGGCGC CCGCGCGGTG ACGCAGCCGT GGCGCAATCT GCTGGCGCAG
CTCGACGCGG CGTTCGGCAC TGAGGGCACC GCGGCGTGCC TGCCGGCATT GCCGGGCGGC
GCGATCCTCG CCGCGCAACA ACTTGGCGTG CTGCGGCAGG CGATGGCGCG CGGCATCAAT
TCGCCGCCGT CGTCGTCCTG CGGCCGGCTA TTCGATGCGG TCGCTGCAGC CCTCGCGCTG
GCGCCGGCGC AACTAAGCTT CGAGGGCGAA GCGGCGATGG CGCTGGAGGC GTTGGCCAGC
GGCAGCGCTG ATACGCGCGG CTATCCGTTT GCGCTCGACA CCGCGACGAC GCCCTGGAGC
ATCGATCCGG CGCCGATGTG GCGGGCGCTG TTAGACGACC TTGCGGCCGG CGTGCCGATC
GCCGCTATCG CGGCACGGTT TCACTTCGGG CTCGCCGACG CATTCTGTGA TTGCGCGCTA
AAGATCGCCA AGGCCAACGA CGCGCAAGCG ATCGCGCTCG GCGGCGGCGT GTTCCAGAAC
GGTCTGTTAC TTCAGGCTTG TCTCGCGCGG CTCAGTGCAA GTTCGCTGCC GGTGCTGTCG
CCGGCACAAA TCCCCGCCAA TGACGGCGGG CTGGCTTACG GCCAGGCGAT CATCGCCGCC
GCGCGGGCGC TGGCGTAG
 
Protein sequence
MTIDRGDRDA PLSVGERIRV RGLVQGVGFR PFVHALAQRL ALLGSVHNDS EGVLIHVAGE 
RAAIEALVAA ISDEAPALAH VVTIERAPWS APSGLVTFAI ASSPAEHGGR NTAGVVPDAR
ICAACAAEID TPGERRYRYA FASCTACGPR FSIIDAIPYD RPNTTMRDFA LCAPCRSEYD
SPADRRFHAQ PIACPDCGPQ LWLENAAGAR VAADDPLTAA VDALRQGKIL ALKGLGGFHL
ACDAGSEAAV DALRARKRRP AKPFALMAAD LEAIRRFCRA DAQEAALLLS PAAPIVLLPW
RHHEGLAAAV APNQSLLGFM LPTTPLHHLL LNEFGGALVM TSGNVSGEPQ VIDNADAQRK
LGGFADCFLM HDRRIARRLD DSVARVVGGE TRLLRHARGY APAPRTLPPG FAAAPPVLAL
GGEMKGAICL TRNDEALLSH HLGDLEEPLT YREFVRAIDD YAQLFDHRPS LLAADLHPAY
RSSAWAEQAA AERALPLARV QHHHAHIASA MAERGWPRDG GRVVGIALDG IGYGSDGTVW
GGEILLCDYM DFTRMAHLKP VPLPGGARAV TQPWRNLLAQ LDAAFGTEGT AACLPALPGG
AILAAQQLGV LRQAMARGIN SPPSSSCGRL FDAVAAALAL APAQLSFEGE AAMALEALAS
GSADTRGYPF ALDTATTPWS IDPAPMWRAL LDDLAAGVPI AAIAARFHFG LADAFCDCAL
KIAKANDAQA IALGGGVFQN GLLLQACLAR LSASSLPVLS PAQIPANDGG LAYGQAIIAA
ARALA