Gene Pars_1280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1280 
Symbol 
ID5055675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1156982 
End bp1159222 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content59% 
IMG OID640468827 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_001153496 
Protein GI145591494 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.565285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.208692 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGCAT TCAAGATATA CGCGGTGGGC ATAGTGCAAG GGGTGGGCTT TCGCCCTTAC 
GTGAAGATGT TGGCGGACTC GCTAGGAGTC GCAGGATACG TCAAGAACAT GGGAGGAGGC
GAGGTGGAGA TCTTCGTGGA GGGCGAGAGG GCCGCGAAGT TCGTCGAGGC GCTTTGGAAC
AGCAGGCCTA GGGCCATAGT GCTGGAGGAG CTCATAGTGC AGGAAGCCGA GCCCCAAGGC
CGTAGATCGT TCGAGATCTT GAAAAGCGAC GTGGAGGCCC GGACGCCTTC TAACGTGCCC
CCGGATCTCG CCATCTGCGA AGAGTGTCTG AGAGAAGTCC TCGAGGGGGA TGAGAGGAGG
CGCGGGTACT ATTTCAACTC TTGCAGCTTC TGCGGCCCCC GCTTCTCAGT GATGAGACGC
CTTCCCTACG ATAGGGAAAA CACCAGCTGG GTCTCCTTCC CCATGTGCCC CCAATGCGCG
GCCGAGTACT CTTCGCCCCG GCTCGGGGGT GTCAGGAGGT TCTTCTACCA GGGAATATCG
TGTAAGCTTG ATGGGCCTCG GGTGAGGCTC CTCGATTCTT CAGGCAAGCC GGTAGAGTCC
GACAATCCGG TCTTGAAAGC GGCTGAGCTG GTATCAAGGG GGTACATTGT CGCCGTTAAG
GGCATAGGTG GGTACCACAT ATTTGCCAGG GCGACTGACG ACTCTGTTGT GGCTGAGCTA
AGGCGCAGAA AGAGAAGGCC GAGCCAGCCC TTTGCGGTGA TGGCCTTGGA CTTGAGAGTC
GCGGAGCGCC TAGTCCACAT AGACGAGAGG GCTGCCAGCT TGTTAACGTC TCCTCAGAGG
CCCATAGTCC TTCTGCCCAA GAAAGAGGAC AGCCCAATCT CGCCGCTTGT GGCGCCCGGC
TTGGACAAGG AAGGTGTGTT CCTGCCCTAT ACGGCTCTGC AGTATCTGCT GTTGGCCCAC
ACCGATGACA AATTCGCTGT TGCCACCAGC GGAAACGTGC ACGGAGAACC TATGTGTAAG
GATCTCAAAT GCGCTTTAGA GAAGCTGGGG CGGGTAGTAG ACTACTTGCT GGATCACGAC
TTGGAGATCG TGCATAGGGT CGACGACAGC GTGTTGAGGT TCACAAACGG CGTGCCGACT
TTTCTGAGGA GGTCTAGGGG GTATGCCCCA GCGTGGATCC GCATACCCAG AAGGCTCAAG
AGACCGGTGG TGGCCTTCGG GGCGGATCTC CAGACGGCCG GCGCCGTGGC CTTCGAGGAC
AAGGTGGTTT TGACCCAGTA CATAGGCGAT CTCGACAACT TCGACGCCCT TAAGGATCTC
GACCAGGAGC TGAGGTGGTT CTTCAAGGCA TATAGGCTTA GGGATCCTGT GTTGGTCTGT
GACAAGAACC CGGCGTATAA CAGCACGAGG CTCTGCAAGG AGTGGGCAGA GGAGCTGGGG
GCGGAGACGT TCGTAGTTCA ACACCACCAC GCCCACGCAC TCGCCGCCGC AGCAGACGCC
AAGCTCGACG AGCCCTTCGT GGCCATAGCC ATAGACGGCG TAGGGTATGG CGATGACGGC
ATGGCTTGGG GTGGCGAGAT CCTCTTCGTG GAAGGGGCCA AGTATGTGAG GGAGCGCCAT
CTGCGCTACG TGCCCATGCC CGGAGGCGAT TTGGCCGCTT TGAGACCTGC CAGAATGGCC
GCGGCGTATT TCCACGAGGC CTTCGGCGAG GTGCCGATGT GGCTGGCGGA GCGTCTCCCC
GGAGGCCTTC CGGAGCTGGA GGTGGTAGAG AGGGAGCTCA AGGCGCCTAA GCTATTCACG
TCAAGCACGG GGAGGTTCCT AGACGCCGTG GCGGCGGCGT TGGGCGTGGC TTGGGAGCGC
ACCTACGAGG GCGAGCCTGC CATAAGGCTG GAGGCGGCGG CCGCAGGCGG AAGGCCTCTG
CCGCTTGAGG CAGAGGACCA AGTAGAGCTC TTCGCGCAGG CTGTCGAGGC GTATAGATCC
CGCCGACCAC TAAAAGACGT GGCGTACTCA GTACAGCTAA AGCTAGGCTC TATCCTTGGG
ACGTGGGCTT GCGAGAGCGC CCAGAGGAGA GGTGTCGATA CAGTAGCTGT GTCGGGAGGA
GCCGCAGTGA ACGACTTCAT AATAGCCGGC ATAGCCCAGG AGATCGTGAG TTGCGGCCTG
AGATTTATAC AACACATCAG GGCACCTCCA GGAGATGAAG GCATCGCCCT TGGTCAGTCA
TATATGACAT CATTTACATG A
 
Protein sequence
MRAFKIYAVG IVQGVGFRPY VKMLADSLGV AGYVKNMGGG EVEIFVEGER AAKFVEALWN 
SRPRAIVLEE LIVQEAEPQG RRSFEILKSD VEARTPSNVP PDLAICEECL REVLEGDERR
RGYYFNSCSF CGPRFSVMRR LPYDRENTSW VSFPMCPQCA AEYSSPRLGG VRRFFYQGIS
CKLDGPRVRL LDSSGKPVES DNPVLKAAEL VSRGYIVAVK GIGGYHIFAR ATDDSVVAEL
RRRKRRPSQP FAVMALDLRV AERLVHIDER AASLLTSPQR PIVLLPKKED SPISPLVAPG
LDKEGVFLPY TALQYLLLAH TDDKFAVATS GNVHGEPMCK DLKCALEKLG RVVDYLLDHD
LEIVHRVDDS VLRFTNGVPT FLRRSRGYAP AWIRIPRRLK RPVVAFGADL QTAGAVAFED
KVVLTQYIGD LDNFDALKDL DQELRWFFKA YRLRDPVLVC DKNPAYNSTR LCKEWAEELG
AETFVVQHHH AHALAAAADA KLDEPFVAIA IDGVGYGDDG MAWGGEILFV EGAKYVRERH
LRYVPMPGGD LAALRPARMA AAYFHEAFGE VPMWLAERLP GGLPELEVVE RELKAPKLFT
SSTGRFLDAV AAALGVAWER TYEGEPAIRL EAAAAGGRPL PLEAEDQVEL FAQAVEAYRS
RRPLKDVAYS VQLKLGSILG TWACESAQRR GVDTVAVSGG AAVNDFIIAG IAQEIVSCGL
RFIQHIRAPP GDEGIALGQS YMTSFT