Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1280 |
Symbol | |
ID | 5055675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1156982 |
End bp | 1159222 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468827 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_001153496 |
Protein GI | 145591494 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.565285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.208692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGGCAT TCAAGATATA CGCGGTGGGC ATAGTGCAAG GGGTGGGCTT TCGCCCTTAC GTGAAGATGT TGGCGGACTC GCTAGGAGTC GCAGGATACG TCAAGAACAT GGGAGGAGGC GAGGTGGAGA TCTTCGTGGA GGGCGAGAGG GCCGCGAAGT TCGTCGAGGC GCTTTGGAAC AGCAGGCCTA GGGCCATAGT GCTGGAGGAG CTCATAGTGC AGGAAGCCGA GCCCCAAGGC CGTAGATCGT TCGAGATCTT GAAAAGCGAC GTGGAGGCCC GGACGCCTTC TAACGTGCCC CCGGATCTCG CCATCTGCGA AGAGTGTCTG AGAGAAGTCC TCGAGGGGGA TGAGAGGAGG CGCGGGTACT ATTTCAACTC TTGCAGCTTC TGCGGCCCCC GCTTCTCAGT GATGAGACGC CTTCCCTACG ATAGGGAAAA CACCAGCTGG GTCTCCTTCC CCATGTGCCC CCAATGCGCG GCCGAGTACT CTTCGCCCCG GCTCGGGGGT GTCAGGAGGT TCTTCTACCA GGGAATATCG TGTAAGCTTG ATGGGCCTCG GGTGAGGCTC CTCGATTCTT CAGGCAAGCC GGTAGAGTCC GACAATCCGG TCTTGAAAGC GGCTGAGCTG GTATCAAGGG GGTACATTGT CGCCGTTAAG GGCATAGGTG GGTACCACAT ATTTGCCAGG GCGACTGACG ACTCTGTTGT GGCTGAGCTA AGGCGCAGAA AGAGAAGGCC GAGCCAGCCC TTTGCGGTGA TGGCCTTGGA CTTGAGAGTC GCGGAGCGCC TAGTCCACAT AGACGAGAGG GCTGCCAGCT TGTTAACGTC TCCTCAGAGG CCCATAGTCC TTCTGCCCAA GAAAGAGGAC AGCCCAATCT CGCCGCTTGT GGCGCCCGGC TTGGACAAGG AAGGTGTGTT CCTGCCCTAT ACGGCTCTGC AGTATCTGCT GTTGGCCCAC ACCGATGACA AATTCGCTGT TGCCACCAGC GGAAACGTGC ACGGAGAACC TATGTGTAAG GATCTCAAAT GCGCTTTAGA GAAGCTGGGG CGGGTAGTAG ACTACTTGCT GGATCACGAC TTGGAGATCG TGCATAGGGT CGACGACAGC GTGTTGAGGT TCACAAACGG CGTGCCGACT TTTCTGAGGA GGTCTAGGGG GTATGCCCCA GCGTGGATCC GCATACCCAG AAGGCTCAAG AGACCGGTGG TGGCCTTCGG GGCGGATCTC CAGACGGCCG GCGCCGTGGC CTTCGAGGAC AAGGTGGTTT TGACCCAGTA CATAGGCGAT CTCGACAACT TCGACGCCCT TAAGGATCTC GACCAGGAGC TGAGGTGGTT CTTCAAGGCA TATAGGCTTA GGGATCCTGT GTTGGTCTGT GACAAGAACC CGGCGTATAA CAGCACGAGG CTCTGCAAGG AGTGGGCAGA GGAGCTGGGG GCGGAGACGT TCGTAGTTCA ACACCACCAC GCCCACGCAC TCGCCGCCGC AGCAGACGCC AAGCTCGACG AGCCCTTCGT GGCCATAGCC ATAGACGGCG TAGGGTATGG CGATGACGGC ATGGCTTGGG GTGGCGAGAT CCTCTTCGTG GAAGGGGCCA AGTATGTGAG GGAGCGCCAT CTGCGCTACG TGCCCATGCC CGGAGGCGAT TTGGCCGCTT TGAGACCTGC CAGAATGGCC GCGGCGTATT TCCACGAGGC CTTCGGCGAG GTGCCGATGT GGCTGGCGGA GCGTCTCCCC GGAGGCCTTC CGGAGCTGGA GGTGGTAGAG AGGGAGCTCA AGGCGCCTAA GCTATTCACG TCAAGCACGG GGAGGTTCCT AGACGCCGTG GCGGCGGCGT TGGGCGTGGC TTGGGAGCGC ACCTACGAGG GCGAGCCTGC CATAAGGCTG GAGGCGGCGG CCGCAGGCGG AAGGCCTCTG CCGCTTGAGG CAGAGGACCA AGTAGAGCTC TTCGCGCAGG CTGTCGAGGC GTATAGATCC CGCCGACCAC TAAAAGACGT GGCGTACTCA GTACAGCTAA AGCTAGGCTC TATCCTTGGG ACGTGGGCTT GCGAGAGCGC CCAGAGGAGA GGTGTCGATA CAGTAGCTGT GTCGGGAGGA GCCGCAGTGA ACGACTTCAT AATAGCCGGC ATAGCCCAGG AGATCGTGAG TTGCGGCCTG AGATTTATAC AACACATCAG GGCACCTCCA GGAGATGAAG GCATCGCCCT TGGTCAGTCA TATATGACAT CATTTACATG A
|
Protein sequence | MRAFKIYAVG IVQGVGFRPY VKMLADSLGV AGYVKNMGGG EVEIFVEGER AAKFVEALWN SRPRAIVLEE LIVQEAEPQG RRSFEILKSD VEARTPSNVP PDLAICEECL REVLEGDERR RGYYFNSCSF CGPRFSVMRR LPYDRENTSW VSFPMCPQCA AEYSSPRLGG VRRFFYQGIS CKLDGPRVRL LDSSGKPVES DNPVLKAAEL VSRGYIVAVK GIGGYHIFAR ATDDSVVAEL RRRKRRPSQP FAVMALDLRV AERLVHIDER AASLLTSPQR PIVLLPKKED SPISPLVAPG LDKEGVFLPY TALQYLLLAH TDDKFAVATS GNVHGEPMCK DLKCALEKLG RVVDYLLDHD LEIVHRVDDS VLRFTNGVPT FLRRSRGYAP AWIRIPRRLK RPVVAFGADL QTAGAVAFED KVVLTQYIGD LDNFDALKDL DQELRWFFKA YRLRDPVLVC DKNPAYNSTR LCKEWAEELG AETFVVQHHH AHALAAAADA KLDEPFVAIA IDGVGYGDDG MAWGGEILFV EGAKYVRERH LRYVPMPGGD LAALRPARMA AAYFHEAFGE VPMWLAERLP GGLPELEVVE RELKAPKLFT SSTGRFLDAV AAALGVAWER TYEGEPAIRL EAAAAGGRPL PLEAEDQVEL FAQAVEAYRS RRPLKDVAYS VQLKLGSILG TWACESAQRR GVDTVAVSGG AAVNDFIIAG IAQEIVSCGL RFIQHIRAPP GDEGIALGQS YMTSFT
|
| |