Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4854 |
Symbol | |
ID | 5673194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5820928 |
End bp | 5823315 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243709 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_001509125 |
Protein GI | 158316617 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.623858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.1985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGCGG CCTCGCCGGC CGCGGGTGAC CGGTCGGCCG GCCGGATCCG GCGGCGGGTG ACGGTCGAGG GGGTCGTCCA GGGGGTCGGC TTCCGGCCGC ATGTCCACCG GCTCGCCACC GCGCTGGGGC TGGCCGGTCT CGTCGGCAAC GAAGCCGGCT GCGTCGTCGC CGAGGTGGAG GGCGACGGGC CGGCGGTCGC CGAGTTCCTG CGCCGGCTGG CCACGCCGGC CCCGCCGCTG GCCCGTGTCG ACCGCGTCGC CGTCACCCAC CTGAATCACC GTGGAGACGA CGGATTCCGG ATCGTGGCCA GCACCTCGGC GGCCGGTGCC CGGACCATGG TGGCGCCCGA CGCCGCCGTG TGCGCCGACT GCCTGCGAGA ACTGTTCGAC CCGGCGGACC GGCGGTACCG CCACCCCTTC GTCACCTGCA CGAACTGCGG GCCACGCTTC ACCATCATCG AGGCGCTGCC CTACGACCGC GCCACCACGA CGATGGCCCG GTTCCCGATG TGCGCACGGT GCGCAGCCGA GTACACCGAC CCCCGCGACC GGCGGTTCCA CGCCGAGCCC GTCTGCTGCC CGGCCTGCGG GCCGCGGCTG TGGTTCCGCG TGGCGGTCGC GGCCCAGGCC GGCCGGGAGG GACGCGAGAC GCACGGCACG GACACCGCGC TGGCCGCGGC CCAGCGCGCC CTCGCCGCGG GACGGATCGT GGCGGTCAAG GGGATCGGCG GCTTCCACCT GGCCTGCGGC GCGGACGACA GCCGCGCCGT CGAGCTCCTG CGCGCGCGCA AGGGACGGCC CGACAGGCCC TTCGCAGTGA TGGTGCGCGA CCTGGCGACG GCGGCCGAGA TGGCCGATCT CTCGGCCGCC GAGGCCGAAC TGCTCACCTC CACGGCCGCT CCGATAGTGC TCGCCAGGCG CCGGCCCGGT GCCCCCCTGT CGGACCGGGT CGCGCCCGGG AGCCCCCTGG TGGGCCTGCT GCTGCCGTAC ACGCCGGTGC ACCACCTGCT GTTCGCGCCG GTGCCCGGCG GCGGCCCGCC CCCGCCCCGG GCGCTGGTGA TGACCAGCGG CAACCGCTCG GGCGACCCGA TCTGCTTCGC CGACGCCGAC GCCGACGCCG ACGCCGGGCG GCTGGCCGGG CTGGCCGACG CGTACCTGCT GCACGACCGG CCGATCCTGC AGCCGTGCGA CGACTCGGTC GTGCGGTGGG ACGGCGAGCA GGTGCTGCCG CTGCGCCGCT CCCGCGGCTA CGTGCCGCTG CCGGTGGACC TCGGCCGTCC GGTGGAGTCC GTGCTCGCCG TGGGCGGCGA CGGCAAGTCC GCGTTCTGCC TGACCGCCGG CCGCCGGGCG ATCGTCTCGC AGCACCTGGG CGATATGGGC GGACTCGACG CGCTGCTGGC GCTGGAGCGT GCGAGCGCGC AACTGACCGA CCTGTACGCC GCCGAGCCGG CGACCGTCGC CGCGGACCTG CACCCCGGCT ACGTCACCCG GGCCTGGGCG GGCCGGCGGG CGGCCGGCCA GGGCGGCCGG ACCCACCTGG TGCAGCACCA CCACGCCCAC GTGGCCGCGC TGCTCGCCGA ACACGGCCGC CTCGGCGACA CCATCCTCGG GATGGCCTTC GACGGCACGG GCTACGGGCT CGACGGCACG ATCTGGGGCG GCGAGGCACT GTTGGTCGGC CCGGACGTCA CCCACGCCGA CCGGGTCGCG CACCTGCGTC CCGTCGCGCT GCCCGGCGGC GACGCGGCGG CGCGCGGCCC CTACCGCTGC GCGCTGGCCC ACCTGGCCGC CGCCGGCGTC GAGTGGACGG CCGACCTGGC CCCGGTGCGG GCCTGCTCCC CCACCGAGCT GCGGGCGCTG CGGGCGATGG TGGACCGCGG CGTGTCCTGC GTGCCCAGCA GCAGCATGGG CCGCCTGTTC GACGCGGTCG CCTCCCTGCT CGGCGTGCGC CAGCGGAGCA CCTTCGAGGC CCAGGCCGCC CTCGAGCTGG AGGCGCTCGC CGCCGCCAGC CGCCGGCCCG GTCCCGCGGT CGCCTTCAGG TTCGACGGCC GCGCGCTCGA TCCGGCGCCG GTGATCGCGG AGATCGTCGA CGGGTTGCAC GCGGGCCTCG CCCCGGACGC GCTCGCCGCG GCGTTCCACC TGGCGGTCGC CGACGCGGTG ACCCGGGTCG CGCAGACGAC CCGGCGCCGC CGCGGTGTCG GCCTGGTTGG GCTGACCGGG GGGGTGTTCG CCAATGTCGT CCTCGTGCGG GCCTGCCGGG CCCAGCTTGC CGCCGCGGGA TTCGAGGTGC TCGTCCACCG TGTAGTCCCG CCGGGTGACG GCGGGCTGGC CCTCGGCCAG GCCGCGATCG CCACGGCCGC CGCCCGGGCC CTTGCCGACC CTTGTTAG
|
Protein sequence | MPAASPAAGD RSAGRIRRRV TVEGVVQGVG FRPHVHRLAT ALGLAGLVGN EAGCVVAEVE GDGPAVAEFL RRLATPAPPL ARVDRVAVTH LNHRGDDGFR IVASTSAAGA RTMVAPDAAV CADCLRELFD PADRRYRHPF VTCTNCGPRF TIIEALPYDR ATTTMARFPM CARCAAEYTD PRDRRFHAEP VCCPACGPRL WFRVAVAAQA GREGRETHGT DTALAAAQRA LAAGRIVAVK GIGGFHLACG ADDSRAVELL RARKGRPDRP FAVMVRDLAT AAEMADLSAA EAELLTSTAA PIVLARRRPG APLSDRVAPG SPLVGLLLPY TPVHHLLFAP VPGGGPPPPR ALVMTSGNRS GDPICFADAD ADADAGRLAG LADAYLLHDR PILQPCDDSV VRWDGEQVLP LRRSRGYVPL PVDLGRPVES VLAVGGDGKS AFCLTAGRRA IVSQHLGDMG GLDALLALER ASAQLTDLYA AEPATVAADL HPGYVTRAWA GRRAAGQGGR THLVQHHHAH VAALLAEHGR LGDTILGMAF DGTGYGLDGT IWGGEALLVG PDVTHADRVA HLRPVALPGG DAAARGPYRC ALAHLAAAGV EWTADLAPVR ACSPTELRAL RAMVDRGVSC VPSSSMGRLF DAVASLLGVR QRSTFEAQAA LELEALAAAS RRPGPAVAFR FDGRALDPAP VIAEIVDGLH AGLAPDALAA AFHLAVADAV TRVAQTTRRR RGVGLVGLTG GVFANVVLVR ACRAQLAAAG FEVLVHRVVP PGDGGLALGQ AAIATAAARA LADPC
|
| |