Gene Franean1_2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2481 
Symbol 
ID5670877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2959567 
End bp2961774 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content75% 
IMG OID641241398 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_001506819 
Protein GI158314311 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGTCT CCGGCGGTCC CGCGCCGCCC GGAGCCGGGC CGGCGGCGCC GGGGTTCCGG 
ATCGCCCCCA GCGACCCGGC CCGCGGCGGG GCCGGGCCGC GGACGCGGGT CTGCGCGGAC
AGTGCCACCT GTGCGGACTG CCTGGCCGAG CTCGCCGACC CGGCCGACCG GCGCTACCGC
TACCCGTTCA TCAACTGCAC CAACTGCGGC CCCCGCTTCA CGATCATCCG GGACGTCCCC
TACGACCGGC CGGCCACGAC GATGGCCGGA TTCCGGATGT GCGCGCGGTG CCGGGCGGAG
TACGACGACC CGGCCGACCG CCGCTTCCAC GCCCAGCCGG TGTGCTGTCC GGACTGCGGC
CCCACCCTCA GCTTCCGGCC CACCGCCGCC GCGTCGCCGA CGTCCGACGA GCCGCTGGAG
CGAGCAGCGG GGCTGCTGCG GGGCGGCGGG ATTCTCGCGG TGAAGGGGCT GGGCGGGTAC
CACCTGGCCG TGCTCGCCGC GGACGGGTCC GCGGTGGCGA CCCTGCGCCG GCGCAAGCAC
CGGGACGCCA AGCCGTTCGC CGTCATGGTG CGCGACCTGG CGACCGTCGA GGAGATGTGC
TGCCTCGACC CCACCGCGCG CCTCCTGCTC ACCGGGCGGC GCCGGCCGAT CGTCCTGGTC
GAGCGCTGCC GGCCCGCTAG GCGGGTCGCC GAGGCGGTGG CGCCGGGCAC CGGGACGCTC
GGCGTGATGC TGCCGTACAC GCCGCTGCAC CACCTTCTGC TCGGCCTGCT GCCCGGCCCG
ATCGTGCTGA CCAGCGGCAA CATCTCCAAC GAGCCGATCG TGTACCACGA CGAGGAGGCC
CTCGCCGCGC TCAGCCCGGT CGCGGACGGT TTCCTCGCCC ACGACCGGGC CATCCACACC
AGGCTCGACG ACTCGGTCGC CCGGTCGGTG CGCGGCCGGG AGACGCTCGT GCGCCGCTCC
CGCGGCTACG CCCCGGAGCC GCTGCCGCTG CCCTGGGACG TCCAGCGGCC GGTGCTGGCC
TGCGGGGCCG AGCTGAAGAA CACCGTGTGC CTGGCGGCGG GGCGGCGGGC GTACCTCTCC
GGCCACATCG GGGACCTGGA GAACCACGAG ACGCTGCGGT CGTTCGTCAG CGGGATCGAC
CACCTGCGCC GGCTGTTCGA CATCTCGCCC GAGCTGGTAG CGCACGACCT GCACCCGGAG
TATCTGTCCA CGAAGTGGGC GCTGGAGCAG GACCTGCCGG TCGTCGGCGT GCAGCATCAC
CACGCGCACA TTGCCTCCTG CCTGGCGGAC AACGGCCATC CCGGGCCGGT GCTCGGTGTG
GCCTTCGACG GCCTCGGCTT CGGTCCGGAC GGCACCCTGT GGGGCGGGGA GTTCCTGTGG
GCCGACCTGG CCGGCTTCAC CCGGCTCGCA CATCTCACGC CCGTCCCGAT GCCCGGGGGC
GTCGCCGCGA TCCGGGCACC ATGGCGGATG GCCGCCGCCC ACCTGCTCGC CGCCGGCGTC
GCCGACCCGG ACCGCCTCGC GGTCGCCCGC CGCAACGCCG ACCGCTGGGA CGCGGTCGCC
GCCCTGGCCA GGTCCGGTGC CGCCCCGTCC CCGGTGGGTG CCCCGCTGAC CAGCAGCGTC
GGACGGCTGT TCGACGCCGT CGCGGCTCTC GTCGGCGTCC GGGACACGAT CACCTACGAA
GGGCAGGCCG CGATCGAGCT GGAGCGGATC GCCGCCCCCG GCGACTACGG CCGCTACCCG
GCGAGCCCCC CGCCGTCGGG TGCCGGGCTG CTGCTGCCGG GCGCCGACCT GATCCGCGGC
GTCGTGGACG ACCTGCGCGA ACGCGTCGAC CCCACGGTGA TCGCCGCCCG GTTCCACAGC
ACGCTGGCCG GGCTCACCGT CGACATCTGT GCCGCCCTGC GTGCGGGGCT CCCGGGGCGC
CAGGCGGGAG CGGTGGCGCT CTCCGGCGGC GTCTTCGGCA ACCTGCGGCT GCTCGGCGAG
ATCATGGACG GGCTGGTAGA CCGGGGCTTC ACCGTCCTGA CCCACTCCCG GGTGCCCTGC
AACGACGGCG GGATCAGCCT CGGTCAGGCG GTGGTCGCGA ACGCCCGCAG CCAGCCGTAC
CAGTCGTCGA GCCCGGTGCC GGCGATCGCC GACACCGGCA GCACCGGGGC GGACGGGTTG
ACCAGCCGCA CGGACTCGGC GAACAGGTCA CCGTCGAAGG CGAGGTAG
 
Protein sequence
MDVSGGPAPP GAGPAAPGFR IAPSDPARGG AGPRTRVCAD SATCADCLAE LADPADRRYR 
YPFINCTNCG PRFTIIRDVP YDRPATTMAG FRMCARCRAE YDDPADRRFH AQPVCCPDCG
PTLSFRPTAA ASPTSDEPLE RAAGLLRGGG ILAVKGLGGY HLAVLAADGS AVATLRRRKH
RDAKPFAVMV RDLATVEEMC CLDPTARLLL TGRRRPIVLV ERCRPARRVA EAVAPGTGTL
GVMLPYTPLH HLLLGLLPGP IVLTSGNISN EPIVYHDEEA LAALSPVADG FLAHDRAIHT
RLDDSVARSV RGRETLVRRS RGYAPEPLPL PWDVQRPVLA CGAELKNTVC LAAGRRAYLS
GHIGDLENHE TLRSFVSGID HLRRLFDISP ELVAHDLHPE YLSTKWALEQ DLPVVGVQHH
HAHIASCLAD NGHPGPVLGV AFDGLGFGPD GTLWGGEFLW ADLAGFTRLA HLTPVPMPGG
VAAIRAPWRM AAAHLLAAGV ADPDRLAVAR RNADRWDAVA ALARSGAAPS PVGAPLTSSV
GRLFDAVAAL VGVRDTITYE GQAAIELERI AAPGDYGRYP ASPPPSGAGL LLPGADLIRG
VVDDLRERVD PTVIAARFHS TLAGLTVDIC AALRAGLPGR QAGAVALSGG VFGNLRLLGE
IMDGLVDRGF TVLTHSRVPC NDGGISLGQA VVANARSQPY QSSSPVPAIA DTGSTGADGL
TSRTDSANRS PSKAR