Gene Franean1_4854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4854 
Symbol 
ID5673194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5820928 
End bp5823315 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content77% 
IMG OID641243709 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_001509125 
Protein GI158316617 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.623858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCGG CCTCGCCGGC CGCGGGTGAC CGGTCGGCCG GCCGGATCCG GCGGCGGGTG 
ACGGTCGAGG GGGTCGTCCA GGGGGTCGGC TTCCGGCCGC ATGTCCACCG GCTCGCCACC
GCGCTGGGGC TGGCCGGTCT CGTCGGCAAC GAAGCCGGCT GCGTCGTCGC CGAGGTGGAG
GGCGACGGGC CGGCGGTCGC CGAGTTCCTG CGCCGGCTGG CCACGCCGGC CCCGCCGCTG
GCCCGTGTCG ACCGCGTCGC CGTCACCCAC CTGAATCACC GTGGAGACGA CGGATTCCGG
ATCGTGGCCA GCACCTCGGC GGCCGGTGCC CGGACCATGG TGGCGCCCGA CGCCGCCGTG
TGCGCCGACT GCCTGCGAGA ACTGTTCGAC CCGGCGGACC GGCGGTACCG CCACCCCTTC
GTCACCTGCA CGAACTGCGG GCCACGCTTC ACCATCATCG AGGCGCTGCC CTACGACCGC
GCCACCACGA CGATGGCCCG GTTCCCGATG TGCGCACGGT GCGCAGCCGA GTACACCGAC
CCCCGCGACC GGCGGTTCCA CGCCGAGCCC GTCTGCTGCC CGGCCTGCGG GCCGCGGCTG
TGGTTCCGCG TGGCGGTCGC GGCCCAGGCC GGCCGGGAGG GACGCGAGAC GCACGGCACG
GACACCGCGC TGGCCGCGGC CCAGCGCGCC CTCGCCGCGG GACGGATCGT GGCGGTCAAG
GGGATCGGCG GCTTCCACCT GGCCTGCGGC GCGGACGACA GCCGCGCCGT CGAGCTCCTG
CGCGCGCGCA AGGGACGGCC CGACAGGCCC TTCGCAGTGA TGGTGCGCGA CCTGGCGACG
GCGGCCGAGA TGGCCGATCT CTCGGCCGCC GAGGCCGAAC TGCTCACCTC CACGGCCGCT
CCGATAGTGC TCGCCAGGCG CCGGCCCGGT GCCCCCCTGT CGGACCGGGT CGCGCCCGGG
AGCCCCCTGG TGGGCCTGCT GCTGCCGTAC ACGCCGGTGC ACCACCTGCT GTTCGCGCCG
GTGCCCGGCG GCGGCCCGCC CCCGCCCCGG GCGCTGGTGA TGACCAGCGG CAACCGCTCG
GGCGACCCGA TCTGCTTCGC CGACGCCGAC GCCGACGCCG ACGCCGGGCG GCTGGCCGGG
CTGGCCGACG CGTACCTGCT GCACGACCGG CCGATCCTGC AGCCGTGCGA CGACTCGGTC
GTGCGGTGGG ACGGCGAGCA GGTGCTGCCG CTGCGCCGCT CCCGCGGCTA CGTGCCGCTG
CCGGTGGACC TCGGCCGTCC GGTGGAGTCC GTGCTCGCCG TGGGCGGCGA CGGCAAGTCC
GCGTTCTGCC TGACCGCCGG CCGCCGGGCG ATCGTCTCGC AGCACCTGGG CGATATGGGC
GGACTCGACG CGCTGCTGGC GCTGGAGCGT GCGAGCGCGC AACTGACCGA CCTGTACGCC
GCCGAGCCGG CGACCGTCGC CGCGGACCTG CACCCCGGCT ACGTCACCCG GGCCTGGGCG
GGCCGGCGGG CGGCCGGCCA GGGCGGCCGG ACCCACCTGG TGCAGCACCA CCACGCCCAC
GTGGCCGCGC TGCTCGCCGA ACACGGCCGC CTCGGCGACA CCATCCTCGG GATGGCCTTC
GACGGCACGG GCTACGGGCT CGACGGCACG ATCTGGGGCG GCGAGGCACT GTTGGTCGGC
CCGGACGTCA CCCACGCCGA CCGGGTCGCG CACCTGCGTC CCGTCGCGCT GCCCGGCGGC
GACGCGGCGG CGCGCGGCCC CTACCGCTGC GCGCTGGCCC ACCTGGCCGC CGCCGGCGTC
GAGTGGACGG CCGACCTGGC CCCGGTGCGG GCCTGCTCCC CCACCGAGCT GCGGGCGCTG
CGGGCGATGG TGGACCGCGG CGTGTCCTGC GTGCCCAGCA GCAGCATGGG CCGCCTGTTC
GACGCGGTCG CCTCCCTGCT CGGCGTGCGC CAGCGGAGCA CCTTCGAGGC CCAGGCCGCC
CTCGAGCTGG AGGCGCTCGC CGCCGCCAGC CGCCGGCCCG GTCCCGCGGT CGCCTTCAGG
TTCGACGGCC GCGCGCTCGA TCCGGCGCCG GTGATCGCGG AGATCGTCGA CGGGTTGCAC
GCGGGCCTCG CCCCGGACGC GCTCGCCGCG GCGTTCCACC TGGCGGTCGC CGACGCGGTG
ACCCGGGTCG CGCAGACGAC CCGGCGCCGC CGCGGTGTCG GCCTGGTTGG GCTGACCGGG
GGGGTGTTCG CCAATGTCGT CCTCGTGCGG GCCTGCCGGG CCCAGCTTGC CGCCGCGGGA
TTCGAGGTGC TCGTCCACCG TGTAGTCCCG CCGGGTGACG GCGGGCTGGC CCTCGGCCAG
GCCGCGATCG CCACGGCCGC CGCCCGGGCC CTTGCCGACC CTTGTTAG
 
Protein sequence
MPAASPAAGD RSAGRIRRRV TVEGVVQGVG FRPHVHRLAT ALGLAGLVGN EAGCVVAEVE 
GDGPAVAEFL RRLATPAPPL ARVDRVAVTH LNHRGDDGFR IVASTSAAGA RTMVAPDAAV
CADCLRELFD PADRRYRHPF VTCTNCGPRF TIIEALPYDR ATTTMARFPM CARCAAEYTD
PRDRRFHAEP VCCPACGPRL WFRVAVAAQA GREGRETHGT DTALAAAQRA LAAGRIVAVK
GIGGFHLACG ADDSRAVELL RARKGRPDRP FAVMVRDLAT AAEMADLSAA EAELLTSTAA
PIVLARRRPG APLSDRVAPG SPLVGLLLPY TPVHHLLFAP VPGGGPPPPR ALVMTSGNRS
GDPICFADAD ADADAGRLAG LADAYLLHDR PILQPCDDSV VRWDGEQVLP LRRSRGYVPL
PVDLGRPVES VLAVGGDGKS AFCLTAGRRA IVSQHLGDMG GLDALLALER ASAQLTDLYA
AEPATVAADL HPGYVTRAWA GRRAAGQGGR THLVQHHHAH VAALLAEHGR LGDTILGMAF
DGTGYGLDGT IWGGEALLVG PDVTHADRVA HLRPVALPGG DAAARGPYRC ALAHLAAAGV
EWTADLAPVR ACSPTELRAL RAMVDRGVSC VPSSSMGRLF DAVASLLGVR QRSTFEAQAA
LELEALAAAS RRPGPAVAFR FDGRALDPAP VIAEIVDGLH AGLAPDALAA AFHLAVADAV
TRVAQTTRRR RGVGLVGLTG GVFANVVLVR ACRAQLAAAG FEVLVHRVVP PGDGGLALGQ
AAIATAAARA LADPC