Gene Francci3_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1072 
Symbol 
ID3906415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1275677 
End bp1278223 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content76% 
IMG OID637878406 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_480183 
Protein GI86739783 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.825071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAG GGCGGACCGG GCGGACCGGG CGGACCGGGC GCCGGGTCCG GCGCCGGGTC 
GTGGTGGAGG GCGTCGTCCA GGGCGTCGGT TTCCGGCCCT ACGTCCACCG GCTCGCGGGC
ACCCTCGGGC TCGCGGGATT CGTCGGCAAC GACTCGACCT CGGTGTTCGC CGAGGTCGAA
GGCGACGAGG TGGCCGTGGC CGAGTTCCTG CGCCGACTCG AACCGCAGGC GCCACCGCTC
GCGCGCATCG AGCGCGTCAC GGTGACGAGC GTGACCCGGT CGACCGCCGG CGACGCCGGG
TTCCGGATCG TGGCCGGCGC GACCGTTCCC GGCGATCGCA CCCTGGTACC CGCGGACACC
GCGGTGTGCG CCGACTGCCT GCGCGAGATG TTCGACCCGA CCGACCGGCG CCACCGCCAC
CCGTTCATCA CCTGCACCAA CTGCGGCCCG CGGTACACGA TCATCAGCGC GCTGCCGTAT
GACCGCGCGG CCACGACGAT GGCCCGCTTC CCGATGTGCG CGCGGTGCGC GGCGGAGTAC
GCCGACCCGG CGGACCGCCG TTTCCACGCC GAACCCGTCG CCTGCCCGTC CTGCGGACCG
AAGCTGTGGT TCAGCCCCGC GGACGGCACC GGACCGGAGG TCCACGGCAC GGACGCCGCG
CTGGCGGCGG CCCAGCGGGC ACTGGCGGAC GGCGCGGTGG TCGCGGTCAA GGGCGTCGGC
GGCTACCACC TGGCCTGCTC GGCCGAGAAC GACCCGGGCG ACACGGCCGG CGACACGGCC
CTGGCGCGCC TGCGCGCCCG CAAGGACCGG CCCGACAAGC CGTTCGCGGT GATGGTGCGC
GACCTGGCGA CGGCCGCCCT GGTGGCCGAC CTGTCGTCGG CGGAGGCGGC GCTGCTGGCA
TCGCCGGCCG CTCCGATCGT GCTCGTCCGC CGCTCGCCCG CGGCGGGGAT GTCAGGGATA
TCGGGGATGT CGCGGCTGGT CGCCCCCGGC ACTCCGCTGG TGGGGCTGCT GCTGCCGTAC
ACACCCGTGC ACCACCTGCT GTTCGCGCCG GTCCCGGGTG GCGGCCCGCG ACCGCCGCGG
ATGCTGGTGA TGACCAGTGG CAACCGTTCG GGCGAGCCCA TCTGCTTCGC CGACGACGAC
GCGCGGGACC GGCTCGCCGG CCTCGCTGAT GCGTTCCTCC GGCACGACCG ACCGATCCTC
CTGCCGTGTG ACGACTCGGT GGTGCGCTGC GGCTACGGGG ACGACGGAGA CGACGGGCAG
GTGCTGCCGG TGCGGCGCTC CCGCGGATAC GTTCCGCTGC CGGTGGACCT CGGCCGCCCC
GTCCGGTCGG TGCTCGCCGT GGGCGGCGAG GGCAAGAGCA CGTTCTGCCT CACCACGGAC
CGTCGCGCGT TCGTGTCCCA GCATCTGGGT GACATGGGCA GCCTGGCGGC GCTCAGGGCC
TTCGAGCGCT CCGCCGCGCA CCTGACCGAT CTGTACGGCG TCGCGCCGGA GACGCTCGCC
GCCGACCTCC ATCCGGGATA CGTCACCCGG GCCTGGGCCG AACGCGCGGC GGCTACGGGG
ACCGAACAGG CGGCAGCCGG GGACCCGGCG GCGGGGGCGA ACCAGACCTC CACCCACGGG
TCACGGTCCG CCGGCCGATT GCCGCACCTG GTCCAGCACC ACCATGCCCA TGTGGCCGCC
CTACTCGCCG AGCACGGCAG GCTCGGCGAC ACCATCCTCG GGGTCGCGTT CGACGGCACC
GGCTACGGGC TCGACGGCAC GATCTGGGGC GGCGAGGTGC TGCTGGTAGG CCCGGACGTC
GCCCGCGCCG AGCGGGTCGC GCACCTGCGT CCGATCGGCC TGCCCGGGGG CGACGCCGCC
GTCCGCAATC CGTACCGGGT CGCGCTCGCC CATCTGGCCG CCGCCGACCT GGACTGGACC
GCGGACCTGG CGCCGGTGCG GGCCTGCTCC GTGGCGGAGT TGCGCACCCT GCGCACCGCG
CTGGACCGGG GCGTGGCCTG CGTGCCCTGC ACGAGCATGG GCCGCCTGTT CGACGCGGTG
GCGTCACTGC TCGGGGTGCG TCACCGGATC ACCTACGAGG CGCAGGCCGC CGTCGAGCTG
GAGGCGCTCG CCGCGGCGGA GTTCGGCGGC GCCGGACTCG GCGGCGCCGG ACTCGGCGGC
GCCGGTGCCT GCCGGCTCTC CTTCGGCCTG GCGAACGGCG TCCTCGACCC GGTGGGGGTG
CTCGCGGGGA TCGTCGCCGG TCTGCGGGCG GGAGTGGCGG TGCCGGTGCT GGCCGGCGCG
TTCCATCTGG CGGTCGCGGA CGCGGTGGCC GAGGTCGCGG GGCTCGCGCG GCGGCGGTAC
GGAGTCCGCC TGGTCGGTCT GACCGGCGGG GTGTTCGCCA ACGTAGTGCT GACGCGGGCC
TGCCGCGCCC GGCTCGCCAC TGCCGGGTTC GACGTACTCG TCCATCGGGT GGTGCCGCCG
GGCGACGGTG GGCTGGCGCT GGGCCAGGCG GCGGTCGCGG CGCTGAGCCG CCCGACGGAG
CCGACCTTCG GCGGGAAGGA AGCCTGA
 
Protein sequence
MTGGRTGRTG RTGRRVRRRV VVEGVVQGVG FRPYVHRLAG TLGLAGFVGN DSTSVFAEVE 
GDEVAVAEFL RRLEPQAPPL ARIERVTVTS VTRSTAGDAG FRIVAGATVP GDRTLVPADT
AVCADCLREM FDPTDRRHRH PFITCTNCGP RYTIISALPY DRAATTMARF PMCARCAAEY
ADPADRRFHA EPVACPSCGP KLWFSPADGT GPEVHGTDAA LAAAQRALAD GAVVAVKGVG
GYHLACSAEN DPGDTAGDTA LARLRARKDR PDKPFAVMVR DLATAALVAD LSSAEAALLA
SPAAPIVLVR RSPAAGMSGI SGMSRLVAPG TPLVGLLLPY TPVHHLLFAP VPGGGPRPPR
MLVMTSGNRS GEPICFADDD ARDRLAGLAD AFLRHDRPIL LPCDDSVVRC GYGDDGDDGQ
VLPVRRSRGY VPLPVDLGRP VRSVLAVGGE GKSTFCLTTD RRAFVSQHLG DMGSLAALRA
FERSAAHLTD LYGVAPETLA ADLHPGYVTR AWAERAAATG TEQAAAGDPA AGANQTSTHG
SRSAGRLPHL VQHHHAHVAA LLAEHGRLGD TILGVAFDGT GYGLDGTIWG GEVLLVGPDV
ARAERVAHLR PIGLPGGDAA VRNPYRVALA HLAAADLDWT ADLAPVRACS VAELRTLRTA
LDRGVACVPC TSMGRLFDAV ASLLGVRHRI TYEAQAAVEL EALAAAEFGG AGLGGAGLGG
AGACRLSFGL ANGVLDPVGV LAGIVAGLRA GVAVPVLAGA FHLAVADAVA EVAGLARRRY
GVRLVGLTGG VFANVVLTRA CRARLATAGF DVLVHRVVPP GDGGLALGQA AVAALSRPTE
PTFGGKEA