Gene Francci3_1945 details

Gene Information

Locus tag: Francci3_1945
Symbol:
ID: 3904307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2284957
End bp: 2287407
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 74%
IMG OID: 637879282
Product: (NiFe) hydrogenase maturation protein HypF
Protein accession: YP_481049
Protein GI: 86740649
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
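
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. Neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2284957
end_bp = 2287407

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2451 0 816
```

Under these assumptions the result matches the listed gene length of 2451 bp and protein length of 816 aa.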


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.368168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGGCGG ACGGCACGGC GGGCCGCATG ACCCGGCCGG TGCTCTCGGT GGGCGGGGCC 
GGTGAACCGA GCCGGGAGGC CGGCCCAGGC GGCCGGGCCC GGCGACGGTT CGTCGTCCAG
GGGCTGGTCC AGGGGGTCGG CTTCCGGCCG TTCGTCCACG CCGCCGCGAC GGAGCTGGCT
CTGACCGGCT GGGTGCGCAA CGACACCGGC GGCGTGGTCG CGGAGGTCGA GGGAACCCCC
GGCGCAGTCG AGGACTTCGC CCGCCGGCTT CGCTGCGACG TGCCGCCGCT GGCCATGGTC
GAGCGGGTCG TGACCACGGA ACTTCCGCCC TGCGGCGGAT CCGGCTTCGC GATCGTCCGA
TCCCGGCCGG CCGCGGGTGG GCGTACGGCG GCGGCCCCCG ACGTGGCGAC ATGCCCGGAT
TGCCTGCGGG AACTGGCCGA TCCCGCGGAC CGGCGTTACC GGCATCCGTT CATCACGTGC
ACCAACTGCG GACCCAGGTT CACGATCATC ACCGACCTTC CCTACGACCG GCCGGCAACG
ACGATGGCCC GGTTCGCCAT GTGCCCCGTG TGTGAGCGGG AGTACTCCGA TCCGGCCGAC
CGACGTTTCC ATGCCCAGCC GATCGCCTGC CCGGCGTGCG GACCGCGCCT GGAGTTCGTG
TCGCCGTTGG GCGCGCCGCG GACCGGCGAG GACGCGCTCG CCGCGGCGCG GCGGCTGCTC
GCCGACGGTG GCGTCCTGGC GGTGAAGGGG GTGGGCGGCT ACCACCTCGC CTGCGTCGCC
ACCGACCCGA CGGCGGTGAC GACCCTGCGA CGCCGCAAAC GGCGCGGCGG CAAGCCGTTC
GCCGTCATGG TCCACGACCT GGCTGCGGCC CGGGCCCTGG CGCATGTCGA CGAGCGGGAG
GCGGCCCTGC TCACCGACCC CGCTCGGCCC ATCGTGCTGG TGCGCCGCCG TCGCGACGGC
CGTCCCGCGC TCGCCGACTC CGTTGCCCCG GACAACCCCG ATCTCGGTCT GCTGCTGCCC
TACACGCCGG TGCACCATCT GCTGCTCGGC CTGCCAGGCG AGCCCGTGCC CCTCCATCGA
ACCGTCGGCA CCGCCGGATG GCCGCCGTTG GTGATGACCT CCGGGAACCT CGGCGGCGAG
CCGATCGCGG CCGAGGACAC CGACGCCCTG CGCCGGCTCG CCCCGCTTGT CGATGGCTGG
CTGCGCCACG ACCGTCCGAT CCGGGTGCCC TGCGACGACT CCGTTGTCCG GGTGGTCGAT
GGTGCCCCGT CGCCGGTACG CCGCTCGCGT GGCTACGCGC CGTTGCCGGT GCCGCTGCCG
TTCGAGGTGC CCCCGACGCT GGCCGTCGGG GCCGACCTCA AGAACACCTG CGGCCTGGGC
GCGGGCCGGT CGGCCTGGCT CAGCGGGCAC ATCGGGGACA TGGACGACCT GGCCACCCTG
CGCGCCTTCG ACGCCGCCGA GCGGCACCTC GAGCACCTGA CCGGCGTCAC CCCCCGCCAG
CTCGTCACCG ACGCGCATCC GGGCTATCGG TCGCGCCAGT GGGCTGTGCG GCACGCCGCC
GGCCGACCGA TACGGACCGT GCAGCATCAC CATGCCCATG TGGCCGCGCT GATGGCCGAG
CACGGCCTCG ACGGCACCCG GCCGGTGGTG GGATTCGCCT TTGACGGGAC CGGCTACGGC
CCCGACCGGG CCGTGTGGGG CGGCGAGGTG CTGATCGCCG ACTACCGCGG GTTCCGGCGC
TTCGCCCACC TGGGCTATGT GCCGCTTGCC GGTGGGGACG CCGCGGTTCG GCGGCCCTGC
CGGATGGCGC TCGCCCATCT GCACGCCGCC GGAGTCCGCT GGGATCCCAC CCTGCCGCCG
GTCGTCGCCT GCCCGCAGCC GGAGCGGCGG GTGCTGACCC ACCAGCTGGC CAGCGGGCTG
GGCTGCGTGC CGACCTCCAG CATGGGGCGG CTCTTCGACG CGGTCAGCTC GCTGCTCGGC
ATCCGTCACG AGGTCGACTT CGAGGCGCAG GCGGCGATCG AGCTGGAGGC CCGGGCGCGC
ACCGCGGCCG GCGGCGGCGC GGACGACCGC GAACGCCACG CGTTCGCCCT GCGCGGGCAG
GGGCTGGGCA GGCCGCTGAT TATCGACCCG GCGCCGGTCA TCCGGGCGAT GGTGAGGGAT
CTGTCGGCCG GGCTCGCCGT GGACATCGCC GCGGTCCGTT TCCACACAGC CGTCGTGGCG
ATGATCGTGG ACCTCGCCCG GCGGGCACGC GGCGAGGTGG GCCTTGACCT GGTCGGACTC
ACCGGCGGGG TCTTCCAGAA CGCCGTTCTG ACGACGGCTG CATCCAGGGC ACTGCATGCG
GGGGGCTTCA CGGTGCTGCG GCATGCGCGG GTACCTCCGA ATGACGGGGG GATCGCTCTC
GGTCAGCTGC TCGTCGCCGC AGCGCGGGAG AAGACGCGGG AAAAGGGGTG A
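
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, not the method used by the database: `gc_content` is a hypothetical helper name, and it assumes the sequence is pasted in with its 10-base grouping, which the function removes before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group the sequence into blocks of 10.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence, used here only as a short sample.
sample = "GTGATGGCGG ACGGCACGGC GGGCCGCATG ACCCGGCCGG TGCTCTCGGT GGGCGGGGCC"
print(f"{gc_content(sample):.0%}")
```

Passing the full 2451-base sequence instead of the sample gives the value to compare against the GC content listed in the Gene Information table.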
 
Protein sequence
MMADGTAGRM TRPVLSVGGA GEPSREAGPG GRARRRFVVQ GLVQGVGFRP FVHAAATELA 
LTGWVRNDTG GVVAEVEGTP GAVEDFARRL RCDVPPLAMV ERVVTTELPP CGGSGFAIVR
SRPAAGGRTA AAPDVATCPD CLRELADPAD RRYRHPFITC TNCGPRFTII TDLPYDRPAT
TMARFAMCPV CEREYSDPAD RRFHAQPIAC PACGPRLEFV SPLGAPRTGE DALAAARRLL
ADGGVLAVKG VGGYHLACVA TDPTAVTTLR RRKRRGGKPF AVMVHDLAAA RALAHVDERE
AALLTDPARP IVLVRRRRDG RPALADSVAP DNPDLGLLLP YTPVHHLLLG LPGEPVPLHR
TVGTAGWPPL VMTSGNLGGE PIAAEDTDAL RRLAPLVDGW LRHDRPIRVP CDDSVVRVVD
GAPSPVRRSR GYAPLPVPLP FEVPPTLAVG ADLKNTCGLG AGRSAWLSGH IGDMDDLATL
RAFDAAERHL EHLTGVTPRQ LVTDAHPGYR SRQWAVRHAA GRPIRTVQHH HAHVAALMAE
HGLDGTRPVV GFAFDGTGYG PDRAVWGGEV LIADYRGFRR FAHLGYVPLA GGDAAVRRPC
RMALAHLHAA GVRWDPTLPP VVACPQPERR VLTHQLASGL GCVPTSSMGR LFDAVSSLLG
IRHEVDFEAQ AAIELEARAR TAAGGGADDR ERHAFALRGQ GLGRPLIIDP APVIRAMVRD
LSAGLAVDIA AVRFHTAVVA MIVDLARRAR GEVGLDLVGL TGGVFQNAVL TTAASRALHA
GGFTVLRHAR VPPNDGGIAL GQLLVAAARE KTREKG
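
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions rather than the database's own pipeline: `translate_cds` is a hypothetical helper, the codon assignments are the standard ones (which translation table 11 shares), and the set of alternative start codons is taken from the NCBI definition of table 11. It shows why the gene begins with GTG while the protein begins with M.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons of translation table 11, read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGATGGCGG ACGGCACGGC GGGCCGCATG"))  # MMADGTAGRM
```

The output matches the first ten residues of the protein sequence listed above.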