Gene Francci3_1069 details

Gene Information

Locus tag: Francci3_1069
Symbol:
ID: 3906412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1273235
End bp: 1274296
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 76%
IMG OID: 637878403
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_480180
Protein GI: 86739780
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
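
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1273235
END_BP = 1274296


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results match the Gene Length (1062 bp) and Protein Length (353 aa) fields.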


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.582687
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGACC CCGTCGACCC GACCGGTTGG ACCTGCCCAC TTCCGCTGCG TGACCATCAC 
CAGGTGGTAC TGGGGCACGG CGGCGGCGGC GTGCTGTCGA GCGAGCTGAT CGAGCATCTG
TTCCTGCCCG CGTTCGGGAC CACCGACACG GCGCGGGCAC CGGCGGACTC GGCGGTCCTG
GATGTCGCGG GCGCCCGGCT CGCGTTCTCC ACCGACTCGT ACGTGGTGCG CCCGCTGTTC
TTCCCCGGTG GCTCCATCGG CGAGCTCGCG GTCCACGGCA CCATCAACGA CCTGGCCTGT
GCCGGCGCGG TGCCGGTGGC GCTCTCGGCC GGGTTCATCC TCGAGGAGGG CCTGGAGCTC
GCGGTCCTGG GCCGGGTGGC GCAGGCGATG GGCCGGGCCG CCGCCGCGGC GGGGGTGCGG
CTGGCGACCG GGGACACCAA GGTCGTCGAG CGCGGGCTGG CCGACGGTCT GTACGTGAAC
ACCAGCGGCA TCGGGCTCGT CCCGGCCGAG GTGGACATCC GCCCCGAACG GGCGAGACCC
GGCGACCGGG TCATCGTCTC CGGTCCCGTC GGCGAGCACG GTGTCGCCGT GCTGAGCGTG
CGCGACGGGC TGGAGTTCGG CGGCGAGGTC CGCTCCGACA CGACGGCGCT GCACGGGCTG
GTCGCGGCGG TGCTGGCGGC CGCCCCGGGG GTCCACGCGC TGCGCGACCC GACCCGAGGT
GGCCTCGCGA CCGCGCTGTG CGAGATCGCC GCCGCGTCCG GGACGGGCAT CGAGTTCGCC
GAGCGCGCCG TGCCGGTGCC GCCCGCGGTC GAGGCGGCCT GCGGGTTCCT CGGCCTCGAC
CCGCTGCACG TGGCGAACGA GGGCAAGCTG GTCGCGTTCG TCGCCGACGC CGACGCCGAC
GCGGCGCTCG CGGCGATGCG GGCGCATCCG GCGGGGCGTG ACGCGGCCGT CATCGGCACG
GTCACCGCGG AGCATCCGGG CGTGGTTGTC GGGCGCACCG CGTTCGGGGG AACCCGCATC
GTCGACCGGC CGCTCGGCGA GCAGCTCCCC CGCATCTGCT GA
 
Protein sequence
MADPVDPTGW TCPLPLRDHH QVVLGHGGGG VLSSELIEHL FLPAFGTTDT ARAPADSAVL 
DVAGARLAFS TDSYVVRPLF FPGGSIGELA VHGTINDLAC AGAVPVALSA GFILEEGLEL
AVLGRVAQAM GRAAAAAGVR LATGDTKVVE RGLADGLYVN TSGIGLVPAE VDIRPERARP
GDRVIVSGPV GEHGVAVLSV RDGLEFGGEV RSDTTALHGL VAAVLAAAPG VHALRDPTRG
GLATALCEIA AASGTGIEFA ERAVPVPPAV EAACGFLGLD PLHVANEGKL VAFVADADAD
AALAAMRAHP AGRDAAVIGT VTAEHPGVVV GRTAFGGTRI VDRPLGEQLP RIC
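
The protein sequence can be rederived from the gene sequence using translation table 11. The following is a minimal sketch, not the database's own pipeline: it assumes that table 11 uses the standard codon assignments and differs only in its set of permitted initiator codons (so the leading GTG is read as M), and the names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical. Only the first 60 nucleotides of the gene are used, to keep the example short.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 0 as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_60 = "GTGGCTGACC CCGTCGACCC GACCGGTTGG ACCTGCCCAC TTCCGCTGCG TGACCATCAC"
print(translate(first_60))  # MADPVDPTGWTCPLPLRDHH
```

The output matches the first 20 residues of the protein sequence. Passing the full 1062 bp gene sequence to the same function should yield the 353 residues, stopping at the final TGA codon.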