Gene Francci3_4496 details


Gene Information

Locus tag: Francci3_4496
Symbol:
ID: 3907472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5367977
End bp: 5369269
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 68%
IMG OID: 637881828
Product: nickel-dependent hydrogenase, large subunit
Protein accession: YP_483571
Protein GI: 86743171
COG category: [C] Energy production and conversion
COG ID: [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit
TIGRFAM ID:
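
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5367977, 5369269)
print(length, protein_length_aa(length))  # prints: 1293 430
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.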


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.401222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGA CTAGGACCAT CGCACCTCCC CCACTCACCA GGGTCGAGGG CGAAGGAAGG 
CTGCTAATCA AGATCACCGA CGGGCGGGTG GACGAGGCCC ACCTGAAGAT CTTCGAACCG
CCGCGCTTCT TTGAGGCGTT CCTCCGCGGC CGGGCCTACA CCGAGCCACC CGACATCACC
GCCCGGATCT GCGGCATCTG CCCGGTCGCC TACCAGATGA GCGCGCTGGC GGCCATCGAA
CAAATCTGCG ACGTCACGGT CACCGGCCCA CCCGCCGCCC TACGGCGGCT CATCTACTGC
GGTGAATGGA TCGAGAGTCA CGCACTACAC GTCTTCCTGC TGCACCTACC GGACTTCCTC
GGCTACGACA GCGCACTGCA TCTTGCCCAG GACCAGCCCG CCCTGGTCAA GCTGGGACTC
ACGCTGAAGA AGGCCGGCAA CACGCTCATG ACGGTCATCG GCGGCCGCGC GATCCACCCC
GTCAACGCGC GGGTCGGCGG CTGGTACCGG GCACCACGCC GGCGTGACCT GACCGAGCTT
GTCGGGCAAC TGGAACAGGC GCGAGACATC GCCCGGGACA CGGCCCGATT CACCGCCGCC
CTGGACTTCC CCGAAGACGA ACTCCACCAG ACCTTCGTCG CGCTCCACCA ACCTGGCGAA
TACCCCGTTG AGCGCGGCCG GATCGCCTCC ACCGCTGGCC TCGACATCGC CCCGGCCGAC
TACGACCGGC ACTTCACCGA AGAACAGGTG CCCTGGTCGA ACGCCCTGCA CTCGACCCTG
GCCGCGGGCG GCTCCTACCT CACCGGACCG CTGGCTCGCT TCGCGCTGGG CGCGGAGCGG
CTGGCGCCCG CCGCCCGCGA GACCGCCGCC GAGATCGGCC TGCGCCCACC GGAGCGCAAC
CCCTACCGCA GCATCATCGT GCGCTGCATC GAGATGGTCC ACGCCGCCGA CGAGGCGCTG
CGGATCATCG CGGACTACAC CGAGCCCGAC CCCTCCGCGC TGGAGGCCCC GCCCCGGGCG
GGAACCGGAT ACGGGGTCAC GGAGGCACCC CGCGGCCTGC TCTACCACCG CTACACGATC
GACCACAACG GCACCATCCT CGACGCAAAG ATCGTGCCAC CAACCGCCCA GAACCAACGT
CCGATCGAAG AAGACCTGCG CGGTGTGGTG GAACGCTTCA TGAACCTGTC GGAGCCCGAA
CTCGCCCTGC GCTGCGAACG GGCCATCCGC AACTACGACC CCTGCATCTC ATGTGCGACC
CACTTCCTGA CTCTCCACAT CGAACACGGC TGA
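
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used for illustration; pasting the full sequence in its place would give the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed (illustration only).
first_row = "ATGCGCGCGA CTAGGACCAT CGCACCTCCC CCACTCACCA GGGTCGAGGG CGAAGGAAGG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```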
 
Protein sequence
MRATRTIAPP PLTRVEGEGR LLIKITDGRV DEAHLKIFEP PRFFEAFLRG RAYTEPPDIT 
ARICGICPVA YQMSALAAIE QICDVTVTGP PAALRRLIYC GEWIESHALH VFLLHLPDFL
GYDSALHLAQ DQPALVKLGL TLKKAGNTLM TVIGGRAIHP VNARVGGWYR APRRRDLTEL
VGQLEQARDI ARDTARFTAA LDFPEDELHQ TFVALHQPGE YPVERGRIAS TAGLDIAPAD
YDRHFTEEQV PWSNALHSTL AAGGSYLTGP LARFALGAER LAPAARETAA EIGLRPPERN
PYRSIIVRCI EMVHAADEAL RIIADYTEPD PSALEAPPRA GTGYGVTEAP RGLLYHRYTI
DHNGTILDAK IVPPTAQNQR PIEEDLRGVV ERFMNLSEPE LALRCERAIR NYDPCISCAT
HFLTLHIEHG