Gene Franean1_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3404 
Symbol 
ID5671775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4035225 
End bp4036946 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content70% 
IMG OID641242292 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_001507712 
Protein GI158315204 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.37903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGC TTGTCATCGA CCCGGTGTCG CGGATCGAGG GCCACCTGCG GGTCGAGGTG 
CAGGTGGACG GCGGTCAGGT CACCGAGGCC TACGCGTCCT CGACGATGTG GCGCGGCATC
GAGACGATCC TGACCGGCCG GGACCCCCGG GACGCCTGGC TGTTCGCTCA GCGGATCTGC
GGTGTGTGCA CCTCGGTGCA CGCGCTGGCC TCGGTGCGGG CGGTGGAGGA CGCGGTCGGC
GCCGTCCCGC CGCTGAACGC CCGGCTGCTG CGGGACCTGA TCGCCACGTC ACTGTGTGTG
CACGACCATG TCGTCCACTT CTACCACCTG CAGGCCCTGG ACTGGGTCGA TCCGACCGCG
GCGCTCGCGG CCGACCCGGT GCGTGCCGCG GCGCTGGCGC AGTCCATCTC CGACTATCCG
CGGTCCACGG CCGGTCTGTT CGCCTCGGTG CAGACCCGGC TCAAGGTCTT CCTGGAAAGC
GGGAAGATCG GGCCGTTCAC CAACGGCTAC TGGGGGCATC CGGCCTACCG GCTGAGCCCG
GAGCTGAACC TGGTGGCGTT CAGCCACTAT CTCGACGCGC TGGACTTCCA GCGGGACTAC
ATCCGGGTGC ACGCCCTGCT CGGCGGAAAG AACCCGCACC CGCAGACCTA CGTGGTCGGC
GGAATGGCGT CCCCGATCGA CCTCAACAGC CAGGACGCCA TCAACGCGAA CACATTGCAG
GAGGTGACGC AGATCCTGCA GCGGGGCCTC GAATTCGTCG AGCAGGTCTT CCTGCCGGAC
CTGCAGGCGA TCGGCGCCGC CTATCCCGAG TGGACGACCT ACGGTCGGGG CCTGGGCTCC
TACATGGTGT TCGGTGACTA TTCACTGTCG CCGCCGGGCA TCGCCCGGCC ACCCCGGGAC
GGCCTTTTCC CCGGCGGGAT CATCCGGAAC GGGAACCTCG CGGTGAAACC GGAACCCTTC
GACCCGTCCG GTATCGCCGA GTCCGTCGCC CATTCGTGGT TCCGCTACGA CGATCCCGGC
GCCGCGCTGC CCCCCTGGAA GGGTGAGACG ACCCCGCAGT ACACCGGACC TGAACCCCCT
TTCGAGCAGC TCGACGTCGC CGGGAAGTAC ACCTGGCTGA AGGCGCCGCG CTACCAGGGA
GCGGCCATGG AGGTCGGCCC GGTGGCCCGC ATGCTCGTCG GCTACACCTC CGGGGACGCC
CGTATCCGCC CGCTCGTGCA GGGCGCGCTC GACGCCCTGC GACTGCCGCC CGAGGCACTG
ATGTCCACCC TCGGCCGGGT GGTGGCCCGC GGCCTGGAGA CCAGGCTGAT GGCGCAGTAC
TCCCTGGAGC TCGTCAACCG GCTGCGTGAC AACGTCGCCG CCGGCGACCT GGCCGTCGCC
GACACCGGGC GATGGCGTCC GACATCCTGG CCGGAGGGCG CCCTGCTCGG CGTCGGCTTC
CACGAGGCGC CCCGCGGGTC GCTGTCGCAC TGGGTCGTCA TCGAGGACGG CCGGATCCGC
AACTACCAGG CCGTGGTGCC CACCACCTGG AACGCCAGCC CCCGCGACGC GGAGGGGAAT
CCCGGCGCCT ACGAGGCCGC GCTAGTCGGC ACACCCGTCG CCGATCCCCA GCGGCCGCTG
GAGATCCTGC GCACCCTGCA CTCCTTCGAC CCCTGCATGG CCTGCGCGGC GCACATCTAC
GACGTCGAGG GCCGCGACAT CATCGAGGTG CGGGTCCAGT GA
 
Protein sequence
MSRLVIDPVS RIEGHLRVEV QVDGGQVTEA YASSTMWRGI ETILTGRDPR DAWLFAQRIC 
GVCTSVHALA SVRAVEDAVG AVPPLNARLL RDLIATSLCV HDHVVHFYHL QALDWVDPTA
ALAADPVRAA ALAQSISDYP RSTAGLFASV QTRLKVFLES GKIGPFTNGY WGHPAYRLSP
ELNLVAFSHY LDALDFQRDY IRVHALLGGK NPHPQTYVVG GMASPIDLNS QDAINANTLQ
EVTQILQRGL EFVEQVFLPD LQAIGAAYPE WTTYGRGLGS YMVFGDYSLS PPGIARPPRD
GLFPGGIIRN GNLAVKPEPF DPSGIAESVA HSWFRYDDPG AALPPWKGET TPQYTGPEPP
FEQLDVAGKY TWLKAPRYQG AAMEVGPVAR MLVGYTSGDA RIRPLVQGAL DALRLPPEAL
MSTLGRVVAR GLETRLMAQY SLELVNRLRD NVAAGDLAVA DTGRWRPTSW PEGALLGVGF
HEAPRGSLSH WVVIEDGRIR NYQAVVPTTW NASPRDAEGN PGAYEAALVG TPVADPQRPL
EILRTLHSFD PCMACAAHIY DVEGRDIIEV RVQ