Gene Franean1_2493 details

Gene Information

Locus tag: Franean1_2493
Symbol:
ID: 5670889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2970877
End bp: 2971932
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 69%
IMG OID: 641241410
Product: NADH ubiquinone oxidoreductase 20 kDa subunit
Protein accession: YP_001506831
Protein GI: 158314323
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
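
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, though both are consistent with the values it lists. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2970877, 2971932)
print(length, protein_length(length))  # 1056 351
```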


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.248726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCACAG TCGAGGATCC GATCCACATT CTCTGGATCA ACGCCGGACT CAGCTGCGAC 
GGGGACTCGG TCGCGCTCAC CGCAGCCACC CAGCCCAGCA TCGAGGACAT CGTTCTGGGC
ACCCTGCCGG GCCTGCCGAA GGTCAGCGTC CACTGGCCGC TCATCGATTT CGAGTCCGGG
CCGGAGCAGG GCGCCGACAC CTTCATCGAC TGGTTCCGCA GGGCCGACCA CGGCGAGCTC
GATCCGTTCG TCCTGGTCGT CGAGGGGTCC ATTCCGAACG AGGACCTGAT CACGAACGGC
GGCTACTGGA GCGGTTTCGG GAACGACCCG GTGACCCGCC AGCCGGTGAC GACCAGCACC
TGGCTCGACC GGCTCGCGCC GAAGGCGCTG GCGATCCTCG CCGCGGGAAC GTGCGCCACC
TACGGCGGCA TCCACGCGAT GGCCGGGAAT CCGACCGGGG CGATGGGGGT GCCCGACTAT
CTCGGCTGGG ACTGGAAGTC GAAGGCGCAG ATCCCGATCG TGTGCGTTCC GGGCTGCCCC
GTCCAGCCGG ACAACCTGTC GGAGACGATC ACCTACCTGC TCTACCAGGC GAGCGGGCAG
GCGCCGATGA TCCCCCTCGA CGACCAGCTC CGCCCGCGCT GGCTGTTCGG CGCCACCGTG
CACCAGGGCT GCGACCGGGC CGGGTACTAC GAGGAAGGTC AGTTCACCAC GGAGTACGGC
ACCCCGCAGT GCCTGGTGAA GATCGGCTGC TGGGGCCCGG TGGTGAAGTG CAACGTCCCC
AAGCGCGGCT GGATCAACGG GGTGGGCGGC TGCCCGAACG TCGGCGGGAT CTGCATCGCA
TGCACGATGC CGGGCTTCCC GGACCGTTTC ATGCCGTTCA TGGACGAGCC GCCCGGCGCC
CACATCTCGA CCACCGCGAG CGGCCTGTAC GGCGCGGTCA TCCGCAGACT GCGCGCCATC
ACGATGCGTA AGGCGGACGT CGAACCGCGC TGGCGGCGCC GGGACGTCCA CGCAGGCCAG
GACGACGATC GCGCGCGGGA GAAGGTGCTG ACATGA
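
The GC content reported for this gene can be recomputed from the sequence. The following is a minimal sketch, assuming the sequence is pasted as text with the blocks of ten bases separated by spaces, as shown above. The helper name `gc_content` is hypothetical, and only the first row of the sequence is used here for brevity, so the printed value is for that row rather than the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence only; paste all rows for the full gene.
first_row = "GTGTCCACAG TCGAGGATCC GATCCACATT CTCTGGATCA ACGCCGGACT CAGCTGCGAC"
print(f"{gc_content(first_row):.1%}")
```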
 
Protein sequence
MSTVEDPIHI LWINAGLSCD GDSVALTAAT QPSIEDIVLG TLPGLPKVSV HWPLIDFESG 
PEQGADTFID WFRRADHGEL DPFVLVVEGS IPNEDLITNG GYWSGFGNDP VTRQPVTTST
WLDRLAPKAL AILAAGTCAT YGGIHAMAGN PTGAMGVPDY LGWDWKSKAQ IPIVCVPGCP
VQPDNLSETI TYLLYQASGQ APMIPLDDQL RPRWLFGATV HQGCDRAGYY EEGQFTTEYG
TPQCLVKIGC WGPVVKCNVP KRGWINGVGG CPNVGGICIA CTMPGFPDRF MPFMDEPPGA
HISTTASGLY GAVIRRLRAI TMRKADVEPR WRRRDVHAGQ DDDRAREKVL T
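
The protein sequence can be reproduced from the gene sequence using translation table 11, under which the GTG start codon of this gene is read as methionine. The sketch below is an illustration rather than the database's own procedure. It builds the codon table from the standard NCBI ordering and assumes the input is a complete CDS in the forward reading frame. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G);
# table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; the start codon becomes M, and a stop ends translation."""
    dna = "".join(seq.split()).upper()
    if dna[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for i in range(3, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCCACAG TCGAGGATCC GATCCACATT"))  # MSTVEDPIHI
```

Passing the full gene sequence instead of the 30-base excerpt should yield a string that can be compared with the protein sequence listed in this record.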