Gene Franean1_6089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6089 
Symbol 
ID5674410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7413262 
End bp7414605 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID641244941 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_001510339 
Protein GI158317831 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0248618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTCA CCCCGGTCCT CACCCGGCGC TGGACGGCGC CCGAGTCGTG GACGCTGGCG 
ACCTACGAGC GCCTTGACGG CTACCAGGGC CTGCGCCGGG CGCTGGCGCA GAGCCCGGAC
GACCTGATCA AGCTGGTCAA GGACTCCGGC CTGCGCGGTC GCGGCGGCGC GGGCTTCCCC
ACCGGTATGA AGTGGGGCTT CATCCCGCAG GGCGACGGCA AACCGCACTA CCTCGTCATC
AACGCCGACG AGGGCGAGCC GGGCACCTGC AAGGACGCCC CGCTGATGAA GGCCGACCCG
CACTCGCTGA TCGAGGGCAT CGTGATCGCC GCCTACGCGG TGCGCGCGAA CCGGGCCTTC
ATCTACCTGC GCGGCGAGCT GATCCACGCC GGCCGGCGGC TGCGCGCCGC CGTCGCCGAG
GCGTACCGCG CCGGCTACCT GGGACGCGAC ATCCTCGGTA GCGGGTTCGA CCTCGACCTG
GTGGTGCACT CCGGCGCCGG CGCGTACATC TGCGGCGAGG AGACGGCGCT GCTGGACTCG
CTGGAGGGCC GGCGCGGCCA GCCGCGGCTG CGCCCGCCGT TCCCGGCGAC CCACGGCCTG
TACGCGTCCC CCACGGTCGT GAACAACGTC GAGACGATCG CCACCGTTCC CTTCATCGTG
AACTACGGCG TCGACTGGTT CCGGTCGATG GGCCGCGAGC GCGCCCCGGG CCCGAAGATC
TACAGCCTCT CCGGCCACGT GACCCACCCC GGCCAGTACG AGGCGCCGAT GGGCACGACG
CTGCGCGAGC TGCTCGACAT GGCGGGCGGC GTCCTCGGCG GCCGCAAGCT CAAGGCGTGG
ACCCCGGGCG GCTCGTCGAC GCCGCTGCTG ACCGCCGACC ACCTTGACGT CCCGCTGGAC
TTCGAGGGCG TGCAGGAGGC CGGCTCGCTG CTCGGCACGG CCGCCCTCAT GATCATGGAC
GACTCGGTCG ACATGCTCAA GATCGTGCGG CGGCTGACCC AGTTCTACGC GCACGAGTCG
TGCGGCAAGT GCACCCCGTG CCGGGAGGGC ACCACCTGGA TGGTGCAGAT CCTGTCCCGG
ATGGAGCGCG GCCAGGGCGA CCCCGACGAC GTCGACACCC TCGTCGACGC CTGCGACAAC
ATCTTCGGAC GCGCCTTCTG CGCGCTCGCG GACGGCGCCA CCTCGCCGAT CGTCTCCGGG
ATCAAGTTCT TCCGGAACGA GTTCCTCCCG ATCACCCCGG TGGGGCCGTC GGGTTCCACC
ACGTCGGTAG CCGGCTCGGC GAACGGCGCG GCCGCGGGTG GCGCGGCCGC GGGCACGCCG
GGCGCCTACG CGGGAGCGCA CTGA
 
Protein sequence
MPVTPVLTRR WTAPESWTLA TYERLDGYQG LRRALAQSPD DLIKLVKDSG LRGRGGAGFP 
TGMKWGFIPQ GDGKPHYLVI NADEGEPGTC KDAPLMKADP HSLIEGIVIA AYAVRANRAF
IYLRGELIHA GRRLRAAVAE AYRAGYLGRD ILGSGFDLDL VVHSGAGAYI CGEETALLDS
LEGRRGQPRL RPPFPATHGL YASPTVVNNV ETIATVPFIV NYGVDWFRSM GRERAPGPKI
YSLSGHVTHP GQYEAPMGTT LRELLDMAGG VLGGRKLKAW TPGGSSTPLL TADHLDVPLD
FEGVQEAGSL LGTAALMIMD DSVDMLKIVR RLTQFYAHES CGKCTPCREG TTWMVQILSR
MERGQGDPDD VDTLVDACDN IFGRAFCALA DGATSPIVSG IKFFRNEFLP ITPVGPSGST
TSVAGSANGA AAGGAAAGTP GAYAGAH