Gene Francci3_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0543 
Symbol 
ID3904194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp629203 
End bp630534 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content71% 
IMG OID637877872 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_479656 
Protein GI86739256 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.315884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.338133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCA CCCCGGTCCT CACCCGGCGC TGGAACACGC CGGAGTCGTG GACGATCGAG 
ACCTACACCC GCCTCGACGG CTACACCGCG CTGCGCACCG CCTTCGCGAT GGCCCCGGAC
GACCTCATCA AGCTGGTGAA GGACTCCGGG CTGCGGGGGC GCGGCGGCGC CGGCTTCCCC
ACCGGGATGA AGTGGGGCTT CATCCCGCAG GGCGACGGCA GGCCGCACTA CCTCGTCATC
AACGCCGACG AGGGGGAGCC GGGCACCTGC AAGGACGCGC CGCTGATGAT GGCGGACCCG
CACTCGCTCA TCGAGGGGAT CATCATCGCG GCCTACGCGG TGCGGGCCGG CCGGGCGTTC
GTCTACCTGC GCGGCGAACT GATCCACGCG GCCCGTCGGC TGCAGGCCGC CGTGGCCGAG
GCCTACCGGG CCGGCTACCT CGGCCGGGAC ATCCTCGGCA GCGGCTTCGA CCTCGACCTC
GTCGTGCACT CCGGCGCGGG CGCCTACATC TGCGGCGAGG AGACAGCCCT GCTCGACTCC
CTGGAGGGCC GTCGCGGCCA GCCGCGGCTG CGCCCGCCGT TCCCCGCGAC GCACGGCCTG
TACGCCTCCC CGACGGTGGT CAACAACGTC GAGACGATCG CGTCGGTGCC GTACATCGTG
AACTACGGCG TGGACTGGTT CCGGTCAATG GGCCGGGAGC GCTCGCCGGG GCCGAAGATC
TACAGCCTGT CCGGGCACGT GACCCGGCCC GGCCAGTACG AGGCCCCGAT GGGCACCACG
CTGCGGGAGC TGCTCGACTT GGCCGGCGGC GTGCGGGGCG GTCACGGGCT CAAGGCGTGG
ACCCCGGGCG GGTCGTCCAC TCCGATGCTG ACCGCCGAGC ACCTCGACGC GCCGCTCGAC
TTCGAGGGCA TGCAGGAGGC CGGGTCGCTG CTTGGCACGG CCGCACTCAT GATCATGGAT
GACACGGTCG ACATGCTCAA GGTCGTCCGC CGGCTCACCC AGTTCTACGC GCACGAGTCG
TGCGGCAAGT GCACTCCGTG CCGCGAGGGC ACCACCTGGA TGGTGCAGAT CCTGTCCCGG
ATGGAACGGG GCCAAGGCGA CGCCGAGGAC GTGGACACCC TCGTCGACGC CTGCGACAAC
ATCTTCGGCC GGGCATTCTG CGCGCTGGCC GACGGGGCCA CCTCCCCGAT CGTCTCGGGG
ATCAAGTACT TCCGGGACGA GTTCATCCCG GTGAGTGCGG TGACCACGAA GCCGAACCCC
ACCCCGGGAG ACGCCACGGC CGGCGACGGC TCGCCCGCGT CCGCCCCGGG TGCCTACGCG
GGAGCGCACT GA
 
Protein sequence
MPVTPVLTRR WNTPESWTIE TYTRLDGYTA LRTAFAMAPD DLIKLVKDSG LRGRGGAGFP 
TGMKWGFIPQ GDGRPHYLVI NADEGEPGTC KDAPLMMADP HSLIEGIIIA AYAVRAGRAF
VYLRGELIHA ARRLQAAVAE AYRAGYLGRD ILGSGFDLDL VVHSGAGAYI CGEETALLDS
LEGRRGQPRL RPPFPATHGL YASPTVVNNV ETIASVPYIV NYGVDWFRSM GRERSPGPKI
YSLSGHVTRP GQYEAPMGTT LRELLDLAGG VRGGHGLKAW TPGGSSTPML TAEHLDAPLD
FEGMQEAGSL LGTAALMIMD DTVDMLKVVR RLTQFYAHES CGKCTPCREG TTWMVQILSR
MERGQGDAED VDTLVDACDN IFGRAFCALA DGATSPIVSG IKYFRDEFIP VSAVTTKPNP
TPGDATAGDG SPASAPGAYA GAH