Gene Francci3_0550 details

Gene Information

Locus tag: Francci3_0550
Symbol:
ID: 3904201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 638324
End bp: 640012
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 69%
IMG OID: 637877879
Product: NADH dehydrogenase subunit M
Protein accession: YP_479663
Protein GI: 86739263
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
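
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(638324, 640012)
print(length, protein_length(length))  # 1689 562
```

Under those assumptions the result matches the listed Gene Length (1689 bp) and Protein Length (562 aa).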


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.291589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACAG TCCCCTGGTT GACGATCATG TTGATCGTCC CGGCCGCGGG TGCGGTGGTC 
GTCGCCGCTC TGCCCCGCCG GCTGTCGACC CTCGCCAAGC AGCTCACCCT CGGGCTCTCG
CTGGCGGTTC TGGTGCTCGC GGTCCTTGCG ACCGCGGCCT ACAACCCGGA CAAGGCCGGC
TTCCAGTTCG CCCAGTCCTA CGACTGGATC AAAACCTTCG GCATCTCCTA CTCGGTGGGG
GCCGACGGCA TCTCGCTGGT GCTGATCCTG CTTGCCGCGC TGCTGGTGCC GGTCGTGGTC
CTGGCGTCCT GGGACGAGGC AGGCGCGGAT GGCGGGACGA CCGGTGCGAC GGATCCGACC
GGCGCGGCCG CCGTCGGCGC GGCCGCCGTC GGGGTGGACG GCGCCGGGGT GGACGGCGCC
GGGGTGGACG GCGCCGGGGT GGACGGCGCC GGGACGCGGA GCAGGCGGTC GGTCCCGGCG
TTCTTCGCGC TGCTGCTGGC GCTGGAGGCC GGGATGATCG GCGTGTTCGC CGCTACCGAC
GTCTTCCTGT TCTACGTCTT CTTCGAGGCG ATGCTCATCC CGATGTACTT TCTCATCGGG
AGCTACGGCC CGGTCCGGGA GCAGGCCCAG CGCTCCTACG CGGCGGTCAA GTTCCTGCTC
TACAGCCTCT TTGGCGGCCT GCTGATGCTC GCTGCCGTGA TCGGACTGTA CGTCGTCTCC
GCCGACAACC TCGGCAGCGG AACCTTCGAC TTCGCCACCC TGCGGCAGAT GGACATCACC
CCCGGGGTGC AGAAGCTGCT GTTCCTCGGT TTCTTCCTGG CGTTCGCCAT CAAGGCCCCC
CTGTTCCCGT TCCACACCTG GCTGCCCGAC GCCGGCGCGC AGTCGCCCAC CGGCGGCGCG
GTGCTGCTGG TCGGGGTGCT GGACAAGGTG GGCACGTTCG GACTGATCCG GTACTGCATC
CCGCTGTTTC CCGACGCGGC CGACTACTTC GCCCCGCTGG TGCTTGGTCT GGCGGTGATC
GGCATCTTCT ACGGCGCCCT GCTCGCCATC GGGCAGCGGG ACATGAAACG GCTGGTCGCC
TACACCTCGC TGGCCCACTT CGGCTTCATC GCGCTGGGCA CCTTCGCCTT CACCTCCCAG
GCGGGCAGCG GCGCGGTGCT TTACATGGTC AACCACGGCC TGTCCACCGG CCTGCTTTTC
ATGGTCGTGG GCTTCCTGGT GGCGCGCCGC GGCACTCGTG ACGTCGGTGC TTACGGCGGC
CTGGCCAGGG TGACGCCGGT GCTTGCCGGG GTGTTCCTCG TCGCCGGACT GTCGTCGTTG
GCGTTGCCTG GAACGAACAG CTTCGTCAGC GAGTTCCTGG TGCTGGTGGG GACGTTCACC
CGGAACAGGC CGCTGGCGAT CGTCGCGACC ACCGGCATCG TGCTGGCCGC GATCTACATC
CTGTACCTCT ACCAGCGGAC GATGACCGGA CCGGTGGTGC ACGAGGAGAA CAAGGTCCTG
GTCGACCTCA GCCTGCGCGA GAAGCTCGTC GTCGCCCCGA TGGTCGCGCT CATCGTCGCG
CTCGGGGTCT ACCCCAAGCC GCTGCTCGAC ATCATCACGC CGACGGTGAC GGCGACCTAC
GCCGATATCG GCAAGTCTGA CCCGGCTCCG ACGCACTCGG TGGCCGCGGA GTCCGGAGGC
CACTCGTGA
 
Protein sequence
MHTVPWLTIM LIVPAAGAVV VAALPRRLST LAKQLTLGLS LAVLVLAVLA TAAYNPDKAG 
FQFAQSYDWI KTFGISYSVG ADGISLVLIL LAALLVPVVV LASWDEAGAD GGTTGATDPT
GAAAVGAAAV GVDGAGVDGA GVDGAGVDGA GTRSRRSVPA FFALLLALEA GMIGVFAATD
VFLFYVFFEA MLIPMYFLIG SYGPVREQAQ RSYAAVKFLL YSLFGGLLML AAVIGLYVVS
ADNLGSGTFD FATLRQMDIT PGVQKLLFLG FFLAFAIKAP LFPFHTWLPD AGAQSPTGGA
VLLVGVLDKV GTFGLIRYCI PLFPDAADYF APLVLGLAVI GIFYGALLAI GQRDMKRLVA
YTSLAHFGFI ALGTFAFTSQ AGSGAVLYMV NHGLSTGLLF MVVGFLVARR GTRDVGAYGG
LARVTPVLAG VFLVAGLSSL ALPGTNSFVS EFLVLVGTFT RNRPLAIVAT TGIVLAAIYI
LYLYQRTMTG PVVHEENKVL VDLSLREKLV VAPMVALIVA LGVYPKPLLD IITPTVTATY
ADIGKSDPAP THSVAAESGG HS
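
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation for the nucleotide sequence. It assumes the input contains only A, C, G and T plus whitespace; the helper names are hypothetical and not part of any database tool.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the printed blocks form one contiguous sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCACACAG TCCCCTGGTT GACGATCATG TTGATCGTCC CGGCCGCGGG TGCGGTGGTC"
seq = join_blocks(first_line)
print(len(seq))  # 60
```

Applying `gc_fraction` to the full joined gene sequence gives a value that can be compared with the GC content field in the record.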