Gene Francci3_0551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0551 
Symbol 
ID3904202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp640054 
End bp641589 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID637877880 
ProductNADH dehydrogenase subunit N 
Protein accessionYP_479664 
Protein GI86739264 
COG category[C] Energy production and conversion 
COG ID[COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N) 
TIGRFAM ID[TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.170778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACAC CGCCGTCGAT CGAGTACTCC TCGCTCAGCC CGATCCTGAT CGTGTTCGGG 
GTCGCGCTCG TCGGGGTGCT CGTCGACGCC TTCGCCACGA AGCGGGCCCG GCGGACCTTC
CAACCGATCC TGGCGGGTGC GGGCTTCGTC GCCGCGCTCG TGGCCGTGGC GGTGCTGCAC
GGCCGGCAGG CCATCCTCGC CTCCGGTGCG CTGGCGATCG ACGCGCCGAC CTTGTTCATG
CAGGGCACGA TCCTGGTCTT CGCCCTGCTG TCGGTGCTGC TGGTGGCCGA ACGCCAGCTT
GACTCGTCCG GCGGGGCTAT TGTGGCCTCG GCCGCGATCA CCCCGGGCTC GAAGGGATCG
ACGGCGCAGC AGACCTCGGC AGACGTGCAG ACCGAGGCGT ATCCGCTGAT GGTCTTCTCG
GTCACCGGGA TGATGCTCTT CGTCGCCTCG AACAACCTGC TAGTGATGTT CGTGGCGCTG
GAGATCCTCT CGTTGCCGCT GTACCTGCTG GCCGGGCTCG CCCGGCGCCG TCGGCTGCTG
TCGCAGGAAG CGGCGATGAA GTACTTCCTG CTCGGGGCGT TCTCCTCGGC CTTCTTCCTC
TACGGCGTCG CGTTCGCCTA CGGATTTGCC GGCAGCGTGG AGCTCGGGGC GGTCGCGGAC
GCGGTCAGCA ACGCCGGTGC GAACGACACC TACCTCTATC TGTCGCTCGC GCTGCTGGCG
GTGGGGCTGT TCTTCAAGAT CGGCGCCGTG CCGTTCCACT CCTGGACGCC GGACGTCTAC
CAGGGCTCGC CGACGCCGGT TACCGCGTTC ATGGCGGCGG GGACGAAGGT CGCCGCGTTC
GGTGCCCTGT TGCGGGTCTT CTACGTCGCC TTCGGAGGGC TGCGCTGGGA CTGGCGACCA
ATCCTGTGGA CGATCGCCAT CCTCACCATG GTGGTCGGCG CGGTGCTCGC CCTGACCCAG
CGTGACATCA AGCGTATGCT GGCCTACTCG GCGATCGCGC ACGCCGGGTT CCTGCTGGTG
GGCCTCGCCG GCACCAACAC CGACGGCCTG CGCGGCTCGA TGTTCTACCT GGTGACCTAC
GGCTTCACGA CGATCGCCGC CTTCGCCGTG GTCTCCCTGG TCCGTACCGG CGACGGCGAG
GCCGGCGACC TGTCCCAGTG GCGGGGGCTC GGCAGGACCT CGCCCCTGCT GGCCGGGACG
TTCTCGTTCC TGCTGCTCGC GCTCGCGGGG ATCCCGCTGA CGAGCGGGTT CACCGGGAAG
TTCGCGGTGT TCCAGGCCGC GATCGCCGGG GACGCCACCC CGCTGGTAGT CGTTGCACTG
GTGTGCAGCG CCATCGCCGC CTTCTTCTAC GTGCGGGTCA TCGTGCTGAT GTTCTTCTCC
GAGCCGCTTG CCGAGGGACC GGTGGTGGTG ACCCGCCCGA CGCTGACCTT CGCCGCGGTC
GCCATCGGTA CCGTGGCTAC TCTTGTACTG GGAGTAGCGC CACAGCCACT CCTGGACCTC
GCGACGACCG CCGCGACGTC CGGCTTCGTA CGCTGA
 
Protein sequence
MITPPSIEYS SLSPILIVFG VALVGVLVDA FATKRARRTF QPILAGAGFV AALVAVAVLH 
GRQAILASGA LAIDAPTLFM QGTILVFALL SVLLVAERQL DSSGGAIVAS AAITPGSKGS
TAQQTSADVQ TEAYPLMVFS VTGMMLFVAS NNLLVMFVAL EILSLPLYLL AGLARRRRLL
SQEAAMKYFL LGAFSSAFFL YGVAFAYGFA GSVELGAVAD AVSNAGANDT YLYLSLALLA
VGLFFKIGAV PFHSWTPDVY QGSPTPVTAF MAAGTKVAAF GALLRVFYVA FGGLRWDWRP
ILWTIAILTM VVGAVLALTQ RDIKRMLAYS AIAHAGFLLV GLAGTNTDGL RGSMFYLVTY
GFTTIAAFAV VSLVRTGDGE AGDLSQWRGL GRTSPLLAGT FSFLLLALAG IPLTSGFTGK
FAVFQAAIAG DATPLVVVAL VCSAIAAFFY VRVIVLMFFS EPLAEGPVVV TRPTLTFAAV
AIGTVATLVL GVAPQPLLDL ATTAATSGFV R