Gene Francci3_0544 details


Gene Information

Locus tag: Francci3_0544
Symbol:
ID: 3904195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 630540
End bp: 633047
Gene Length: 2508 bp
Protein Length: 835 aa
Translation table: 11
GC content: 75%
IMG OID: 637877873
Product: NADH dehydrogenase subunit G
Protein accession: YP_479657
Protein GI: 86739257
COG category: [C] Energy production and conversion
COG ID: [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
TIGRFAM ID: [TIGR01973] NADH-quinone oxidoreductase, chain G
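
The start, end, gene length and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed length) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(630540, 633047)
print(length)                           # 2508
print(expected_protein_length(length))  # 835
```

Both values agree with the Gene Length and Protein Length fields of this record.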


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.242771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.39762
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCG CCCCATCCAG GCCTCGCGCC TCCGAGCCGG CCCAGCCGGC CGCACCCGAC 
CTCATCACCC TCACCATCGA CGGGCTGTCG GTGAGCGTGC CCAAGGGCAC CCTCATCATC
CGCGCGGCCG AGCTGCTCGG CATCGAGATC CCGCGGTTCT GCGACCATCC GCTCCTCGAC
CCCGTCGGAG CCTGCCGGCA GTGCATCGTC GAGGTGGAGG GCCAGCGCAA GCCGATCGCC
TCCTGCACCA CGACGGTGGC CGCCGACATG GTCGTGAAGA CCCAGCTGAC CTCGCCGGTC
GCCCGCAAGG CCCAGGCCGG CACGCTTGAG TTCCTGCTCC TCAACCATCC GCTCGACTGC
CCGATCTGCG ACAAGGGCGG CGAGTGCCCG CTGCAGAACC AGTCGATGGC GAACGGCGGC
GCGGTCTCCC GGTTCAAGGA GACCAAGCGG GTGTACCCGA AGCCGCTGGC GATCTCCACC
GAGATCCTGC TGGACCGGGA GCGCTGCGTG CTGTGTGCCC GCTGCACCCG GTTCTCCGCC
CAGATCGCCG GGGACCCGTT CATCGAGCTG TTCGAGCGGG GCGCCGCCGA GCAGGTCGCG
GTGAGCGACG GGCAGCCGTT CTCCTCCTAC TTCTCCGGCA ATACCGTGCA GATCTGCCCA
GTGGGGGCGT TGACCAGCGC CGCCTACCGC TTCCGGGCCC GGCCCTTCGA CCTGGTCTCC
ACGCCCACGG CGTGCGAGCA CTGCGCGTCG GGCTGCTCCC TGCGCACCGA CCACCGCCGG
GGCAGGGTGA CCCGTCGCCT CGCCGGCGAC GACCCGGCGG TGAACGAGGA GTGGAACTGC
GACAAGGGCC GGTTCGCCTT CACCTACGCG CGCGCGGCCG ATCGGCTCAC CACCCCGCTG
ATCCGCGACG ACGACACCGG GCAGCTCGTC CCGGTCTCCT GGAGCGAGGC GCTCAAGTAC
GCCGCGCGGG GGCTGGCCGA GTGCCGGGAC CGGCGCGGGG TCGGGGTGCT CACCGGCGGG
CGGCTGACCC GCGAGGACGC CTACGCCTAC GCGAAGTTCA CCCGGGTGGC GCTGGCCAGC
AACGACGTGG ACTTCCGGGC CCGGCCGCAC TCCGCCGAGG AGGAGCAGTT CCTCGGGTAT
GCCGTCGCCG GTACCGGCAT CGGCGTCACC TACGCCGATC TCGAGGCCGC CCCGGCCGTG
CTCCTGGTGG CCTTCGAACC GGAGGAGGAG TCGCCCATCG TCTTCCTGCG GCTGCGTAAG
GCCGTGGACA AGCACGCCGC GGCCGTGCAC GCGCTCGCCC CCCTGGCGAG CCGGGGGCTG
ACCAAGCTCG CCGGCACCCT CGTGCCGACG CGGCCGGGCG AGGAGGCGGC CGTCCTCGAC
GCGCTCGCCG CCCCCGACCG GGGCGGGCCG GAAGCACCGA CCACGCGGGC CCTGCGGGCG
CCGGGGGCGG TGATCCTGGT GGGCGAGCGG GCCGCGGAGT TCCCCGGGGC CCTGTCCGCG
GCCGTCCGGC TGGCCGAGGC GACGGGTGCC TCGCTCGCCT GGGTTCCGCG GCGGGCCGGT
GACCGCGGCG CGGTCTCGGC CGGACTGCTT CCCTCGCTGC TGCCCGGCGG TCGCCCGGTC
ACGGATGCCG CCGGGCGCGC CGAGGTCGAG GAGGTCTGGG GGGGTCCGCT GCCCGGCGCG
CCCGGCCGCG ACACCGACGG CATGCTGGCT GCCGCCGCGG CCGGGCGGCT CGACGGGATG
ATCGTCGCCG GCGTTGACGC GGAGGACCTG CCGAATCCGG CGAGTGCGCT CGCGACGCTC
GCCCGGATGC CGTTCTGCGT CTCGATCGAG CTGCGCCGGT CGTCGATCGC CGAGGTCGCG
GACGTGGTTC TGCCCATCGC CCCGGTGGCG GAGAAGGCCG GCTCGTTCGT CGACTGGGAG
GGCCGGCTGC GACCGTTCCA GCGGGCCCTG GACACCCCGG CCCTGCCAGA CGTCCGGGTG
TTGCACCTGC TTGCCGCCGA GATGGGCCTC GACCTCGGCC TGCCCGACGC CGGCGCGGCC
GCCCGCGAGC TGCGCGCCCT GGGTCGGGCC GGCGACGGCG TGAGCCGGGT GCCCGCACCG
TCGGAGCCGA TCGCCGAGCC GCCCGTCGCG GGGGTGGGTG AGGCGGTGCT CGCGACCTGG
CACCAGCTCG TCGACGACGG CGCTCTGCAG GCCGACGAGC CCTACCTGGC GGGCACCGCG
CGCCCGGCGG TGGCCCGCCT GTCCGCCGCG ACCGCCGCCG AGATCGGCGC CGTGGCCGGG
AGACGGGTGA CCATCACGGC CTCCCGTGGC TCGATCACCC TGCCCGTCGA GGTCACGGCG
ATGCCCGACC GGGTGGTCTG GGTGCCGACC CACTCGCCGG GCTCGCACGT GCGCCGGGCG
CTCGCCGGGG ACGCCGGGGT CCTCGTCCGG GTGGGCCCCG CCGAGGACGA TCCCCCTGCC
GAGGACGGCA CCCCCGCCGG GGACGACACT CCCGGAGGCC GCGCATGA
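
The record lists a GC content of 75% for this gene. As a minimal sketch of how such a figure could be recomputed from the sequence block above, the hypothetical helper below strips the spaces and line breaks used in this layout and returns the fraction of G and C bases. It assumes the block contains only A, C, G and T.

```python
def gc_fraction(seq_block: str) -> float:
    """Fraction of G/C bases in a sequence block, ignoring whitespace."""
    # Remove the 10-base grouping spaces and line breaks, normalise case.
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return gc_count / len(seq)


# First line of the gene sequence only; paste the whole block to check the full gene.
first_line = "ATGACCGTCG CCCCATCCAG GCCTCGCGCC TCCGAGCCGG CCCAGCCGGC CGCACCCGAC"
print(f"{gc_fraction(first_line):.1%}")
```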
 
Protein sequence
MTVAPSRPRA SEPAQPAAPD LITLTIDGLS VSVPKGTLII RAAELLGIEI PRFCDHPLLD 
PVGACRQCIV EVEGQRKPIA SCTTTVAADM VVKTQLTSPV ARKAQAGTLE FLLLNHPLDC
PICDKGGECP LQNQSMANGG AVSRFKETKR VYPKPLAIST EILLDRERCV LCARCTRFSA
QIAGDPFIEL FERGAAEQVA VSDGQPFSSY FSGNTVQICP VGALTSAAYR FRARPFDLVS
TPTACEHCAS GCSLRTDHRR GRVTRRLAGD DPAVNEEWNC DKGRFAFTYA RAADRLTTPL
IRDDDTGQLV PVSWSEALKY AARGLAECRD RRGVGVLTGG RLTREDAYAY AKFTRVALAS
NDVDFRARPH SAEEEQFLGY AVAGTGIGVT YADLEAAPAV LLVAFEPEEE SPIVFLRLRK
AVDKHAAAVH ALAPLASRGL TKLAGTLVPT RPGEEAAVLD ALAAPDRGGP EAPTTRALRA
PGAVILVGER AAEFPGALSA AVRLAEATGA SLAWVPRRAG DRGAVSAGLL PSLLPGGRPV
TDAAGRAEVE EVWGGPLPGA PGRDTDGMLA AAAAGRLDGM IVAGVDAEDL PNPASALATL
ARMPFCVSIE LRRSSIAEVA DVVLPIAPVA EKAGSFVDWE GRLRPFQRAL DTPALPDVRV
LHLLAAEMGL DLGLPDAGAA ARELRALGRA GDGVSRVPAP SEPIAEPPVA GVGEAVLATW
HQLVDDGALQ ADEPYLAGTA RPAVARLSAA TAAEIGAVAG RRVTITASRG SITLPVEVTA
MPDRVVWVPT HSPGSHVRRA LAGDAGVLVR VGPAEDDPPA EDGTPAGDDT PGGRA