Gene Franean1_6085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6085 
Symbol 
ID5674406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7407674 
End bp7408645 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content73% 
IMG OID641244937 
ProductNADH dehydrogenase subunit J 
Protein accessionYP_001510335 
Protein GI158317827 
COG category[C] Energy production and conversion 
COG ID[COG0839] NADH:ubiquinone oxidoreductase subunit 6 (chain J) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.302185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0550219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCGC AGATTCTCGC CCAGGCGGCG GAGATCAGCA GCACGTCGAA CGGCGAGGCC 
TGGACGTTCT GGCTGCTCGC GCCGGTGGCG CTGCTGGCGG CGCTCGGCAT GGTCCTGATG
CGCAGCGCGG TGCACTCCGC CCTGCTGCTC GTGGTGAACC TGTTCTGCGT CGCGGTGTTC
TACCTGATCC AGGACGCGCC CTTCCTGGGC TTCGTCCAGA TCATCGTCTA CACCGGCGCG
ATCATGGTGT TGTTCCTGTT CGTGCTGATG CTGGTCGGAG TCGATTCGTC CGACTCGCTG
GTCGAGACCC TGCGCGGCCA GCGGATCGCC GCGGTGATCC TCGGGCTCGG CTTCGCCGGC
CTGCTGGCGT TCCCGATCGG CCGCGCGATC GACGGTGGCA AGGCGGCGGG CCTGGAGGCG
GCGAACACCG GCGGCAACGT CCACGCGATC GGACGTCTGC TGTTCACCCA GTACGTCTTC
GTCTTCGAAG CGATCTCGGT GCTGCTCGTG GTCGCGGCGA TCGGGACGAT GGTCCTCGGC
CACCGCGAGC ACACCGGTGA GAAGGTCACG CAGAAGGAAC GGATGCGCGC TCGCTTCGCC
GAGGGCGGCA CCGTCACCCC GCGGCCCGGG CCGAAGGTCT TCGCGACCAA CCCCGGCCCG
GAGCAGCCCG AGCTCGTCGC GGCCGGCGGT GGCGATGCCG CCGGCGGCGG CCCGGACCTC
GGCGGGCCCG GTGCGGGCCC GTCCGGCCCG GCCGGCGCGG GTGGTCCGGA CGACGGCGGT
ACCGGTGGTC CGGGCGGTCC TGGGACAGGC GGTCCCGGTG CGGGCGGTGC TGGTGGTCCG
GGTACGGACG GCCCGGGCGG CGGTGGGCCC GACGGCGGGC CGAGCGCGAA CGGGACGAGT
GAGGACTCCG ACGACGAGAC CGTCGTCGGC GGGTCGGCGC CGGTGGGCGC GGGCAGGGGG
AGCGGTCGGT GA
 
Protein sequence
MNPQILAQAA EISSTSNGEA WTFWLLAPVA LLAALGMVLM RSAVHSALLL VVNLFCVAVF 
YLIQDAPFLG FVQIIVYTGA IMVLFLFVLM LVGVDSSDSL VETLRGQRIA AVILGLGFAG
LLAFPIGRAI DGGKAAGLEA ANTGGNVHAI GRLLFTQYVF VFEAISVLLV VAAIGTMVLG
HREHTGEKVT QKERMRARFA EGGTVTPRPG PKVFATNPGP EQPELVAAGG GDAAGGGPDL
GGPGAGPSGP AGAGGPDDGG TGGPGGPGTG GPGAGGAGGP GTDGPGGGGP DGGPSANGTS
EDSDDETVVG GSAPVGAGRG SGR