Gene Franean1_6081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6081 
Symbol 
ID5674402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7402070 
End bp7403647 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content71% 
IMG OID641244933 
ProductNADH dehydrogenase subunit N 
Protein accessionYP_001510331 
Protein GI158317823 
COG category[C] Energy production and conversion 
COG ID[COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N) 
TIGRFAM ID[TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.110673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCG CAACGACTGT GCTGGCGCAG GGCGCCGCGC AGAAGATCAC ACCGCCGTCG 
ATCGAGTACT CGTCCCTCAG CCCGATGCTC ATCCTGTTCG GCGTCGCGCT GGCCGGGGTC
CTCGTCGACG CCTTCGCCCC GCCGAAGGCC CGGCGGGTGC TCCAGCCGCT GCTGGCCGGG
GCGGGGTTCA TCGGCGCGTT CGTCGCGGTG GTGCTCCTGC ACGCCCACCG CCAGGTGCTC
GCCGCCGGCG CGGTGGCCAT CGACGGGCCG ACGCTGTTCC TGCAGGGCAC CATCCTGGTG
TTCGCGCTGC TGTCGGTGCT GCTGGTCGCC GAGCGCCGGC TGGACTCCTC CGGCGGCGCG
CTCGTCGCCT CCGCCGCGGT CGTCCCCGGC TCGCGCGGCT CCACCGCGCA GCGCACCTCG
CCGGACGTGC AGACCGAGGC GTATCCGCTG ATGGTCTTCT CGGTCAGCGG CATGCTGCTC
TTCGTCGCCT CGAACAACCT GCTGGTCATG TTCGTGGCGC TGGAGATCCT CTCGCTGCCG
CTGTACCTGC TGTGCGGGCT GGCGCGGCGC CGGCGGCTGC TGTCGCAGGA AGCCGCGATG
AAGTACTTCC TGCTCGGGGC GTTCTCGTCC GCGTTCTTCC TGTACGGCGT CGCGTTCGCC
TACGGCTACG CTGGCAGCGT CGAGCTCGGC CGGGTCGCCG ACGCGGTCGG CACGGTCGGC
CAGAACGACA CCTACCTCTA CCTGTCGCTC GCGCTGCTCG GCGTCGGCCT GTTCTTCAAG
ATCGGTGCCG CCCCGTTCCA CTCCTGGACG CCGGACGTCT ACCAGGGCGC GCCGACACCG
ATCACCGCCT TCATGGCGGC CGGGACGAAG GTCGCCGCTT TCGGGGCGCT GCTGCGTGTC
TTCTACGTGG CCTTCGGCGG CATGCGCTGG GACTGGCGGC CGGTGATCTG GGCTGTGGCG
ATCCTCACCA TGGTCGTCGG CGCGGTGCTC GCGCTCACCC AGCGTGACAT CAAGCGGATG
CTCGCGTACT CCGCGGTCGC CCACGCCGGC TTCCTGCTCG TCGGGATGGC CGGCTCGAAC
ATCGACGGCC TGCGCGGCGC CATGTTCTAC CTGGTGACCT ACGGCTTCAC CACGATCGCG
GCGTTCGCGG TGGTCTCCCT GGTCCGCACC GGCGACGGCG AGGCCAGCGA CCTCTCCCAG
TGGCAGGGCC TGGGCCGGAC GTCGCCGCTG CTGGCAGGCA CGTTCGCCTT CCTACTGCTC
GCGCTCGCCG GCATCCCGCT GACCAGCGGG TTCACCGGGA AGTTCGCGGT CTTCCAGGCG
GCGATCGCCG GTGACGCCAC CCCGCTGGTG GTCGTGGCCC TGGTGTGCAG CGCGATCGCG
GCGTTCTTCT ACGTCCGGGT CATCGTGCTG ATGTTCTTCT CCGAGCCGCT CGCCGACGGG
CCGGTCGTTG TCACCCGGCC GACTCTTACC TTCGCTACGG TGGGTATAGG TGCACTCATG
ACCCTGCTGT TGGGCGTGGC GCCGCAGCCG CTTCTCGACC TGGCGACCAC CGCGGCGACC
TCCGGCTTCG TACGCTGA
 
Protein sequence
MSAATTVLAQ GAAQKITPPS IEYSSLSPML ILFGVALAGV LVDAFAPPKA RRVLQPLLAG 
AGFIGAFVAV VLLHAHRQVL AAGAVAIDGP TLFLQGTILV FALLSVLLVA ERRLDSSGGA
LVASAAVVPG SRGSTAQRTS PDVQTEAYPL MVFSVSGMLL FVASNNLLVM FVALEILSLP
LYLLCGLARR RRLLSQEAAM KYFLLGAFSS AFFLYGVAFA YGYAGSVELG RVADAVGTVG
QNDTYLYLSL ALLGVGLFFK IGAAPFHSWT PDVYQGAPTP ITAFMAAGTK VAAFGALLRV
FYVAFGGMRW DWRPVIWAVA ILTMVVGAVL ALTQRDIKRM LAYSAVAHAG FLLVGMAGSN
IDGLRGAMFY LVTYGFTTIA AFAVVSLVRT GDGEASDLSQ WQGLGRTSPL LAGTFAFLLL
ALAGIPLTSG FTGKFAVFQA AIAGDATPLV VVALVCSAIA AFFYVRVIVL MFFSEPLADG
PVVVTRPTLT FATVGIGALM TLLLGVAPQP LLDLATTAAT SGFVR