Gene Franean1_6091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6091 
Symbol 
ID5674412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7415429 
End bp7416802 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content69% 
IMG OID641244943 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001510341 
Protein GI158317833 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.489458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0112827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC ACACCGAGAC GCCCGTCGAC GGTTCGGCCG AGACCATCAC CGGCGCGCAG 
CCCTACGAGG CCGGGTTCAC CGAGTCCTCC GCCGGCCGCG TCTACACCGT GACGGGCGGT
GACTGGGAGC AGGTCCTCGG CGTCGGCGAG GACGACGGTG AGCGGATCAC CGTCAACATG
GGCCCGCAGC ACCCGTCCAC CCACGGCGTG CTGCGGCTGG TGCTGGAGAT CGAGGGCGAG
ACGGTTACCG AGACCCGCCT GGTCATCGGC TACCTGCACA CCGGGATCGA GAAGAGCTGC
GAGTACCGCA CCTGGACGCA GGCGGTCACG TTCCTCACCC GCGCCGACTA CCTCTCGCCG
CTGTACAACG AGGCGGCGTA CTGCCTGTCG GTGGAGAAGC TGCTCGGCAT CACCGGCGAG
GTGCCGGAGC GGGCGACCGT CATCCGGGTG CTCGTCATGG AGCTGCAGCG GATCGCCTCG
CACCTGGTGT GGCTGGCGAC CGGAGGCATG GAGCTCGGCG CCACCACCGG CATGATCTTC
GGGTTCCGTG AGCGGGAGAA GATCCTCGAC CTGCTCGAGA CGATCACCGG CCTGCGGATG
AACCACGCCT ACATCCGCCC CGGCGGCCTG GCCCAGGACA TCCCGGACGA GGTGATCCCG
GAGATCCGCG CGTTCCTCGA CTACATGCCC AAGCGCATCC GCGAGTACCA CGCGCTGCTG
ACCGGCCAGC CCATCTGGAA GGCGCGGATG GTCGACGTCA ACTTCCTCGA CGCCGCCGCC
TGCCTCGCGC TGGGGACGAC CGGCCCGGTG CTGCGCGCCG CCGGTCTGCC CTGGGACCTG
CGCAAGACCA TGCCGTACTG CGGCTACGAG ACCTACGAGT TCGACGTCCC GACCGCGCTC
GAGGGCGACT CCTACGCCCG GTACCTGGTG CGGATCGAGG AGATGGGCGA GTCCCTCAAG
ATCATCGAGC AGTGCCTGGA CCGGCTGCGC CCCGGCCCGG TCATGGTCGC CGACAAGAAG
ATCGCCTGGC CGTCGCAGCT GGCCATCGGC TCGGATGGCA TGGGCAACTC GCTCGAGTAC
ATCCGCAAGA TCATGGGGAC CTCGATGGAG GCCCTGATCC ACCACTTCAA GCTCGTCACC
GAGGGCTTCC GGGTGCCCGC CGGCCAGGTG TACACGCAGA TCGAGTCGCC GCGCGGCGAG
CTCGGCTACC ACGTCGTCAG CGACGGCGGC ACCAGGCCGT TCCGCGTCCA CGTGCGGGAC
CCAAGCTTCG TGAACCTGCA GGCCGTGCCC GCGCTGACGG AGGGCGGGCA GGTGGCCGAC
GTGATCGTCG GCGTCGCCTC CGTCGACCCG GTGCTCGGGG GAGTTGATCG CTGA
 
Protein sequence
MSTHTETPVD GSAETITGAQ PYEAGFTESS AGRVYTVTGG DWEQVLGVGE DDGERITVNM 
GPQHPSTHGV LRLVLEIEGE TVTETRLVIG YLHTGIEKSC EYRTWTQAVT FLTRADYLSP
LYNEAAYCLS VEKLLGITGE VPERATVIRV LVMELQRIAS HLVWLATGGM ELGATTGMIF
GFREREKILD LLETITGLRM NHAYIRPGGL AQDIPDEVIP EIRAFLDYMP KRIREYHALL
TGQPIWKARM VDVNFLDAAA CLALGTTGPV LRAAGLPWDL RKTMPYCGYE TYEFDVPTAL
EGDSYARYLV RIEEMGESLK IIEQCLDRLR PGPVMVADKK IAWPSQLAIG SDGMGNSLEY
IRKIMGTSME ALIHHFKLVT EGFRVPAGQV YTQIESPRGE LGYHVVSDGG TRPFRVHVRD
PSFVNLQAVP ALTEGGQVAD VIVGVASVDP VLGGVDR