Gene Franean1_5391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5391 
Symbol 
ID5673723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6501764 
End bp6502978 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID641244247 
Productalcohol dehydrogenase 
Protein accessionYP_001509653 
Protein GI158317145 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.515242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAG TCGTGTGGCA CGGCGTGGGC GACATCCGCC TTGACACCGT CACCGAACCG 
AAGATCGAAC AGCCGACCGA TGCCGTCGTT CGCATCACCA CCGCGGCGAT CTGTGGCACG
GATCTTCATT TCGTCCGGGG CACAGTACCG GGGATGAAGC CGGGCCTCAT CATCGGGCAC
GAAGGGGTCG GCGTGGTCGA AGAGGTCGGC CAGAGCGTCC GCAACTTCAG GCGCGGCGAT
CGGGTGCTGC TGTCCGCCGT CCTCGGGTGC GGCTCCTGCA CCTACTGCCG CAGCGGCTAT
TTCGCCCAAT GTGACGACAT CAACCCGTAC GGCCGCAGGA CCGGATCTGC CTTCTACGGC
GCTCCGAGGG ACAATGGCTC GTTCGACGGG TTGCAGGCTG AGTACGCCCG CGTACCCTAC
GCCCACACCA ATCTGTTCCG GCTGTCAGAC TCGATCTCCG ATGATCAGGC GATCCCGCTG
TCGGACATCT ACCCCACCGG ATACTTCGGC GCAGTCATCG CGGAAGTATC GGATGGCGAC
GTGGTGGCGG TCTGGGGCTG CGGGCCGGTG GGACAGTTCG CCGTTCTGTC CTCATTCCAG
CGCGGCGCCG CGCGGGTGAT CGCGATCGAT GGTCACGCCG ACCGACTCGA CCGTGCCCAG
GCGCTCGGAG CCGAGGTGGT CAACTTCAAC GAAGAGGACC CTGTCGAGGC AATCCTGGAT
CTGACACGCG GTATCGGTCC CGACCGGGCC ATCGACGCCG TGGGGGTGGA CGCGGAAAGC
CCGAAGTCCG GCCCCGCCGC CGCCCGCGCC CGCGAACAGG ACGATCAGCA CCGCGAGGAA
CTGCGTCAGA TCGCCCCCGA GACTCACGCG CACAACGGAC ACTGGAAGCC TGGCGACGCG
CCGACGCAGG CCCACTCCTG GGCAGTCGAG AGCCTGGCCA AGGCAGGCAC GCTGGGCATC
ATCGGGGTGT ATCCGCCGAC CGACAGGTTC TTCCCGATCG GCACCGCGAT GAACAAGAAC
CTCACCATCA ACATGGGAAA CGGCAACCAT CCGCGGTACA TCCCGAAGCT GCTGGATATG
GTGGAGTCGG GAGTGGTGCA CCCACAGAAA ATGGTCACCC AGCATGAGCC GATGCGGGAC
GTGCTCGCCG CCTACGAGGA GTTCGACCTG CGCCGTCCTG GCTGGCTCAA GGTGGCCCTG
GACCTGACCA GCTAA
 
Protein sequence
MRAVVWHGVG DIRLDTVTEP KIEQPTDAVV RITTAAICGT DLHFVRGTVP GMKPGLIIGH 
EGVGVVEEVG QSVRNFRRGD RVLLSAVLGC GSCTYCRSGY FAQCDDINPY GRRTGSAFYG
APRDNGSFDG LQAEYARVPY AHTNLFRLSD SISDDQAIPL SDIYPTGYFG AVIAEVSDGD
VVAVWGCGPV GQFAVLSSFQ RGAARVIAID GHADRLDRAQ ALGAEVVNFN EEDPVEAILD
LTRGIGPDRA IDAVGVDAES PKSGPAAARA REQDDQHREE LRQIAPETHA HNGHWKPGDA
PTQAHSWAVE SLAKAGTLGI IGVYPPTDRF FPIGTAMNKN LTINMGNGNH PRYIPKLLDM
VESGVVHPQK MVTQHEPMRD VLAAYEEFDL RRPGWLKVAL DLTS