Gene Franean1_7154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7154 
Symbol 
ID5675457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8735572 
End bp8736627 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content69% 
IMG OID641245993 
Productalcohol dehydrogenase 
Protein accessionYP_001511381 
Protein GI158318873 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0462629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0194652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCG CTGTGCTGCG GGAGGGAGTC GTCGAGGCCC GGGTCATCGA CGACCCGGTG 
CCGGGGCCGG GCCAGCTGCT GGTGCGGTCG CTCGCGTGTG GGATCTGCGC GTCGGACATC
CACTTCATGG ATCATCTGGA AGCGGGCGTC GACGATGACA GCGGGATGTC GACCTACGAC
CGTGATGTCG ACATCGTCAT GGGTCACGAG TACTGCGCCG AGGTCGTCGA CTACGGCCCC
GGCACCGAGC GGCGGATCCC CGTGGGCGCC CGGGTGAGCT CGCTGCCGGT GCTGTCCACG
GCCACCGGGC GGAAGATCAT CGGGCAGAAT CCGGAGTCGC CCGGCGGGTT CGGTGAGTAT
CTCCTGCTCG ACGAGGCCAT GACCCGGGTC GCGGTCTCCG AGCTCCCGAA CGAGATCGTG
TGCATCGCGG ACGCGGTCTC GGTCGGCTTG TCGGCCGCCT CCCGAGCGCA GGTGACGGCG
AAGGAGGTGC CGCTGGTCAT CGGCTGCGGG GCGATCGCTC TGTCCGTGAT CGCGCAGCTG
AAGCGGCTGG GGGTCGGGCC GATCCTGGCG GTGGACTTCG TCGCCTCGCG TCGCGAGACC
GCGCTGGCCA TGGGAGCGGA CGTGGTCATC GACCCCGCCG CGGTGTCCCC GTACCAGGCC
TGGCGTGACG TGGCCTACGG GTCGCCCGAG GCGATGAGGG AACTGATGGC GGTCGCCGGC
CTGCCGGGAT GCGTCGTGTT CGAGTGCGTC GGTATTCCCG GTGTCCTGGA TTCGATCATC
AAGGGCTGCG AGCGCAACAC CCGGATCTTC TCGGTGGGAG GTCCGCCGGA AGGCGATCAC
CTGCACACCC TCACCGCCAA GCGGAAAGGC ATCAACATCC AGTTCGGGGG CGGCCCGTCG
ATGCAGCACT GGGACGAGGC ATTCGCGGCG GTCGGCTCGG GCGACCTCGA CGTCACACCG
ATGCTCGGCC GAACCGTCGG GCTCGACGAC GTCGCCGAGG CGCTCAACGC CTCCCGCGAC
GCCAACGGAC CCGTCCGCAT CGTCGTCGTG CCCTGA
 
Protein sequence
MRAAVLREGV VEARVIDDPV PGPGQLLVRS LACGICASDI HFMDHLEAGV DDDSGMSTYD 
RDVDIVMGHE YCAEVVDYGP GTERRIPVGA RVSSLPVLST ATGRKIIGQN PESPGGFGEY
LLLDEAMTRV AVSELPNEIV CIADAVSVGL SAASRAQVTA KEVPLVIGCG AIALSVIAQL
KRLGVGPILA VDFVASRRET ALAMGADVVI DPAAVSPYQA WRDVAYGSPE AMRELMAVAG
LPGCVVFECV GIPGVLDSII KGCERNTRIF SVGGPPEGDH LHTLTAKRKG INIQFGGGPS
MQHWDEAFAA VGSGDLDVTP MLGRTVGLDD VAEALNASRD ANGPVRIVVV P