Gene Franean1_4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4598 
Symbol 
ID5672943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5479162 
End bp5480199 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID641243459 
Productalcohol dehydrogenase 
Protein accessionYP_001508875 
Protein GI158316367 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGTG CGATTGTTTT CAACGGTGAC CGGACCTGGG AGGAGCGCGA CCTGCCGGTG 
CCTGATCCCC AGCCGGGTGG CGCGGTGCTG CGGGTGGAGG CGACGGGCTT GTGTCACGGC
GACGTCGACC AGTTCCACGG CATCGGTCGC ACCCCGAGAG GCGGGGCGTT CCCCGTGGTT
CCAGGCCATG AGGTCGTCGG CCGGATCGAG AAGATCGACG CGCGGGCAGC CGAGGAATGG
GGCGTCGCGG AAGGAGACCG GGTCGCCGTC CGCACGATCG TCATCACCCC CGAAGGCGGC
ACCCGCGCCT ACGGGATCGA CTTCTCGGTG AAGGAGGGCT CCGGTCTCTA CGGCGGTTAT
GCCGACTACA TGGAGATCCT GCCGGGATCC GCGGTCTACC GCCTCCGGGA GGACCTTCCC
GCGGCGGAGC TCACGATCTT CGAGGCGTTG TCCTGCGCGG TCACGTGGGT CCACCCGGTC
AAGGACGATC ACACCGTCGT CATCGAGGGA CCCGGCCACA TGGGCCTGGC CACCGTCGTC
GCGGCCCGCG CCGCGGGCGC TGGCACGATC GTGGTCACCG GCCTCTCGCA GGACCGGTCC
CGGCTCGACT GCGCCCTGCA GGTGGGCGCT GACCACGTGA TCGACGTCCA GACGGAGAAC
GCCGCGCAGC GCCTCGCCGA CATCACCGGT GGACGCATGG CCGACGTCGT GATCGACGCG
GCGTCCGGGA GCTCGGTGAC GGTCAACACC GCGATGGAGC TTGTCGGCAG GGGCGGCCAC
ATCGTCATCG CCGGGCTGAA GGACGAGCCG GTGAACGGCC TGGACAGCAA CTCGCTCCTG
TTCCGGGGGA TCACCATCGG TCCCGGGGCC GGACTCGACG CGGCCCGCGC GGTCGCGCTC
ATCAACGACG GCCAGGTGCC GACCGCCGCG CTGGCCGGCG AGACCTTCCC GCTCGATCGC
TTCGAAGACG CCTTCGCGCT GCTGGATCGC CGTGTCCCCG GCCGTGACGC GGTGCGGGTG
TCGCTGCACG TCTCGTGA
 
Protein sequence
MGRAIVFNGD RTWEERDLPV PDPQPGGAVL RVEATGLCHG DVDQFHGIGR TPRGGAFPVV 
PGHEVVGRIE KIDARAAEEW GVAEGDRVAV RTIVITPEGG TRAYGIDFSV KEGSGLYGGY
ADYMEILPGS AVYRLREDLP AAELTIFEAL SCAVTWVHPV KDDHTVVIEG PGHMGLATVV
AARAAGAGTI VVTGLSQDRS RLDCALQVGA DHVIDVQTEN AAQRLADITG GRMADVVIDA
ASGSSVTVNT AMELVGRGGH IVIAGLKDEP VNGLDSNSLL FRGITIGPGA GLDAARAVAL
INDGQVPTAA LAGETFPLDR FEDAFALLDR RVPGRDAVRV SLHVS