Gene Franean1_4169 details


Gene Information

Locus tag: Franean1_4169
Symbol:
ID: 5672524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4954737
End bp: 4955999
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 75%
IMG OID: 641243042
Product: alcohol dehydrogenase
Protein accession: YP_001508459
Protein GI: 158315951
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
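
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with its values. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(4954737, 4955999)
print(length_bp, protein_length(length_bp))  # 1263 420
```

Under these assumptions the computed values agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.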


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.235444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.177218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACGCG GTCTGGAGCT GTACCGGTCG CTGCCGCGGT ACGCGGCGGC GCGGGTGGTG 
TCGGGCCGGT TCCCGCGGCT GTCGGGGGCG GCGGCGACGA CCGCGGCGCC GCTGCGGCTG
GTCGACCGGG GGGATCCGGG TCTGCCGGGG CCGGGGTGGG TGACGGTGCG GCCACGGCTG
GCGGGGATCT GCGGGTCGGA TCTGGCGACG GTGACGGGGC AGAGCTCGTT CTACTTCTCC
CCGCTGGTGT CGATGCCGTT CACCCCCGGC CATGAGATCG TCGGTGACCT GCAGGAGGCG
GTGACGCTCG CCGACGGGCG CCGGTTGGAC GCCGGTGCCC GGGTGGTGAT CGACCCGGTG
CTGGGCTGCG CGGCCCGCGG GTTGGAGCTG TGCGTGGGCT GCGCGGCGGG CCGGACGTCA
CGCTGCGACC GGATCACGGT GGGGCATCTG GCGCCGGGGT TGCAGACCGG GTTCTGCGCG
GACACCGGTG GCGGGTGGAG CCGGGCGCTG GTCGCCCATC ACAGCCAGCT GCATCCGGTG
CCCGACACGC TGCCCGACTC TCGGGCGGTG CTGGTCGAGC CGTTGGCGAC CGCCGTGCAC
ACCGCGGGCC GCTGCGGGGT GCGTTCCGGG GACCGGGTGC TGATCATCGG GTCGGGGGCG
GTGGGCCTGC TGACGCTGCT GGCCATCCGC GCCTATACGA AGGCCGAGCA TGTGACGATG
GTCGCCAAGC ATCGGCGGCA GGTGGAGCTG GCGCGTCGTT TCGGCGCGGA CGAGGTGCTC
GCCCCCGACG ACGCGGTCGG CGGGGTGCGC CGCGCGAACC GGGCGTTGCG GCTGACCCCG
CAGCTGGGTG GGGAGTATCT GCTCGGCGGG GTGGATGTGG CGATCGACTG TGCGGGCAGC
GCGTCGTCGC TGTCGACGGC GCTGCGGGTG ACCCGGGCCG GTGGCCGGGT GGTGCTCTCC
GGGGTGCCGG CGGGGTCGGT GGATCTGACC CCGCTGTGGT TCCGGGAGCT GGAGCTGGTG
GGGACGTACG CGTCGTCCGG TGGCGCCCGG CCCGGCCGGG CCGGTACCGA ACCGGCGGGA
CCAGCGGAGC CGGTGGAGTC GGATTTCGGG CGGGCGTTGG CGCTGGCCGC CACGGCCCCG
CTCGACGGGG TGGTGTCGGC GGTGTATCCG CTCACCCGGT GGCGGGAGGC GTTGGACCAT
GCGTTGTCCG CGGGGCGTCT CGGCGCCGTG AAGATCGTTT TTGATCCGGC GGCGTCGGCG
TGA
 
Protein sequence
MTRGLELYRS LPRYAAARVV SGRFPRLSGA AATTAAPLRL VDRGDPGLPG PGWVTVRPRL 
AGICGSDLAT VTGQSSFYFS PLVSMPFTPG HEIVGDLQEA VTLADGRRLD AGARVVIDPV
LGCAARGLEL CVGCAAGRTS RCDRITVGHL APGLQTGFCA DTGGGWSRAL VAHHSQLHPV
PDTLPDSRAV LVEPLATAVH TAGRCGVRSG DRVLIIGSGA VGLLTLLAIR AYTKAEHVTM
VAKHRRQVEL ARRFGADEVL APDDAVGGVR RANRALRLTP QLGGEYLLGG VDVAIDCAGS
ASSLSTALRV TRAGGRVVLS GVPAGSVDLT PLWFRELELV GTYASSGGAR PGRAGTEPAG
PAEPVESDFG RALALAATAP LDGVVSAVYP LTRWREALDH ALSAGRLGAV KIVFDPAASA
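
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a DNA string codon by codon. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code, and it reads an alternative initiator such as GTG as methionine when it is the first codon. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers written for this example, and only three common bacterial start codons are listed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Common bacterial initiators (assumption: table 11 lists a few more).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGACACGCG GTCTGGAGCT GTACCGGTCG"))  # MTRGLELYRS
```

Applied to the first 30 bases of the gene sequence, this reproduces the first ten residues of the protein sequence.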