Gene Franean1_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3784 
Symbol 
ID5672148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4487024 
End bp4488085 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content73% 
IMG OID641242663 
Productaldo/keto reductase 
Protein accessionYP_001508083 
Protein GI158315575 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.116589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGTC CCTGCCGCGG CGTCGCCCGC CGCAGCAGAC TGGCGGTCAT GACGCAGACG 
ACGATGACGA CAAGGCGGCT CGGGGCAACC GGGCCGGAGG TGGGTGCGCT CGGGCTCGGT
TGCATGGGCA TGTCCGGCAT GTACGGGCCA GCCGACGACG ATGAGAGCGT GGCGACGATC
CTCGCCGCGG TCGACGCGGG GATGACCCTG CTCGACACCG GCGACTTCTA CGGCATGGGA
CACAACGAGC TGCTGGTCGG CCGCGCCCTG CGCCAGCTCG ACCGGGACGC GGTGACGGTG
AGCGTGAAGT TCGGGGCGCT GCGCGACGCG GTCGGCGGCT GGGGTGGTCT CGACGCCCGG
CCGGCGGCGC TGCGCAACTT CCTCGCCTAC TCGCTGGTGC GCCTGGGCAC CGACCATGTG
GACGTCTACC GGCCGGCCCG GCTCGACCCC GCCGTGCCGA TCGAGGAGAC GATCGGTGCC
ATCGCCGAGC AGGTCAAGGC CGGTCTGGTC CGGCACATCG GCCTGTCGGA GGTCGGCCCG
GAGACGATCC GGCGGGCGGC GGCCGTGCAC CCGATCTGCG ACCTGCAGAT CGAGTACTCG
GTGCTTTCCC GCGGCATCGA GGACGAGATC CTGGCGACCT GCCGCGAGCT CGGGATCGCG
ATCACCGCGT ACGGGGTGCT CTCCCGCGGG CTGATCGCCG GCACCGGCCC GTCGGGGCAC
AGCAACGACT TCCGGGCACA CAGCCCGCGC TTCCAGGGCG CGAACCTCGA CCACAACCTG
GGGCTCGTCG AGCGGCTGCG GCCGGTCGCC GAGCGCCACG GGATCTCCGT CGCGCAGCTG
GCCATCGCCT GGGTCGCCGC CGCAGGTCCG GACGTCATCC CGCTGGTGGG GATGCGCCGG
CGCAGCCGCA TCGATGACGC CCTGGCCGCC GCGGCCGTCA CCCTGTCCGA GCAGGATCTG
GCCGACGTCG ACCGGGCGGT GCCCGCCGGG TCGGCCGCGG GCGGACGATA CGAGGACGCC
CAGCTCGCCG CCCTCGACAG CGAGCGTCCC GCCGAGCGCT GA
 
Protein sequence
MNGPCRGVAR RSRLAVMTQT TMTTRRLGAT GPEVGALGLG CMGMSGMYGP ADDDESVATI 
LAAVDAGMTL LDTGDFYGMG HNELLVGRAL RQLDRDAVTV SVKFGALRDA VGGWGGLDAR
PAALRNFLAY SLVRLGTDHV DVYRPARLDP AVPIEETIGA IAEQVKAGLV RHIGLSEVGP
ETIRRAAAVH PICDLQIEYS VLSRGIEDEI LATCRELGIA ITAYGVLSRG LIAGTGPSGH
SNDFRAHSPR FQGANLDHNL GLVERLRPVA ERHGISVAQL AIAWVAAAGP DVIPLVGMRR
RSRIDDALAA AAVTLSEQDL ADVDRAVPAG SAAGGRYEDA QLAALDSERP AER