Gene Franean1_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0447 
Symbol 
ID5668869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp526304 
End bp527542 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content65% 
IMG OID641239379 
Productaldo/keto reductase 
Protein accessionYP_001504817 
Protein GI158312309 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTGC GCCTGCTGTA CCTGATCTTC GTTCGGGTCT GTGGCTGGCT GGTTCTACTC 
GGTCGCTCGT CGGCCGCCAA GGACTTGGAA TTGCTGGTGC TACGGCATGA GGTCACGGTG
CTGCGCCGTA CCCAGCCCAG GCCCCGGTGG GACTGGGCGG ACCGGGCGGT CCTCAATCAA
ACAAGATCAA GGAATGGCGG GGTCTGGCCA CCCGCTACGA CAAGACACCC GAAAGCTACG
CCGCAGGACT CCACCTGCGC GGATCCATCC TCTGGCTACG CAGCCTGCCA ACCCCATGAT
CCGAACTTGG AACAGACCCT AGGTACCGAC GGACCCGAGG TTCCCGTGGT CTGCGTCGGA
ACGAGCCCGC TGGGCGGGCT CCCGACAATC TACGGCTATG ACGTCGAGGC GGGGCAGGCG
GTGGCGACGA TTCGCCGCGT GTTGGAGTCA CCGATCGACT TCATCGACAC CTCGAACGAG
TACGCGAACG GCGAAAGCGA GCGGCGCATC GGCGAGGCGT TGCGCAGCGC GGCAGGGGGT
CCCGGCAATG TCGTGCTCGC GACCAAGGCA GATCCCGCGC TGTGGGCCAC GGAGTTCCCC
GGCAGCCGAG TGCGGGAATC GTTCCGGGAG AGCACCGAGC GGCTCGGGGT TGATCGGTTC
GAGGTGTTCT ACCTGCACGA CCCGGAGCGC TTCGATTTCG GGTACATGAC GGCCCCCGGA
GGTGCGGTCG AGGCGATGGT GCAGCTGCGT ACCGACGGCC TGGCCACGGC AATCGGCGTG
GCCGGCAGTG ACATCAGCGA GATGCGCCGC TACGTCGACC TCGGCGTCTT CGACGTCATC
CTGAACCACA ACCGGTATAC ACTTCTTGAT CGTTCTGCGG ACGCCCTCAT CGACCACGCG
GTCAACGCCG GCCTGTCCTT CATTAATGCC GCACCGTATG CCAGCGGCAT GCTCGCGAAG
CAGGTCTCGG CCCGTCCGAG GTATCAATAT CGCGCGCCAT CGCCCGAGAT CGTCCGCACC
ACCGCGTGGT TGCACCAGGA GTGCGCCCGG TTCCACGTGC CGCTCGCGGC ACTCGCCCTC
CAGTTCTCGA CACGCGATCC TCGGATCAGC TCGACGGTCG TCGGCGTATC AGCTCCGGAG
CGTGTGGATG AACTCGTGGA GAACGAGCAG CGTGAGATCC CGTCGGAGCT GTGGGACTCC
GTGCGCGAGC GGCTGGTGTT GCCGCCCACA GTGACGTAA
 
Protein sequence
MSVRLLYLIF VRVCGWLVLL GRSSAAKDLE LLVLRHEVTV LRRTQPRPRW DWADRAVLNQ 
TRSRNGGVWP PATTRHPKAT PQDSTCADPS SGYAACQPHD PNLEQTLGTD GPEVPVVCVG
TSPLGGLPTI YGYDVEAGQA VATIRRVLES PIDFIDTSNE YANGESERRI GEALRSAAGG
PGNVVLATKA DPALWATEFP GSRVRESFRE STERLGVDRF EVFYLHDPER FDFGYMTAPG
GAVEAMVQLR TDGLATAIGV AGSDISEMRR YVDLGVFDVI LNHNRYTLLD RSADALIDHA
VNAGLSFINA APYASGMLAK QVSARPRYQY RAPSPEIVRT TAWLHQECAR FHVPLAALAL
QFSTRDPRIS STVVGVSAPE RVDELVENEQ REIPSELWDS VRERLVLPPT VT