Gene Franean1_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0585 
Symbol 
ID5669002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp677604 
End bp678638 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID641239512 
Productaldo/keto reductase 
Protein accessionYP_001504950 
Protein GI158312442 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGATACC GCAGATTCGG AACAACCGGT GTCGAGGTAA GCACCCAGTG TCTGGGCACG 
ATGAACTTCG GCCGGCTGGG TAACGTCGAT CCCGACGACA GCGTGCGGGT CGTCAACCGC
GCGCTGGACG GCGGAATCAA CTTCATCGAC ACGGCCGACG TGTACTCCCA CGGCGAGAGC
GAGGAGATCA TCGGACGGGC GGTGAAATCC CGCCGGGATG ACGTGGTGCT GGCGACCAAG
TGCTTCTTCC CCATGGGCGG CCAGCTTCAG CGGGGTTCCT CACGTCGATG GATCATGCAG
GCTGTCGAGG GCAGCCTGCG CCGGCTGGGC ACCGATCGCA TCGACCTGTA CCAGATACAC
AAGCTCGACT GGAACACCGA TCTGGAGGAG ACCCTCGGTG CCCTGACCGA CCTCGTGCGG
CAGGGCAAGG TCCTCTACCT GGGCTCGTCG TCGTTCCCGG CGGACTGGAT CGTCGAGGCC
CAGTGGACGG CGGCCCGGCG CGGCGGTGAA CGGTTCGTCT GCGAGCAGCC GCAGTACTCG
GTCTTCGCCC GGTCGGTGGA GCAGGCGGTC CTGCCCGCCT GCCAGCGGCA TCGGATGGCG
GTCATCCCCT GGAGCCCACT CGCCGGCGGC TGGCTGACCG GCAAGTACCA GCGCGGCCAG
GAACCGCCGA CCGGCTCACG CTACGACCCG GACAGCCCCT TCATGCAGGG AACCGTCAGC
TCGGCCGCCG AGCGCTCCTC CCCTGTCCGG TTCGACGCCG TCGACGCCCT GCGCGGCGTC
GCCGACCAGG CGGGCATCAC CCTGACCGAG CTGGCCATGG CCTTCGTCGC CAACCACCCC
GCCATCACCT CGACGATCAT CGGGCCGCGC ACCATGAAGC ACCTCGAGGA CGCGCTGAAC
GCGGCGGACG TCGACCTCGA CGAGGACGTC CTCGACGCCA TCGACAAGAT CGTGCCTCCC
GGCACCGACA TGCCCGGGAT CGACCACTTC ACCCAGCACC CGGCCCTCCT GCCCGCCGCC
CGCCGACGCA CGTGA
 
Protein sequence
MRYRRFGTTG VEVSTQCLGT MNFGRLGNVD PDDSVRVVNR ALDGGINFID TADVYSHGES 
EEIIGRAVKS RRDDVVLATK CFFPMGGQLQ RGSSRRWIMQ AVEGSLRRLG TDRIDLYQIH
KLDWNTDLEE TLGALTDLVR QGKVLYLGSS SFPADWIVEA QWTAARRGGE RFVCEQPQYS
VFARSVEQAV LPACQRHRMA VIPWSPLAGG WLTGKYQRGQ EPPTGSRYDP DSPFMQGTVS
SAAERSSPVR FDAVDALRGV ADQAGITLTE LAMAFVANHP AITSTIIGPR TMKHLEDALN
AADVDLDEDV LDAIDKIVPP GTDMPGIDHF TQHPALLPAA RRRT