Gene Franean1_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4612 
Symbol 
ID5672957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5496797 
End bp5497819 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content70% 
IMG OID641243473 
Productaldo/keto reductase 
Protein accessionYP_001508889 
Protein GI158316381 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.593922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTACC GCGTACTTGG CCGCACCGGC GTGCGGGTCT CCCCGCTCTG CCTGGGTGCG 
ATGATGTTCG GGACCTGGGG CAACCAGGAC CATGACGACT CAATAAAGAT CATTCACAGG
GCGTTGGACG CCGGCGTCAA CTTCGTCGAC ACCGCGGATG TCTACTCCGC CGGCGAGTCC
GAGGAGATCG TCGGTAAGGC GCTGGCCGGC CGCCGCGACG ACGTCGTGCT CGCAACCAAG
CTGTACATGC CGATGGGACC GGGGCCCAAC CGGCGCGGCC TGTCACGGCG CTGGATCGTC
ACCGAGGTGG AGAACAGCCT GCGGCGGCTC GGCACCGACT GGATCGACCT CTATCAGGTG
CATCGTCCCG ACCCGTCGAC CGACATCGAC GAGACGCTCG GCGCGCTCAC CGACCTCGTC
CGGGCCGGCA AGATCCGCTA CTTCGGCAGC TCGACGTTCC CGGCGCACGA GGTGGTCGAG
GCACAGTGGG TGGCCGAGCG CCGCAACCGC GAGCGGTTCG TCACCGAGCA GCCGCCGTAC
TCGCTGCTCG TCCGTGGGAT CGAGGCCGAC CTGCTGCCCG TGGCGCAGAA GTACGGACTC
GGGGTGCTGC CGTGGAGCCC GCTGGCCGGC GGCTTCCTGT CCGGGCGCCA CACCCGCGAC
GGCGGCGAGG TCACGAGCAC CCGGATGTCC CGGGTGCCGA ACCGGTTCGA TCTGTCCGTG
CCCGCGAACC AGCGCAAGGT CGAGGCCGCG ATCGCGTTCG CCGACCTCGC CGCCGAGGTG
GGCGTCACGC TGATCGAGCT GGCGCTGGCG TTCGTGCTGC GCCACCCGGC GGTGACGTCG
GCGATCATCG GCCCGCGCAC GATGGAGCAC CTGGAAAGCC AGCTCACCGG CGGCGCCGTC
ACCCTGGACG AGGCGACGCT GGACCGGATC GACGAGATCG TCCCGCCTGG CGTCAACGTC
AACCCGGAGG ACGCCGGGTA TGTCCCGCCG TCCCTGGCCC AGCCCGCGCT CCGCCGGCGC
TGA
 
Protein sequence
MDYRVLGRTG VRVSPLCLGA MMFGTWGNQD HDDSIKIIHR ALDAGVNFVD TADVYSAGES 
EEIVGKALAG RRDDVVLATK LYMPMGPGPN RRGLSRRWIV TEVENSLRRL GTDWIDLYQV
HRPDPSTDID ETLGALTDLV RAGKIRYFGS STFPAHEVVE AQWVAERRNR ERFVTEQPPY
SLLVRGIEAD LLPVAQKYGL GVLPWSPLAG GFLSGRHTRD GGEVTSTRMS RVPNRFDLSV
PANQRKVEAA IAFADLAAEV GVTLIELALA FVLRHPAVTS AIIGPRTMEH LESQLTGGAV
TLDEATLDRI DEIVPPGVNV NPEDAGYVPP SLAQPALRRR