Gene Franean1_3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3572 
Symbol 
ID5671941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4234233 
End bp4235159 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content71% 
IMG OID641242458 
Productshort chain dehydrogenase 
Protein accessionYP_001507878 
Protein GI158315370 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACA CCATTGATCT GACCGGACGA CTCGCCGTGG TGACCGGAGC GAGCAGCGGC 
CTTGGATTGG GTCTGGCGAC CCGCCTGGCC GCGGCCGGCG CCAAGGTTCT CCTGCCGGTC
CGCGATGAGG CCAAGGGCGA GGCCGCACTG AGCCACATCC GCGCCGAAGC GCCCGGTGCG
GACGTGTCGC TCCGCGAACT CGACCTGGCC TCGCTGAAGT CGGTGGAGGC CCTGGGCGAC
ACCCTGAACG CCGAGGGCCG GCCGATCCAC ATTCTGATCA ACAACGCCGG GCTGATGACG
CCTGCCACGC GGCACACCAC CGCCGACGGC CTGGAACTGC AGTTCGGGAC CAACCACATC
GGGCACTTCG CGCTCACCGG CTGGCTGCTG CCGCTGCTGA ACGCCGGCCA CGCCCGGGTG
ACCACGATGA CCAGCAGCGC GGCCCGGCAC GCCAAGCTCA ACTGGGAAGA CCTGCAGAGC
GACCAGGCGT ACGCGCCGAT CCGCGCCTAC AACCAGTCGA AGCTGGCGAA CCTGCTGTTC
GCACTCGAAC TCGACCGGCG CTCCCGGGCC GGGGGCTGGG GGATCGTCAG CAACGCCGCA
CACCCCGGCA CCACCCTGAC CGGCCTGTAC GCCGCCGGAC CCAACCTGGG CCGGGAGAAA
TCCTCGCCGA TCGAGGCCGC CATGAAGCGC CTGGCCCGCT GGGGCGTCCT GGTCCAGGGC
GTCGACCGGG GCCTGCTCCC GGCCCTGTAC GCGGCCACCA GTCCGGACGC CGAGGGCGGC
CACTTCTACG GTCCGGACGG CTTCGGCCAG TTCACCGGCG GTCCGGCCGA GCTGGAGATC
TACCGCCCGG CACGCGACGA GGACGCGGCC ACCAGGCTGT GGGACGTCTC GCAACGTCTC
GCCGGCGTCG AGTTCGCGGC GGTGTGA
 
Protein sequence
MQNTIDLTGR LAVVTGASSG LGLGLATRLA AAGAKVLLPV RDEAKGEAAL SHIRAEAPGA 
DVSLRELDLA SLKSVEALGD TLNAEGRPIH ILINNAGLMT PATRHTTADG LELQFGTNHI
GHFALTGWLL PLLNAGHARV TTMTSSAARH AKLNWEDLQS DQAYAPIRAY NQSKLANLLF
ALELDRRSRA GGWGIVSNAA HPGTTLTGLY AAGPNLGREK SSPIEAAMKR LARWGVLVQG
VDRGLLPALY AATSPDAEGG HFYGPDGFGQ FTGGPAELEI YRPARDEDAA TRLWDVSQRL
AGVEFAAV