Gene Franean1_3320 details

Gene Information

Locus tag: Franean1_3320
Symbol:
ID: 5671692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3932129
End bp: 3933823
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 70%
IMG OID: 641242209
Product: 3-ketosteroid-delta-1-dehydrogenase
Protein accession: YP_001507629
Protein GI: 158315121
COG category: [C] Energy production and conversion
COG ID: [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit
TIGRFAM ID:
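
The start, end, gene length and protein length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3932129, 3933823)
print(length)                     # 1695
print(protein_length_aa(length))  # 564
```

Both values agree with the Gene Length and Protein Length fields listed in the record.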


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGAAGT CGGATGCCAG CTACGACCAT GCCGTTGACG TCATTGTCGT GGGATCGGGC 
GGTGGTCTGT GCGGCGCCGT GGCCGCGGCG GCGAGCGGCC TGGACACGCT GGTGATCGAG
AAGCAGCCGA TGATCGGCGG GTCCACGGCC ATGTCAGGCG GCGTGCTGTG GCTGCCGGAC
AACCCGCTCA TGCAGGCCGA CGGCGTTCCC GACTCCCTTG AGGACGCGCT GGCGTACTTC
GAGTCGGTGG TCGGGGACGT CGGACCCGCC TCGTCGCGGG AGCGGCGGCT CGCCTACATC
GTCGAAGGCT CCAACATGGT CCGCTTCCTG CAGGGCCTGG GCCTGCGTTT CGAGCGGTGC
GAAGGGTACA GCGACTACTA CGCGCAGGTG GCGGGAATTC GTGGCGGCAG CGCCCGCGGC
CGCTCCATCG AACCCGCGGT GACCGATGGC AGGAAGCTCG GACCGTGGTT CGCGAAGCTG
ATGCCGGGCA TGACGGCCGC GCTCGGAATC GTCGTGATGA CCCGCGAGGC GTCCACTCTG
CAGCTGATCA AGCGGCGGCC GAAAGCCATG CGCACCGCCC TCCGGGTGGG GATGCGGACG
GCCATGGGCA GGCTCAGGAG GCAGACCCGG CTGGCCAACG GCGCGGCACT GATCGCGCAG
GCCCTGGAGG CGGCGCTGGC GGCCGGGGCG ACGGTCTGGA CAGACACCGG GCTGGTCGAC
CTGATCGTCG AGGACGGCCG GGTGGCCGGC GTCGTGGCCA GTCGGGACGG GCAGACCGTC
CGGATCCGCG CCCGCCGCGG CGTGCTGTTG AGCTCGGGCG GGTTCGCCCG CAACGCCGAG
ATGCGCAGAC GCTACTCGAA GCAGCCGAAC GAGGGGTCGT GGACCATCGC CAACCCCGGG
GACACGGGCG AGGCGATCGA GGCCGCCCAG CGGGTCGGCG CCGCCGTCGA CTTCATGGAC
GAGGCGTTGT GGATCCCGGC CTCCATCCAG CCCGGCGGAC GTCCGAGCCT GCACACCGGA
GAGCGGAGCA AGCCGGGCTC GATCATCGTC GACCGGGCCG GTCGGCGGTA CTTCAACGAG
GCGGTCTCCT ACATGGAGGC CGGGCGGCAG ATGTACGCGC ACAACACGGC CGGTGAGTCC
ATTCCGAGCT GGCTGGTCAT GGACTCCCGC CACCGCGGTC GTTATCTGTT CGCGTTCCGT
GCCAACACCC CCGAAGAGTG GATCACCAGC GGCTACATGA AGAAGGCCGA CACGGTGGAG
GAACTGGCGC GGGCGTGCGG CATAGACCCG GCCGGTCTGG CCGCCACGGT CGAGCGGTTC
AACGGGTTCG CGAAGCAGGG CACCGACCCC GACTTCCACC GCGGCGAAGG GGCTCACGAG
CGGTACCAGG GCGACTACGG CAACCAGCCG AACGCCTCGC TCGCGCCCGT CGAGAAGGCG
CCCTTCTACG CGGTCGAGCT CTACCCGGGT GACGTGGGGA CGAGCGGCGG GCTGCTCTGT
GACGAGTTCG CCCGTGTGCT CGACACCAAC CATGAGCCCA TCCCGGGTCT GTACGCTGCC
GGGAACTGCA GCGCGTCGGT GATGGGACGC ACCTATCTGG GGGCGGGCGC CAGCATCGGG
AACAGCTGCG TCTTCTCCTA CATCGGCATG AAGCACGCCG CGCACGTCGT CTCCGGCGAC
CCGGTGGCCA CGTGA
 
Protein sequence
MSKSDASYDH AVDVIVVGSG GGLCGAVAAA ASGLDTLVIE KQPMIGGSTA MSGGVLWLPD 
NPLMQADGVP DSLEDALAYF ESVVGDVGPA SSRERRLAYI VEGSNMVRFL QGLGLRFERC
EGYSDYYAQV AGIRGGSARG RSIEPAVTDG RKLGPWFAKL MPGMTAALGI VVMTREASTL
QLIKRRPKAM RTALRVGMRT AMGRLRRQTR LANGAALIAQ ALEAALAAGA TVWTDTGLVD
LIVEDGRVAG VVASRDGQTV RIRARRGVLL SSGGFARNAE MRRRYSKQPN EGSWTIANPG
DTGEAIEAAQ RVGAAVDFMD EALWIPASIQ PGGRPSLHTG ERSKPGSIIV DRAGRRYFNE
AVSYMEAGRQ MYAHNTAGES IPSWLVMDSR HRGRYLFAFR ANTPEEWITS GYMKKADTVE
ELARACGIDP AGLAATVERF NGFAKQGTDP DFHRGEGAHE RYQGDYGNQP NASLAPVEKA
PFYAVELYPG DVGTSGGLLC DEFARVLDTN HEPIPGLYAA GNCSASVMGR TYLGAGASIG
NSCVFSYIGM KHAAHVVSGD PVAT
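
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any computation such as GC content. The following is a minimal sketch in plain Python with no external libraries; the helper names `clean_sequence` and `gc_percent` are hypothetical and are not taken from the source database.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTCGAAGT CGGATGCCAG CTACGACCAT GCCGTTGACG TCATTGTCGT GGGATCGGGC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(seq[:3])          # GTG
print(gc_percent(seq))  # GC percentage of this first line only
```

Applying the same two helpers to the full gene sequence gives a figure that can be compared with the GC content field, which appears to be rounded to a whole percent. The gene begins with GTG, an alternative start codon under translation table 11, which is consistent with the protein sequence beginning with M.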