Gene Franean1_4718 details


Gene Information

Locus tag: Franean1_4718
Symbol:
ID: 5673060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5632068
End bp: 5633762
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 68%
IMG OID: 641243575
Product: 3-ketosteroid-delta-1-dehydrogenase
Protein accession: YP_001508991
Protein GI: 158316483
COG category: [C] Energy production and conversion
COG ID: [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit
TIGRFAM ID:
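
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5632068
end_bp = 5633762

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1695 564
```

Both values agree with the Gene Length and Protein Length fields listed in the record.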


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.329428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGAAGT CGGATGCCAG CTACGACCAT GCCGTTGACG TCATTGTCGT GGGATCGGGC 
GGTGGTCTGT GCGGCGCCGT GGCCGCGGCG GCGAGCGGCC TGGACACGCT GGTGATCGAG
AAGCAGCCGA TGATCGGCGG GTCCACGGCC ATGTCAGGCG GCGTGCTGTG GCTGCCGGAC
AACCCGCTCA TGCAGGCCGA CGACGTTCCC GACTCCCTTG AGGACGCGCT GGCGTACTTC
GACTCCGTTG TCGGAGACGT CGGTCCCGCG TCGTCGCACC AGCGTCGGGT CGCCTACGTC
ACCGAAGGCG CCACCATGGT CCGCTTCCTG CAGAACCTGG GCATGCGGTT CGAGCGGTGC
GAGGGCTACA GCGACTACTA CGCCGGGGTG GCGGGTATCC GCGGCGGCAG CGCCCGCGGG
CGCTCCCTCG AACCCGCGGT GACCGACGGC AAACAGCTCG GGCCGTGGTT CGACAAGCTG
ATGCCGGGCA TGACGATCGC GCTCGGGATC GTCGTGATGA CACGCGAGGC GTCCGGACTG
CAGATGGTCA AGCGGCGACC GAAAGCCATG CGCACCGCCG CCCGCGTCGG GGTGCGCACG
GCCATCGGCC GGCTACGGCG CCAGACCCGC CTGGCCAACG GCGGGGCACT GATCGCCCAG
ACCCTGACGG CCGCACTCGC CGCCGGGGCG TCGATTTGGA CGAACACCGG ACTGGTCGAC
CTGATCACCG AGGACGGCCG GGTCGTCGGC GTCGTCGCCG ACCGGGACGG CCACACCATC
CGGATCCGCG CCCGCCACGC CGTGCTGTTG AGCTCAGGTG GTTTCGGCTG CAACCCCGAG
ATGCGCAAGC GGTACTCGAA GCAGCCGAAC GACGGAACGT GGACCAGTGC CAACCCGGGT
GACACCGGCG AGGCGATCGA GGCCGCCATG CGCCTGGGCG CCGCCGTCGA CTTCATGGAC
GAGGCCCTGT GGATCCCGGC CTCCATCCAG CCCGGCGGAC GTCCGAGCAT GCACAACGGC
GAACGATGCA AACCAGGTTC GATCATCGTC GACCGGGCCG GCCGCCGGTA CTTCAACGAA
GCGGTCTCCT ACATGGAAGC CGGCCGGCAG ATGTACGCCC ACAACGTCGA CGGTGAATCC
ATCCCGAGCT GGCTCGTCAT GGACTCCCGC CACCGCGCCC GCTACCTGTT CGCGTTCCGT
CCCAACACTC CCGAGGAATG GCTCACCAGC GGCTACATGA AGAAAGCCGA CACGCTCGAC
GAGCTCGCGC GGGCGTGCGG CATCGACCCC GCGGGGCTGG CCACCACGGT GGCCCGGTTC
AACACGTTCG CGGAGCAGGG CACGGACCCC GACTTCCACC GCGGCGAAGG CGCTCATGAA
AAGTACCAGG GCGACTACGG CAACAAACCG AACCCGTCGC TCGCGCCCGT CGAGAAGGCA
CCATTCTACG CCGTCGAACT CTATCCAGGC GACGTCGGCA CCAGCGGCGG TCTCCTGTGC
GACGAACATG CTCGTGTGCT CGACACCAAC CACGACCCGA TCCCCGGCCT CTACGCCGCC
GGGAACTGCA CCGCCTCGGT GATGGGCCGC ACCTATCTCG GTGCGGGCGC CAGCATCGGC
AACAGCTTCG TGTTCTCCTA CATCGGCATG AAACACGCCG CACACGCCGC CTCCGGGGAC
CGAGCGGTCA CGTGA
 
Protein sequence
MSKSDASYDH AVDVIVVGSG GGLCGAVAAA ASGLDTLVIE KQPMIGGSTA MSGGVLWLPD 
NPLMQADDVP DSLEDALAYF DSVVGDVGPA SSHQRRVAYV TEGATMVRFL QNLGMRFERC
EGYSDYYAGV AGIRGGSARG RSLEPAVTDG KQLGPWFDKL MPGMTIALGI VVMTREASGL
QMVKRRPKAM RTAARVGVRT AIGRLRRQTR LANGGALIAQ TLTAALAAGA SIWTNTGLVD
LITEDGRVVG VVADRDGHTI RIRARHAVLL SSGGFGCNPE MRKRYSKQPN DGTWTSANPG
DTGEAIEAAM RLGAAVDFMD EALWIPASIQ PGGRPSMHNG ERCKPGSIIV DRAGRRYFNE
AVSYMEAGRQ MYAHNVDGES IPSWLVMDSR HRARYLFAFR PNTPEEWLTS GYMKKADTLD
ELARACGIDP AGLATTVARF NTFAEQGTDP DFHRGEGAHE KYQGDYGNKP NPSLAPVEKA
PFYAVELYPG DVGTSGGLLC DEHARVLDTN HDPIPGLYAA GNCTASVMGR TYLGAGASIG
NSFVFSYIGM KHAAHAASGD RAVT
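
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes NCBI translation table 11 (the table named in the record), in which the amino-acid assignments match the standard code and an alternative start codon such as GTG is read as methionine at the first position. The helper names `clean`, `gc_content`, and `translate` are hypothetical and written for this example; they are not taken from any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino acids as in NCBI table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons encode methionine when initiating
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGTCGAAGT CGGATGCCAG CTACGACCAT GCCGTTGACG TCATTGTCGT GGGATCGGGC"
print(translate(first_line))  # prints: MSKSDASYDHAVDVIVVGSG
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.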