Gene Franean1_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1037 
Symbol 
ID5669451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1216811 
End bp1218598 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content71% 
IMG OID641239966 
Product3-hydroxybutyryl-CoA dehydrogenase., 3-hydroxyacyl-CoA dehydrogenase 
Protein accessionYP_001505399 
Protein GI158312891 
COG category[I] Lipid transport and metabolism 
COG ID[COG1250] 3-hydroxyacyl-CoA dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.533481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCAGG AGAGCACCCT CGGTGAGCTG ACGAGGATCG GGGTCATCGG GCTCGGCACC 
ATGGGCGCCG GCATCGCGGA GGTGCTGGCC CGTGCTGGTA TCGACGTGGT GGGCGTCGAA
CGGGACGAGG CCGGGCTGGC CCGCGGTCGG GGCCATCTGG AGCACTCCAC GCATCGGGCG
GTCGAGCGCG GCAAGCTGAC CCCGGCCGAG CGCGAGGAGC TGCTCGGCCG CATCAGCAGT
GGCGTCGACC TGACGGCCGT CGCGGACTGT CCGCTGATCA TCGAGGCCAT CCACGAGCGG
CTGGACGCCA AGCAGGAGCT CATCGCCCGT CTCGACGAGA TCTGTCCCGC CGAGACGATC
ATCGCCAGCA ACACCAGCTC ACTGTCGGTC ACCGAGCTTG CCGCCGGCAC CCACCGGCCG
TCCAGGGTGC TCGGCACGCA CTGGTTCAAC CCGGCGCCGG TGATGGAACT TGTTGAGGTG
GTCGGGACGG TCGTCACGGA TCCTGTCGTC CTTGACGACG TCCGCGACCT GGTCGGTCGC
GTCGGGAAGA TTGCGGTCAG GGCAGGCGAC CGGGCCGGTT TCATCGCCAA CGCGCTGCTG
TTCGGCTACC TGAACAACGC GGTGCGGATG CTGGAGGCGC ACTACGCGAC CCGGGAGGAC
ATCGACGCGG CGATGCGCCT CGGCTGCGGT CACCCGATGG GGCCGCTGGC CCTGCTCGAC
CTGATCGGGC TCGACTCCGC GTACGAGATC CTCGACTCGA TGTACCACCA GTCCCGCGAT
CACCTGCATG CTCCCGCGCC GCTGCTCAAG CAGCTCGTCA CGGCCGGAAT GCTGGGCCGC
AAGACCGGTC GGGGCTTCTA CACCTACGCC GAGGCCGACT CGCCGCACGT CGTCGACGAC
GCCGCCCCGA GCGTGGTCGC CGGGGTCGAA CCCAGGCCGG TGCGCTCCGT CGGCGTCGTG
GGCACCGGGA CGATGGCCTG CGGCATCGTC GAGGTCCTGG CCCGCTCCGG TTTCGACGTG
GTGTTCCGCG CGCGGTCCGA CGACAAGGTG TCGGCCGTCC GCAAGAGACT GGTGGAGTCG
CTGGAGCGAG GCGTCCAGCG TGGACGGCTG TCCGCCGAGG AGCGTGACGC CGCGCTCGCC
CGGGTGACGG GGACGACCGA CCTCGACGCG CTCGCCGGCT GCGATCTCGT CGTCGAGGCG
GTCGTCGAGG AGCTTTCGGT GAAGCGGGCG CTGTTCGCCG CGCTCGACGA GGTGATGAAG
CCGGGGGCGG TGCTCGCGAC CACGACGTCA TCCCTGCCGG TGGTCGAGTG CGCCACGGCG
ACGACCCGTC CGCGGGACGT CGTCGGGATG CACTGGTTCA ACCCGGCGCC CGCGATGCGC
CTGATCGAGG TCGTGCCTAC GGTGCTGACC GCTCCCGATG TCACCGCGAC GGTGCTCGCG
GTCAGCCGGG CCGCCGGCAA GCACCCGGTG GTCTGCGCGG ACCGCGCCGG GTTCATCGTG
AACGCGCTGC TCTTCCCGTA CCTCAACGAC GCCGTGCGGA TGCTGGAGGC GCACTACGCC
AGCGTCGACG ACATCGACAC CGCGATGACG GTGGGCTGCG GGCATCCGAT GGGCCCGTTC
GCGCTGGCGG ACGTGGTCGG TCTCGACGTC ACACTGGCGA TCCAGCGGAC GTTGTACCTG
GAGTTCCGGG AGCCGGGCTA TGCCCCGGCC CCCCTGCTCG AGCACCTCGT GAAGGCGGGC
TACCTGGGGC GCAAGACCGG CCGGGGCTTC CGCGACCACT CGCGCTGA
 
Protein sequence
MGQESTLGEL TRIGVIGLGT MGAGIAEVLA RAGIDVVGVE RDEAGLARGR GHLEHSTHRA 
VERGKLTPAE REELLGRISS GVDLTAVADC PLIIEAIHER LDAKQELIAR LDEICPAETI
IASNTSSLSV TELAAGTHRP SRVLGTHWFN PAPVMELVEV VGTVVTDPVV LDDVRDLVGR
VGKIAVRAGD RAGFIANALL FGYLNNAVRM LEAHYATRED IDAAMRLGCG HPMGPLALLD
LIGLDSAYEI LDSMYHQSRD HLHAPAPLLK QLVTAGMLGR KTGRGFYTYA EADSPHVVDD
AAPSVVAGVE PRPVRSVGVV GTGTMACGIV EVLARSGFDV VFRARSDDKV SAVRKRLVES
LERGVQRGRL SAEERDAALA RVTGTTDLDA LAGCDLVVEA VVEELSVKRA LFAALDEVMK
PGAVLATTTS SLPVVECATA TTRPRDVVGM HWFNPAPAMR LIEVVPTVLT APDVTATVLA
VSRAAGKHPV VCADRAGFIV NALLFPYLND AVRMLEAHYA SVDDIDTAMT VGCGHPMGPF
ALADVVGLDV TLAIQRTLYL EFREPGYAPA PLLEHLVKAG YLGRKTGRGF RDHSR