Gene Franean1_3907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3907 
Symbol 
ID5672268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4673296 
End bp4674441 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content70% 
IMG OID641242786 
ProductPropanoyl-CoA C-acyltransferase 
Protein accessionYP_001508203 
Protein GI158315695 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.102476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGATG TCGCGATCAT CGGGGTGGGC ATCCACCCCT TCGGGCGCTT CGGCAACAAG 
TCGGCGATCC AGATGGGCGC CGAGGCCGTC CGCTCCGCGC TCACCGACGC CGGCCTGGAG
TGGAAGCAGG TCCAGTTCGC CTTCGGCGGG AGCTGTGAGG TGGACAACCC CGACGCGGTC
GTGGGCCTGC TCGGCCTCAC CGGCATCCCG TTCATGGACG TCTACAACGG CTGCGCGACG
GCCGCGACCG CGCTGGAGCT GACCGCCGAC GCGATCCGGT ACGGCAAGTA CGACATCGGC
CTCGCGGTCG GCATGGACAA GCACGCCTTC GGCGCCTTCA CGGCCGATCC CGTCCACTAC
AGCGCCCCGC CGTGGTACGG CGACATCGGT CACTTCCTGA CGACGAAGTT CTTCGGGATG
AAGATCAACC GCTACATGCA CGACCACGGG ATCTCGCACC GCACGCTGGC CCGGGTGGCG
GCGAAGAACT ACCGCAACGG CGTGCTGAAC CCCAACGCCT TCCGTCGCAA GGCGCTGACG
GAGGACGAGA TCCTCACCTC CCAGATGCTC AACTACCCGC TGACGAAGTA CATGTTCTGC
AGCCCGGACG AGGGCGCCGC GGCGATCATC CTGTGCCGCG CCGACATCGC CCGCCAGTAC
ACGTCCAACC CCATCTACCT GCGGGCGAGC ACGCTGCGGA CCAGGACGTA CGGCGCGCAC
GAGGTGCACA GCTCCTGGGC CGCGGTGGAG CACGCCGAGG CCCCCACCGT GTTCGCGTCG
CGGGCGGCCT ACGAGACCGC GGGCATCGGC CCGGAGGACG TCGACGTCAT CCAGATCCAG
GACACCGACT CCGGTGCCGA GATCATGCAC ATGGCGGAGA ACGGCTTCTG CGCGGACGGC
GACCAGGAGA AGCTGCTCGC GGAGGGCGCC ACCGAGATCG GCGGCCGGCT GCCGGTCAAC
ACCGACGGCG GGCTGATCGC CAACGGCGAG CCCGTCGGCG CCTCGGGTCT GCGCCAGATT
CACGAGCTGG TCCTGCAGCT GCGCGGGCAG GCCGGCGACC GGCAGGTCCC CGGCAACCCG
CGGGTCGGCT ACGCCCAGCT CTACGGCGCG CCCGGCACGG CCGGCGTGTC GATCGTGACC
ACCTGA
 
Protein sequence
MEDVAIIGVG IHPFGRFGNK SAIQMGAEAV RSALTDAGLE WKQVQFAFGG SCEVDNPDAV 
VGLLGLTGIP FMDVYNGCAT AATALELTAD AIRYGKYDIG LAVGMDKHAF GAFTADPVHY
SAPPWYGDIG HFLTTKFFGM KINRYMHDHG ISHRTLARVA AKNYRNGVLN PNAFRRKALT
EDEILTSQML NYPLTKYMFC SPDEGAAAII LCRADIARQY TSNPIYLRAS TLRTRTYGAH
EVHSSWAAVE HAEAPTVFAS RAAYETAGIG PEDVDVIQIQ DTDSGAEIMH MAENGFCADG
DQEKLLAEGA TEIGGRLPVN TDGGLIANGE PVGASGLRQI HELVLQLRGQ AGDRQVPGNP
RVGYAQLYGA PGTAGVSIVT T