Gene Franean1_3588 details


Gene Information

Locus tag: Franean1_3588
Symbol:
ID: 5671957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4250185
End bp: 4251327
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 73%
IMG OID: 641242474
Product: Propanoyl-CoA C-acyltransferase
Protein accession: YP_001507894
Protein GI: 158315386
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
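
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4250185, 4251327)
print(length, protein_length(length))  # prints: 1143 380
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.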


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.193067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.417277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGCG TACTGATCGC CGGCGCCGCG CGGACGTCGT TCGCGCGGTC TGAGCGCAGC 
GGGCGCCAGC ACGCTGTGGC TGCGGTGCAC GCGGCGCTCG AGGACGCGGG CCTGCCTTGG
TCGCGGGTCC GGGCCGCGTT CGGCGGCAGC GACGCCGCCG GGCTCGCGGA CACCCTCGTC
GCCGATCTGG GACTGACCGG CGTGCCGTTC CTCAACGTCA AGAACGGCTG CGCGACCGGG
GGAAGCGCGC TGGTGTCGGC GGTGAACGCG ATCCGGTCCG GTATGGCCGA CGTCGTGCTC
GCCGTCGGCT TCGACAAGCA TCCTCGGGGG GCCTTCGCCC CGATGCCCGA GGACTGGGGG
CTCGACGCCG CCTACGGCCG CAACGGGCTC ATGGTGACCA CGCAGTTCTT CGGGATGAAG
ATCCAGCGGT ACGCCCATGA CCACGGCGTC ACCGCGCGCA CCCTCGCGCG GGTGGCCGAG
AAGGCCTACC GCAACGGCGC GCTGACGCCG CAGGCCTGGC GCCGCACGCC GCTCACCGCC
GACGAGGTGC TCGCGTCCGG CATGGTGAAC GACCCGCTCA CCCGCTACAT GTTCTGCTCG
CCGGGGGAGG GCGCCGCCGC GGTTGTGCTC TGCACGCCGT CCGTGGCCGC CGAGCTGGCC
AACCGGCCGG TCACGCTGCG GGCCGCCGAG GTCCGCACCC GCCAGTTCGG CACCTTCGAG
GTATTCAGTC CCTGGATCCC GGCCGGCGAG CTGACCAGTG TCAGCCGCGC CGCCGCCGCC
GCGGCCTTCG AGGCCGCTGG CGTCGGACCG GACGAGATCG ACGTCTGCCA GCTCCAGGAC
ACCGAGGCCG GAGCGGAGGT CATGCACATG GCCGAGTGCG GGTTCTGCGC CGATGGCGAT
CAGGACAAGC TCATTGCCGA GGGCGCCACC GACATCGGCG GGTCGCTGCC GGTGAACACC
GACGGCGGAT GCATCGCCAA CGGCGAGCCG ATCGGGGCAT CCGGCCTGCG CCAGGTTGTC
GAGGTGGTCA CGCAGCTGCG CGGGCAGGCC GGACAGCGCC AGGTCCCCGG GACACCGCGG
CTCGGCTTCA CCCACGTCTA CGGTGCGCCG GGTGTCAGTG CCTGCACCGT GCTGTCGGTC
TGA
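
The GC content field in the Gene Information section is a percentage of G and C bases. As a rough sketch of how such a figure could be recomputed from a sequence laid out in blocks of ten as above, the hypothetical helper `gc_content` below strips whitespace and counts bases. It is shown on the first block of the gene sequence only; how the database rounds its reported value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First ten bases of the gene sequence: 6 G + 2 C out of 10.
print(gc_content("GTGAGCGGCG"))  # prints: 80.0
```

Passing the full gene sequence as one string (line breaks and spaces included) gives the figure for the whole gene.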
 
Protein sequence
MSGVLIAGAA RTSFARSERS GRQHAVAAVH AALEDAGLPW SRVRAAFGGS DAAGLADTLV 
ADLGLTGVPF LNVKNGCATG GSALVSAVNA IRSGMADVVL AVGFDKHPRG AFAPMPEDWG
LDAAYGRNGL MVTTQFFGMK IQRYAHDHGV TARTLARVAE KAYRNGALTP QAWRRTPLTA
DEVLASGMVN DPLTRYMFCS PGEGAAAVVL CTPSVAAELA NRPVTLRAAE VRTRQFGTFE
VFSPWIPAGE LTSVSRAAAA AAFEAAGVGP DEIDVCQLQD TEAGAEVMHM AECGFCADGD
QDKLIAEGAT DIGGSLPVNT DGGCIANGEP IGASGLRQVV EVVTQLRGQA GQRQVPGTPR
LGFTHVYGAP GVSACTVLSV
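
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below makes two assumptions that should be checked against the NCBI genetic code documentation: that translation table 11 assigns amino acids to codons in the same way as the standard code, and that the first codon (here GTG) is read as methionine because it acts as the initiator. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, assumed
# to apply to translation table 11 as well); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon; whitespace is ignored."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is always translated as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("GTGAGCGGCG TACTGATCGC CGGCGCCGCG"))  # prints: MSGVLIAGAA
```

The output matches the first block of the protein sequence listed above.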