Gene Franean1_3929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3929 
Symbol 
ID5672290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4698392 
End bp4699603 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content73% 
IMG OID641242808 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001508225 
Protein GI158315717 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.041846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGG CGGTCATCTG CGAACCGGTG CGCACCGCGG TCGGGCGCTA CGGCGGTGCG 
CTCGCGGCGC TGAGCGCGCA GGCGCTCGGC GCGGCCGTGC TGCGCGGCCT GCTGGACCGG
ACCGGCCTGC GGTCAGCGGA CATCGATGAC GTGATCTTCG GGTCGTGTTA TCCGACGATG
GAGGCACCCG CGCTTGGCCG GGTGGTCGCC CTCGACGCCG GCCTCGACGT CACCGTCGCG
GGCCTGCAGC TCGACCGCCG CTGCGGATCG GGCATGCAGG CGGTCACGAC GGCCGCGATG
CAGGTCCAGA CCGGAGTGGC CGACGTCGTG ATCGCCGGGG GAGCGGAGAG CATGAGCAAC
GCCCCCTTCT ACTCGACCCG GATGCGCCGG GGGTCGGGCG GCGGGGACGT CACGCTGCAC
GACGCCCTGG CCCGCGGCCG GGTGACCGCG GGCGGCGCGA ACTTCCCCGT CCCGGGCGGG
ATGATCGAGA CGGCGGAGAA CCTGCGGAGG GAGTACGGGA TCTCGCGCTC CGAGCAGGAC
GAGTTCGCGC TGCGGTCGCA TGTCCGGGCC GTGGACGCGC AGGCCGCCGG TCGGTTCGCG
GACGAGATCG TGTCGGTGTC GGTGCCGGGT CGCGGCGGTT CGGTCGTCGT GGACGTCGAC
GAGCATCCCC GCGCCGACGC CAGCCTCGAC ACGCTCGCCG CGCTGCGCCC GATCATGGGT
AGGACCGACC CGGAGGCCAC GGTCACTGCG GGCAATTCCA GCGGGCAGAA CGACGCGGCG
TCGGCGTGTG TCGTCACCCA TCCGGAGGCG GCGAGGCGAC TCGGCCTGCG TCCGCTCGGT
CGGCTGGTGA GCTGGGCGGT CGCCGGTGTG GAACCCGCGA GGATGGGGAT CGGACCCGTT
GCGGCCACGG CGAAGGCGCT GGAGCGGGCG AACCTCAAGC TCGCCGACAT CGACCTGATC
GAGCTCAACG AGGCCTTCGC GGCGCAGGTG CTCGCCTGCA CCCGGGAGTG GGGGCTCACG
ACCGCGGACC TGGACCGGCT CAATGTCAAC GGTTCCGGGA TCTCGCTCGG TCATCCCGTC
GCGGCGACCG GTGGCCGGAT CCTCGCGACC CTGCTTCACG AGATGGAGCG CCAGGACGCC
CGGTACGGGC TGGAGACCCT GTGCATCGGC GGCGGCCAGG GGATCACCGC GATCTTCGAA
CGGGTCGGCT GA
 
Protein sequence
MREAVICEPV RTAVGRYGGA LAALSAQALG AAVLRGLLDR TGLRSADIDD VIFGSCYPTM 
EAPALGRVVA LDAGLDVTVA GLQLDRRCGS GMQAVTTAAM QVQTGVADVV IAGGAESMSN
APFYSTRMRR GSGGGDVTLH DALARGRVTA GGANFPVPGG MIETAENLRR EYGISRSEQD
EFALRSHVRA VDAQAAGRFA DEIVSVSVPG RGGSVVVDVD EHPRADASLD TLAALRPIMG
RTDPEATVTA GNSSGQNDAA SACVVTHPEA ARRLGLRPLG RLVSWAVAGV EPARMGIGPV
AATAKALERA NLKLADIDLI ELNEAFAAQV LACTREWGLT TADLDRLNVN GSGISLGHPV
AATGGRILAT LLHEMERQDA RYGLETLCIG GGQGITAIFE RVG