Gene Franean1_3017 details

Gene Information

Locus tag: Franean1_3017
Symbol:
ID: 5671399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3551036
End bp: 3552238
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 71%
IMG OID: 641241919
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001507339
Protein GI: 158314831
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
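
The start, end, gene length and protein length listed above can be cross-checked with a little arithmetic. The sketch below is only illustrative: the function name cds_lengths is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon (the gene sequence in this record ends in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from this record
print(cds_lengths(3551036, 3552238))  # (1203, 400)
```

Under these assumptions the result agrees with the listed gene length of 1203 bp and protein length of 400 aa.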


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.712178
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTTCGTCG TAGCGAAGGG TTCGATCCGC GACGTCGTGT TCGTCGACGG AGTGCGCACC 
CCGTTCGGCA AGGCAAAGGG CGTCTATGCC GAGACCCGGG CGGACGACCT GGTGATCCGG
GTAATCCGCG AGCTGATCCG GCGCAACCCC GGGCTGCCGC CGGAGCGGAT CGACGAGGTC
GCGGTCGCCG CGACCACCCA GATCGGCGAC CAGGGCCTGA CCATTGGCCG GGTCGCCGGG
ATCCTGGCCG GGCTGCCCGA GTCGGTGCCC GGCTACGCCA TCGACCGGAT GTGCGCAGGT
GCCGTGACCG CGGTGACCAC GACCGCGTCG GCGATCGCCG TCGGCGCCTA CGACGTGGCG
ATCGCCGGGG GCGTCGAGCA CATGGGCCGC CATCCGATGG GCGAGGGCGC CGACCCGAAC
CCCCGGTTCG TCTCCGAGCG GCTCGTCGAC CCGTCCGCGC TGGTCATGGG CATGACCGCC
GAGAACGTGC ACGACCGCTA CCCCGGCATC ACCCGGGCCC GGGCGGACGC CTACGCCCTG
GCGTCCCAGC AGAAGGTCGC CAAGGCCTAC GCCGACGGGA AGATCCAGCC CGACCTCGTG
CCCGTCGCGG CCCGGCACAC CGACAGTGGG TGGGAACTGG TCACCGCCGA CGAACCGCCG
CGGCCCGACA CGACACTCGA GGGCCTCGCC GGACTGCGCA CCCCGTTCCG CCCGCACGGG
CGGGTCACCG CCGGCAACGC TGCGGGCATC AACGACGGTG CGACCGGCTG CGTGCTGGCC
GCCGCCGAGG TAGCCGCCGA ACTCGGCCTG GAGCGGAGGA TGACCCTCGT CGGGTTCGGG
TTCGCCGGGG TGGCGCCAGA AGTGATGGGC GTCGGGCCAA TCCCGTCCAC GGAGAAGGCG
CTGGCCCGTA CCGGCCTGAG CATCGACGAC ATCGGACTGT TCGAGCTGAA CGAGGCCTTC
GCGGTGCAGG TGCTGGCCTT CCTGGACCAC TTCGGCATCG CCGACGACGA CCCGCGGGTG
AACCAGTACG GTGGCGCGAT CGCCCTGGGA CACCCGCTCG CCTCCAGCGG GGTCCGGCTG
ATGACACAAC TGGCCCGGCA GTTCGAGGAA CACCCCGATG TCCGCTACGG CCTGACCGCG
ATGTGCGTTG GCTTCGGCAT GGGCGCCACC ACCATCTGGG AGAACCCGCA CCACATGGCC
TGA
 
Protein sequence
MFVVAKGSIR DVVFVDGVRT PFGKAKGVYA ETRADDLVIR VIRELIRRNP GLPPERIDEV 
AVAATTQIGD QGLTIGRVAG ILAGLPESVP GYAIDRMCAG AVTAVTTTAS AIAVGAYDVA
IAGGVEHMGR HPMGEGADPN PRFVSERLVD PSALVMGMTA ENVHDRYPGI TRARADAYAL
ASQQKVAKAY ADGKIQPDLV PVAARHTDSG WELVTADEPP RPDTTLEGLA GLRTPFRPHG
RVTAGNAAGI NDGATGCVLA AAEVAAELGL ERRMTLVGFG FAGVAPEVMG VGPIPSTEKA
LARTGLSIDD IGLFELNEAF AVQVLAFLDH FGIADDDPRV NQYGGAIALG HPLASSGVRL
MTQLARQFEE HPDVRYGLTA MCVGFGMGAT TIWENPHHMA