Gene Franean1_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3657 
Symbol 
ID5672023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4333845 
End bp4334990 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content72% 
IMG OID641242540 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001507960 
Protein GI158315452 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.838334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGACG CGGTGATCGT GGAGGCCCTC CGCACGCCGA CCGGCAAGCG CAATGGTTCC 
CTGTCGGGTG TGCATCCGAC GGATCTTTCG GCGCACGTGC TGGCGAGCCT CGCCGAGCGG
GCCGGCGTCG ATCCGGCTCT GGTGGACGAC GTGGTGTGGG GCTGTGTCGG CCAGGTGGGC
GAGCAGACCT TCGACATCGC CCGCAACGCC GCGCTCGGTG CCGGCTGGCC GGAGACCGTC
ACCGGCGTGA CCGTCGACCG CCAGTGCGGC TCGGGCCAGC AGGCGGTGCA CTTCGCCGCC
GCCGGGCTGA TCGCGGGGCA GTACGACGTG GTCGTCGCCG GCGGCGTCGA GTCGATGTCC
CGGGTGCCGA TGGGCTCCTC CCTGATGGAC AAGGTCCCCT TCGGCGAGCG GTACCTGGCC
CGCTACAACG GCGCCTTCCC GGACCAGGGC ATCGGCGCCG AGATGATCGC GGAGCGCTGG
GGCCTGTCGC GGACCCAGCT CGACGAGTTC GCGCTGCTCT CCCACGAGCG GGCGGCGGCG
GCGCAGGACG ACGGCCGCTT CGACGAGCAG ATCATCCCGG TCACCCTGAC CGACGGCACC
GTGGCCAGCA AGGACGAGGG CATCCGCCGC GGCGGCACGG TCGAGGGCCT CGCCGGGCTC
CGGACGGCTT TCAAGCCGGA CGGCGTGATC ACAGCGGCGA ACTCGTCCCA GATCTCCGAC
GGCTCGTCGG CACTGCTGAT GACGACCAGT GAGAAGGCCG CCGAGCTGGG CCTGTGCCCG
ATCGCCCGGG TGCACACCGC CGTCCTCGCT GGCACCGACC CGGTGATCAT GCTGACCGCG
CCGATCCCCG CCACCCAGAA GGTGCTGGCG AAGTCCGGCC TGAAGCTCGA CGACATCGGT
GCCTTCGAGG TCAACGAGGC GTTCGCCTCC GTGCCCCTGG CCTGGCTGGC CGACATCGGC
GCCGACCCGA AGGCCCTGAA CCCGAACGGC GGCGCGATCG CCCTCGGCCA CCCGCTCGGC
GGCTCCGGCA CCCGGCTCAT GACCACCCTG ATCTACCACA TGCGCGACAA CGGGATCCGC
TACGGCCTGC AGACCATGTG CGAGGGCGGC GGCCAGGCGA ACGCCACCAT CCTCGAGCTG
CTCTGA
 
Protein sequence
MRDAVIVEAL RTPTGKRNGS LSGVHPTDLS AHVLASLAER AGVDPALVDD VVWGCVGQVG 
EQTFDIARNA ALGAGWPETV TGVTVDRQCG SGQQAVHFAA AGLIAGQYDV VVAGGVESMS
RVPMGSSLMD KVPFGERYLA RYNGAFPDQG IGAEMIAERW GLSRTQLDEF ALLSHERAAA
AQDDGRFDEQ IIPVTLTDGT VASKDEGIRR GGTVEGLAGL RTAFKPDGVI TAANSSQISD
GSSALLMTTS EKAAELGLCP IARVHTAVLA GTDPVIMLTA PIPATQKVLA KSGLKLDDIG
AFEVNEAFAS VPLAWLADIG ADPKALNPNG GAIALGHPLG GSGTRLMTTL IYHMRDNGIR
YGLQTMCEGG GQANATILEL L