Gene Franean1_4577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4577 
Symbol 
ID5672924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5459955 
End bp5461121 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content75% 
IMG OID641243440 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001508856 
Protein GI158316348 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.4334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.466675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGG CCGTGATCGT CAGTGCGGTG CGCACGCCCA TCGCGACGTC GTTCAAGGGG 
ACGCTGCGGG ACACCTCGGC CGAGGAGCTG GCCACGGCGG TCGTCCGGGC CGCGGTGGAC
CGCTCGGGGC TGGCGCCCGA GGACGTCGAC GACGTCATCC TCGCCGAGGA GCTGGCCGGC
GGCGGCGACA TCGCCAGGTA CGCCGCCTTC GCGGCCGGGC TGACGGCGGC GCCGGGCCAG
GCCGTCAACC GCCACTGCGC GGCGAGCCTC GCGGCGGTGG GCAACGCGGC GGCGACGATC
CGGGCCGGGA TGGACCGCGC GGTCGTCGCC GGCGGCACCC ACTCCTCGTC GATGAACCCC
AGGCTGTCGT GGCGGGTGCC CGGGTCGGAC GAGCCGCGCG CCGGGTTCAA CCCCACGTTC
CCCTACTACG AGGGCGCCAC CGACGACGTG ACCCTCGCCG TCGGCTGGAA CACCGCGCAG
GAGGTGGGCA TCACCCGGGC GGAGATGGAC GCCTGGGCCA AGCGCTCCCA CGACCGGGCG
ATCGCCGCGA TCGACGCCGG AGTCTTCGAC GACGAGATCG TCCCGATCGA CGTCGTCGTG
GCGGGGGAGA AGGTCCGCTT CGCCGTCGAC GAGCACCCGC GCCGGACGTC CACGCTGGAG
AAGCTGGCCA CGCTGAAGCC GCTGCACCCC GAGATCGAGG GCTTCGGCAT CACCGCGGGG
AACGCGAGCG GCGTGAACGA CGCCGCCGCG GCCCTGATGC TGGTCACCGA CGACCTCGCC
CGGGATCGGG GCCTCACCCC GCTGGCCCGG GTGCGGGCGT GGGCGGCACT CGGCGTGGCC
CCGCACCGCA CCGGGATGGC CGGGGTGGAG GTCATCCCGC GGGTGCTGGA GCGGGCCGGG
ATCGGGGTGG CCGACGTCGA CGCCTGGGAG ATCAACGAGG CCTTCGCGTC GGTCCCGATC
GCCGCCTGCC GCCTCCTGGG AATCCCGGAC GACCTGGTCA ACCAGTACGG CAGTGGCTGC
AGTCTCGGGC ATCCGGTCGC GGCCTCCGGA GCGCGGATGC TGACCACCCT GACCCACCAC
CTGCGCCGGC GCGGCGGCGG GATCGCCGTG GCGGCGATGT GCGCCGCGGG CGGCCAGGGC
GGCGCAGTGG TCATCGAGGC GCCGTGA
 
Protein sequence
MAQAVIVSAV RTPIATSFKG TLRDTSAEEL ATAVVRAAVD RSGLAPEDVD DVILAEELAG 
GGDIARYAAF AAGLTAAPGQ AVNRHCAASL AAVGNAAATI RAGMDRAVVA GGTHSSSMNP
RLSWRVPGSD EPRAGFNPTF PYYEGATDDV TLAVGWNTAQ EVGITRAEMD AWAKRSHDRA
IAAIDAGVFD DEIVPIDVVV AGEKVRFAVD EHPRRTSTLE KLATLKPLHP EIEGFGITAG
NASGVNDAAA ALMLVTDDLA RDRGLTPLAR VRAWAALGVA PHRTGMAGVE VIPRVLERAG
IGVADVDAWE INEAFASVPI AACRLLGIPD DLVNQYGSGC SLGHPVAASG ARMLTTLTHH
LRRRGGGIAV AAMCAAGGQG GAVVIEAP