Gene Franean1_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4037 
Symbol 
ID5672395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4813463 
End bp4814656 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID641242913 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001508330 
Protein GI158315822 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.301808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGGAGC ATGAAGCGGT AATCGTCTCG ACCGCACGGA CGGCGATCGG CACCGCGGGC 
AAGGGTTCCC TCGTCGACGT CGACGCTTTC GAGCTGGGTA CGCAGGCCGT CGCGGAGGCA
GTCCGGCGGT CCGGTATCGA CAGCGCGGAC ATTGATGACG TCGTCCTGGG TGAATCGCTT
TACGGCGGCG GCGACATCGC GCGTTACGCG GCCATCGAGG CTGGTTTCGC GAACGCCGCC
GGCGTCGCGC ACAACCGGCA CTGCGCGTCG GGGCTCGTCG CGGTGCAGAC CGCCGCCGCC
TCGATCATCT CCGGGATGGA CCGTGTCATC GTCGCGGGTG GGACGAACTC CTCCTCCACC
TCGCCACGGG CGAAGCGGCG CCGGCCCGGA ACCGACGAGG TCGAGGACTG GTACTCGCCG
ACCCACCGGA ACACCGCCGA GGCCCCGAAC TTCGACATGT CCATCACCGT CGGGTGGAAC
GCGGCCGTCA AGGCCGGCGT CAGCCGCGAG GAGATGGACG CGTGGGCGCT GCGCTCGCAC
CAGCGGGCCG TCGCGGGCAT CGACGCGGGC AGCTTCACCG ACGAGATCTT CCCGATCGAG
GTGCCCCGGC GGGACGGCAC GAGCTTCACC TTCGCCGTGG ACGAGCACCC GCGGCGCACC
ACCAGCATGG AGAAGCTGGC CTCGCTCAAG CCGCTGCACC CCGAGATCGA GGGCTTCAGC
ATCACGGCCG GCAACGCGGC GGGCGCGAAC GACGGCGCCG CCGCCATGGT CATCACTGAC
GGCGGCTACG CCGCGGAGCA CGGCCTGGAG CCGCTGGGCA TCGTCCGGGC CTGGGCGTCG
GTCGGGGTGC CCCCGGCCGA GACCGGCATC GCCCCCACAC TGGCGATCCC GAAGGCCCTC
AAGCGCGCCG GCCTCACCGT GGCCGACGTC GACCTGTGGG AGATCAACGA GGCGTTTGCG
TCCGTGGCGG TCGCGGCATC CCGGATCCTG GAGATCGACG ACAGCCGCGT CAACTTCCTG
GGCAGCGGTT GCAGCCTCGG GCACCCGATC GCGACGACCG GCGCGCGGCA GATCATCACG
TTGATCCATG AGCTGCGCCG CCGTGGCGGC GGAGTGGCGG TGTCGGCGAT GTGCGCCGGC
GGGGGCATGG CAAGCGCCCT CGTGCTGGAG GTTCCCGCTC CGCGCTCGAG CTGA
 
Protein sequence
MSEHEAVIVS TARTAIGTAG KGSLVDVDAF ELGTQAVAEA VRRSGIDSAD IDDVVLGESL 
YGGGDIARYA AIEAGFANAA GVAHNRHCAS GLVAVQTAAA SIISGMDRVI VAGGTNSSST
SPRAKRRRPG TDEVEDWYSP THRNTAEAPN FDMSITVGWN AAVKAGVSRE EMDAWALRSH
QRAVAGIDAG SFTDEIFPIE VPRRDGTSFT FAVDEHPRRT TSMEKLASLK PLHPEIEGFS
ITAGNAAGAN DGAAAMVITD GGYAAEHGLE PLGIVRAWAS VGVPPAETGI APTLAIPKAL
KRAGLTVADV DLWEINEAFA SVAVAASRIL EIDDSRVNFL GSGCSLGHPI ATTGARQIIT
LIHELRRRGG GVAVSAMCAG GGMASALVLE VPAPRSS