Gene Franean1_6477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6477 
Symbol 
ID5674792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7874674 
End bp7875867 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content74% 
IMG OID641245325 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001510720 
Protein GI158318212 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0438071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGACG CCGTCGTCGT CGAAGCGGTG CGGTCCCCGA TCGGGCGGGG ACGCGCCAGC 
GGCAGCCTCG CCCCCGTCCA CCCCATCGAC CTGCTGGCGC ACAGCATCTC CGCGGCCGTC
GAGCGGTCCG GCGTCGATCC GGCGCTGATC GACGACGTCA TCGCCGGCTG CGTGACCCAG
AGCGGGGAGC AGGCCGCGAA CATCGCCCGC TGGGCGACCC TGGCCGCCGG ACTGCCCGAG
TCGGTGCCCG GGACCACCGT CGACCGCCAG TGCGGCTCGT CCCAGCAGGC CGTCCACTTC
GCCGCGCAGG GCGTCATCGC CGGCGCCTAC GACATGGTCC TGGCCTGTGG TGTCGAGTCG
ATGGGGCGGG TCCCGATGGG CTCCGCGACG CTCCCCGGAG ACAGCCACGG CACCCGGATC
AACGCCCGCT ACCCGGACGG GCTGATCTCG CAGGGCATCA GCGCCGAGCT GATCGGCGCG
CGGCGGGGCC TCGGCCGCGC CGAGATGGAC GAGTTCGCCC TCGCCAGCCA CCAGCGCGCC
GCCACCGCGG CCCGGGATGG CCTGTTCACC CCGGAGATCG CGCCGGTGAA GGTCACCGGG
CCGGACGGGT CGGTCGTCGA GTTCTCCGCC GACGAGGGCA TCCGCCCGGC GTCGAGCACG
CAGGCCCTCG CGGGTCTGCG CCCGGCCTTC TACGACGAGG CCACCGCCGC GCGTTTCCCG
GAGATCGGCT GGAACGTGAC GGCCGGCAAC GCCTCGCAGA TCAGCGACGG CTCCGCCGCG
CTGCTGATCA CCACCTCGGA GCGGGCCCGC GAACTGGGGC TGCGCCCGCT CGCCCGGCTG
CACAGCTTCG CCGTCGTCGG CGACGACCCG TTCCTGATGC TGCGCGGCGT CATCCCCGCC
ACCCGCCGGG TGCTGGAGCG CTCGGGCCTG AGCCTGGACG ACATCGACCT GTTCGAGGTG
AACGAGGCCT TCGCCGCGGT CGTGCTGGAC TGGCGCGCCG AGATCGGCGC ATCGGCGGAG
AAGGTCAACG CCCGGGGCGG CGCGATCGCG CTCGGCCACC CGCTCGGCGC CAGCGGCGCC
CGGATCATGA CGACCCTCGT GCACGCGCTG CACCAGACCG GCGGCCGGTT CGGGCTGCAG
ACGATGTGCG AGGCGGGCGG CCTCGCGAAC GCGACCATCA TCGAGCGCCT CTGA
 
Protein sequence
MRDAVVVEAV RSPIGRGRAS GSLAPVHPID LLAHSISAAV ERSGVDPALI DDVIAGCVTQ 
SGEQAANIAR WATLAAGLPE SVPGTTVDRQ CGSSQQAVHF AAQGVIAGAY DMVLACGVES
MGRVPMGSAT LPGDSHGTRI NARYPDGLIS QGISAELIGA RRGLGRAEMD EFALASHQRA
ATAARDGLFT PEIAPVKVTG PDGSVVEFSA DEGIRPASST QALAGLRPAF YDEATAARFP
EIGWNVTAGN ASQISDGSAA LLITTSERAR ELGLRPLARL HSFAVVGDDP FLMLRGVIPA
TRRVLERSGL SLDDIDLFEV NEAFAAVVLD WRAEIGASAE KVNARGGAIA LGHPLGASGA
RIMTTLVHAL HQTGGRFGLQ TMCEAGGLAN ATIIERL