Gene Franean1_1960 details


Gene Information

Locus tag: Franean1_1960
Symbol:
ID: 5670361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2356374
End bp: 2357567
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 72%
IMG OID: 641240881
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001506303
Protein GI: 158313795
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
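
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2356374, 2357567)
print(length, protein_length(length))  # prints: 1194 397
```

Under these assumptions the result matches the Gene Length (1194 bp) and Protein Length (397 aa) fields of the record.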


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.201942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAACG CAGTCATCGT CGACGTCGTG CGCACCCCCT CAGGGCGGGG CAAGCCGGGC 
GGCGCGCTGT CCGAGACCCA CCCCGTGGAG CTTCTCGCCA CGACGCTGAA GGCGCTGACC
AGCCGGAACG ACTTCGACCC GGCGCTCATT GACGACGTCA TCGCCGGGTG CGTCGGCCAG
GCCGGTGAGC AGTCGGCGAA CATCGGCCGC AGTGCGGTGC TCTCCGCCGG CTACCCCGAG
TCGGTGCCGG CGACCACCGT CGACCGGCAG TGCGGCTCCA GCCAGCAGGC GCTGCACTTC
GCCGCGCAGG GCGTGATGGC CGGTGCCTAC GACGTCGTGG TGGCCTGTGG CGTCGAGTCG
ATGAGCCGGG TCCCGATGCG CTTCGCGGCG GCGGGGAAGG ACCCACGCGG GCCGGGCATG
ATCGCCCGCT ACCCCGAGGG CCTGGCCAAC CAGGGCATCG GCGCCGAGCT GATCGCCGCC
CGCTGGAAGC TGACCCGCGA GGAGCTCGAC GAGTTCTCCG CGCTGTCGCA CCAGCGGGCC
GCGGCGACGG CCGAGGCCGG CGGCTTCGAC AACGAGATCG TCCCCGTCGA GGCGCTCGCG
CCGGACGGCA GCACGTTCAC GCACACCGTC GACCAGACCG TCCGGCCGAC CACCACGGCC
GAGGGTCTCG CGGCCCTCAA GCCGTCCTTC TACACCGAGC AGTTCGCGCA GCGGTTCCCG
GAGATCGGCT GGCACATCAC CCCGGGCAAC TCCTCCCCGC TGACCGACGG CGCGTCCGCC
GCTCTGATCA TGAGCGAGTC GAAGGCCGTC GAGCTCGGGC TGCGCCCGCG GGCCCGCTTC
CACGCCTTCG CCCTCGCCGG TAGCGAGCCG CTGATCATGC TGACCGGCCC GGCCCCGGCC
ACCCGCAAGA TCCTCGCCCG GTCCGGCCTG CGGATCGACG ACATCGACGC CTACGAGGTC
AACGAGGCGT TCGCGTCCGT CCCGCTGTTC TGGGCGAAGG AGTTCGACGC CGACCCGGCC
AGGCTCAACC CACGCGGTGG TGCGATCGCG CTGGGCCATC CGCTGGGCGG CTCGGGCGTC
CGTCTGATGG CGACGATGGT CAACTACCTG GAGGCCACCG GCGGGCGCTA CGGCCTGCAG
ACCATGTGTG AAGGTGGCGG CATGGCCAAC GCCACCATCA TCGAACGCCT CTGA
 
Protein sequence
MENAVIVDVV RTPSGRGKPG GALSETHPVE LLATTLKALT SRNDFDPALI DDVIAGCVGQ 
AGEQSANIGR SAVLSAGYPE SVPATTVDRQ CGSSQQALHF AAQGVMAGAY DVVVACGVES
MSRVPMRFAA AGKDPRGPGM IARYPEGLAN QGIGAELIAA RWKLTREELD EFSALSHQRA
AATAEAGGFD NEIVPVEALA PDGSTFTHTV DQTVRPTTTA EGLAALKPSF YTEQFAQRFP
EIGWHITPGN SSPLTDGASA ALIMSESKAV ELGLRPRARF HAFALAGSEP LIMLTGPAPA
TRKILARSGL RIDDIDAYEV NEAFASVPLF WAKEFDADPA RLNPRGGAIA LGHPLGGSGV
RLMATMVNYL EATGGRYGLQ TMCEGGGMAN ATIIERL
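
The gene sequence in this record is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC percentage could be derived from such text. The helper names are hypothetical, and the exact method the database uses for its GC content field (rounding, handling of ambiguous bases) is not stated in the record, so this is an assumption rather than a reproduction of it.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases among all bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGAGAACG CAGTCATCGT CGACGTCGTG CGCACCCCCT CAGGGCGGGG CAAGCCGGGC"
print(len(clean_sequence(first_line)), round(gc_percent(first_line)))
```

Passing the full gene sequence block instead of `first_line` gives a value that can be compared with the GC content field.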