Gene Franean1_2943 details


Gene Information

Locus tag: Franean1_2943
Symbol:
ID: 5671329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3462238
End bp: 3463449
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 68%
IMG OID: 641241849
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001507269
Protein GI: 158314761
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
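
The start, end, gene length and protein length fields can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, although both agree with the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3462238, 3463449)
print(length, protein_length_aa(length))  # 1212 403
```

Both results match the Gene Length and Protein Length fields listed above.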


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCTG AAGCCTTCGT CTACGATGCC GTTCGCACGC CCCGTGGGCG TGGCCGGAAG 
GGATCACTGC ACGGGACCAA GCCGATCGAC CTGGTGGTCA GCCTCGTGGA CGCCCTACGC
AAGCGCAACC CCACCCTTGA CCCCGAGCGG ATCGACGACC TCGTCCTGGG GGTCGTCACC
CCCATCGGCG ACCAGGGATC GGACATCGCA CGTACCGCCG TGCTGGCCTC GGGCCTGCCC
GACACGGTCG GAGGCGTACA GCTGAACCGG TTCTGTTCCT CGGGTCTGGA GGCAGTCAAC
ACCGCCGCGC AGAAGGTGCG CTCGGGATGG GAATGCCTGG TGGTCGCCGG TGGAGTCGAG
TCCATGTCCC GGGTCCCGAT GGCATCGGAC GGCGGTGCGT GGGCCCTGGA CCCGTGGACG
AATCTCACGA CGTCGTTCGT GCCGCAGGGC GTCAGCGCCG ACCTCATCGC GACGATCGAG
GGGTTTGACC GGGAGGCGGT CGACTCCTAT GCCGTCCGTT CTCAGGGGCT CGCGGCCAAG
GCGTGGGCCG GGGGCTACTT CGAGCGATCT GTGGTCCCGG TCGTGGATCG CAACGGGCTG
ACCGTCCTCG ACCGCGACGA GCACATGCGG CCCGAGACGA CCCTCGAGAG CCTCGCCGCG
CTCAACCCGT CCTTCGCAGC GGTAGGCGAG CAGGGCGGCT TCGACGCGGT GGCGCTGCAG
AAGTACCACT GGGTCGAGCG CATCGAACAC GTCCACCACG CCGGCAACTC CTCCGGCGTA
GTCGACGGCG CCGCCCTGGT GGTCGTGGGG AACGAGGAGA TCGGACGCGA CCTGGGACTG
ACACCGCGAG CTCGCATCGT GGCCACGGCG ACCAGCGGCG CGGACTCGAC GATCATGCTG
ACCGGCCCGA CACCCGCGAC CCTCAAGGTG CTGACAAAGG CGGGGCTGAC ACCCGACGAC
ATCGACCTGT TCGAGATCAA CGAGGCGTTC GCGTCGGTCG TCCTGAAGTA CCAGAAGGAC
CTGCGAATTC CGGACGAGAA GCTCAACGTC AACGGCGGCG CGATCGCGAT GGGTCATCCG
CTCGGGGCCA CCGGTGCCAT GATCCTCGGC ACCGTGGTCG ACGAGCTCGA ACGCCGTGAG
GCCCGACGGG GTCTGGTCAC CTTGTGCGTC GGCGGTGGGA TGGGCGTGGC CACCGTGGTC
GAGCGCGTCT GA
 
Protein sequence
MSAEAFVYDA VRTPRGRGRK GSLHGTKPID LVVSLVDALR KRNPTLDPER IDDLVLGVVT 
PIGDQGSDIA RTAVLASGLP DTVGGVQLNR FCSSGLEAVN TAAQKVRSGW ECLVVAGGVE
SMSRVPMASD GGAWALDPWT NLTTSFVPQG VSADLIATIE GFDREAVDSY AVRSQGLAAK
AWAGGYFERS VVPVVDRNGL TVLDRDEHMR PETTLESLAA LNPSFAAVGE QGGFDAVALQ
KYHWVERIEH VHHAGNSSGV VDGAALVVVG NEEIGRDLGL TPRARIVATA TSGADSTIML
TGPTPATLKV LTKAGLTPDD IDLFEINEAF ASVVLKYQKD LRIPDEKLNV NGGAIAMGHP
LGATGAMILG TVVDELERRE ARRGLVTLCV GGGMGVATVV ERV
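
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be checked against the record. The sketch below shows one way to do that; the helper names are hypothetical, and only the first row of the gene sequence is used as sample input, so the printed values describe that row, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGAGCGCTG AAGCCTTCGT CTACGATGCC GTTCGCACGC CCCGTGGGCG TGGCCGGAAG"
seq = clean_sequence(first_row)

print(len(seq))          # 60 bases in the first row
print(seq[:3])           # GTG, the start codon of this CDS
print(gc_percent(seq))   # GC percentage of this row only
```

Applying `clean_sequence` to all rows of the gene sequence and passing the result to `gc_percent` would give a figure comparable to the GC content field.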