Gene Francci3_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3687 
Symbol 
ID3903788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4423016 
End bp4424200 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content72% 
IMG OID637881013 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_482768 
Protein GI86742368 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.24017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.945207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGTT CCGTGATCGT GGCCGGAGTG CGGACACCGT TCGGGAAACT GTCGGGCGGG 
CTGAAGAGCT TCACCGCGAC GGATCTCGGT GGCATCGCGA TCTCGGGTGC CCTCGCGCGG
GCCGGCCTGT CCGGCGACGC GGTCGATTAT GTGATCATGG GGCATGTGAT CCAGGCCGGT
GCGGGTCAGA TCACCGCCCG GCAGGCGGCG GTCGCCGCCG GCATCCCGCT GTCGGTGCCC
GCCATCACGA TCAACAAGGT CTGCCTGTCC GGCCTCGACG CGATCGCCCT GGCCGACACG
TACATCTCCA GCGGGGAGTT CGACCTCGTC GTCGCCGGTG GGATGGAGTC CATGACCGGC
GGCCCGCACC TGCTGCGCTC GCTGCGCTCC GGGGTGAAGT ACGGCCGGGC CGAGCTGCTC
GACGCCACCG AGCACGACGC GCTGTTCTGC GCCTTCGACC AGATCGCGAT GGGGGCGAGC
ACCGAGCGGT ACAACGCGGG GCTTGGCATC GGCCGGGCCG AGCAGGACGA GTTCGGCGCC
CAGTCGCATC AGCGGGCCGC CGCCGCCACC AAGAACGGGC TGTTCGACGA CGAGATCGTC
CCGGTCGCGG TGCCGCAGCG CCGCGGCGAG CCGCTGGTCG TCAACCAGGA CGAGAGCGTG
CGTCCCGACA CCACCGTCGA GGTGCTCGCG AAGCTGCGGC CGGCCTTCGA CGGCAACGGC
ACGATCACCG CGGGTTCGTC GTCGCCGATC TCCGACGGCG CCGCCGCCGT CATCGTGGCG
AGCCGGGCGA AGGCCGAACA GCTCGGCCTG CCGATCCTCG CGGAGGTCGG TCACCACGGC
TTCGTGGCCG GTCCGGACAC CTCCCTGCAG TCCCAGCCCT CGCGCGCCAT CCTCGCGGCG
CTGGCCAAGG AGCGGCTCAC CCCGGCCGAT CTCGACCTCG TCGAGATCAA CGAGGCCTTC
GCCGCGGTCG CGATCCAGTC CATGCGCGAC CTCGGGATCG GCCCGGAGAT CACCAACGTC
AACGGCGGCG CGATCGCGAT CGGTCACCCC GTCGGAGCCT CCGGTGCCCG GATCGCCCTG
ACCCTGGCCA ACGAGCTGAA GCGCCGCGGC GGCGGGATCG GGGCCGCCGG TCTCTGCGGC
GGCGGCGGCC AGGGAGACGC CTTGGTGCTG CGCGTTCCGG CGTGA
 
Protein sequence
MPGSVIVAGV RTPFGKLSGG LKSFTATDLG GIAISGALAR AGLSGDAVDY VIMGHVIQAG 
AGQITARQAA VAAGIPLSVP AITINKVCLS GLDAIALADT YISSGEFDLV VAGGMESMTG
GPHLLRSLRS GVKYGRAELL DATEHDALFC AFDQIAMGAS TERYNAGLGI GRAEQDEFGA
QSHQRAAAAT KNGLFDDEIV PVAVPQRRGE PLVVNQDESV RPDTTVEVLA KLRPAFDGNG
TITAGSSSPI SDGAAAVIVA SRAKAEQLGL PILAEVGHHG FVAGPDTSLQ SQPSRAILAA
LAKERLTPAD LDLVEINEAF AAVAIQSMRD LGIGPEITNV NGGAIAIGHP VGASGARIAL
TLANELKRRG GGIGAAGLCG GGGQGDALVL RVPA