Gene Francci3_2244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2244 
Symbol 
ID3905012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2617787 
End bp2618953 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content69% 
IMG OID637879575 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_481341 
Protein GI86740941 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.106872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0160959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGACG TGGAGATCGT GGGCTGGGGT CATACCCCGT TTGGCCGTCT GCCCGAGGAG 
ACTCTGGAGT CGCTGATCGT TGCCGCCGCG CGGGAGGCGA TCGCGTCGGC GGGTCTGCGC
CCGCGGGAGA TCGACGAGAT TGTTCTCGGT ACCTATAACG CGGGGTTGCA GCCGTTGGCG
TTTCCTTCCT CGCTGGTGCT GGAGGCCGAT GACGATCTGC TGTTCACCCC GGCGACCCGG
GTGGAGAACG CGTGTGCGAG CGGGTCGGCG GCGTTGCTGT GCGGGGTGCG GGCGATTCGG
TCCGGGCAGG CGCGCCGGGT GCTGGTGGTC GGGGCGGAGA AGATGACCCA CGCGTCGGCC
GAGGTTGTCG GTGGGGCTTT GCTGGGTGCC GACTATGAGC ATGCCGGCGA GTGCGCGCCT
GCCGGGTTCG CGCGGTTGTT CGCCGATGTC GCCGAGGCCT ACTTCACCAA GTATGGTGAT
CACAGTGACG CGTTGGCGCG GATCGCGGCG AAGAATCATC GTAACGGGGT GGTGAATCCG
TACGCCCATC TGCGTTCCGA TCTCGGCTTC GAGTTCTGTA GCACGGTCGG TCCGCGTAAT
CCGGTGGTCG CGGGTCCGCT GCGGCGTACG GACTGCTGTC CGGTCTCGGA TGGTGCCGCC
GCGGTGGTGC TGGCCGCGCC GGGTGGTGCG CCGCGTGCCC CGGCGGTGCG GATCCGTGCT
CTGGCGCAGG CCAACGATTT CCTGCCGGCC GCGCGGCGTC ATCCGCTGGC GTTCGCCGCC
GCGCATGGGG CCTGGCAGGC GGCGTTGGGG CAGGCCCGGG TGCGGCTGTC CGACCTGCAC
CTGCTGGAGC TGCACGACTG TTTCACCATC GCGGAGTTGC TGGAGTACGA GGTCGTCGGG
TTGTGCCCGC CGGGCGGCGG GGGGCAGGTC ATCCTCGGTG GGGTCGTGGA CCGGGACGGC
ACGCTGCCGG TCAACCCCTC CGGGGGGTTG AAGGCCAAGG GTCATCCGGT TGGTGCCACC
GGGGTGTCCC AGCATGTGAT GGCGGTCCTG CAGCTCACCG GCACCGCCGG GGCGATGCAG
ATCCCCGGGG CCACCGTCGC CGGGGTGTTC AACATGGGTG GCCTGGCGGT CGCCAACTAT
GCGAGTGTCC TGGAGCGTGT GCGATGA
 
Protein sequence
MDDVEIVGWG HTPFGRLPEE TLESLIVAAA REAIASAGLR PREIDEIVLG TYNAGLQPLA 
FPSSLVLEAD DDLLFTPATR VENACASGSA ALLCGVRAIR SGQARRVLVV GAEKMTHASA
EVVGGALLGA DYEHAGECAP AGFARLFADV AEAYFTKYGD HSDALARIAA KNHRNGVVNP
YAHLRSDLGF EFCSTVGPRN PVVAGPLRRT DCCPVSDGAA AVVLAAPGGA PRAPAVRIRA
LAQANDFLPA ARRHPLAFAA AHGAWQAALG QARVRLSDLH LLELHDCFTI AELLEYEVVG
LCPPGGGGQV ILGGVVDRDG TLPVNPSGGL KAKGHPVGAT GVSQHVMAVL QLTGTAGAMQ
IPGATVAGVF NMGGLAVANY ASVLERVR