Gene Francci3_4451 details


Gene Information

Locus tag: Francci3_4451
Symbol:
ID: 3907427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5321176
End bp: 5322396
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 71%
IMG OID: 637881783
Product: acetyl-CoA acetyltransferase
Protein accession: YP_483526
Protein GI: 86743126
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
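
The coordinates and lengths in the Gene Information fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5321176
end_bp = 5322396
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 1221 bp, which is 407 codons, or 406 amino acids plus one stop codon.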


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAGG CCGTTGTCGT CGCCGCAACC CGATCACCGA TCGGGCGGGC GTTCAAGGGT 
TCGCTGCGGG GGATGCGGGC CGACGACCTC GCGGCCATCA TCGTCCGGGC AGCCCTGGAC
CAGGTTCCGG CCCTCAACCC GGCCGACATC GACGATCTCA TCCTCGGTTG CGGGCTACCC
GGAGGCGAAC AGGGACACAA CATCAGCCGC GTCGTTGCCG TCAGCCTCGG GTTCGACACC
GTTCCCGGCA CGACGGTCTC CCGGTACTGC GCATCCTCCC TGCAGGCTCT GCGGATGGCT
GCGCACGCCG TGCGCGCGGG AGAGGGTGAC GCGTTCATCG CCGCCGGGGT GGAGGTCGTC
AGCGGCTTCG TCAGGGGCAA CAGCGACAGC CTGCCGGACA CCCAGAACCC GCGCTACTGC
GAGGCCCAGG CCCGTACGGC GGCCCGTGCC CAGGCGGGGT CCGCGCCCTG GACGAATCCC
CGGGCGGCCG GCGAGCTGCC CGACGTCTAC ATCGCCATGG GGCAGACCGC CGAGAACGTG
GCCCAGCTCG CGGGTGTGAG CCGCCTCGAA CAGGACGAGT ACGCCTGCCG CTCGCAGAAC
CTGACGGAGC GGGCCGTCGC GGACGGCTTC TTCAAGCGGG AGATCATTCC GGTGCCGCTG
CCCGACGGCG GGGTGATCGA CAGCGACGAC AGTCCCCGGC CGGGAACCAC GATGCGGGCG
CTGGCCGCTC TCAGGCCGGT GTTCCGCCCG GACGGTACCG TCACGGCCGG CAACGCCTGC
CCGCTCAACG ACGGAGCCGC CGCGGTCATC GTCATGAGCG ACACCAGGGC CCGCGAGCTC
GGCATCACCC CGCTGGCCCG GATCGTCTCC ACCGGCGTCA GCGCACTGTC ACCGGAGATC
ATGGGCCTGG CCCCGGTGGA GGCGTCCCGG CGGGCCCTGG CCCGGGCGGG GATGACGATC
GACGACGTCG ACCTGGTAGA GCTGAACGAG GCGTTCGCGG CGCAGGTCAT CCCGACCTAC
CGGGCGCTCG GGCTCGACGT CGCGAAGCTG AATGTGCACG GTGGCGCGAT CGCGCTCGGT
CACCCGTTCG GGATGACCGG CGCCCGGCTA CTCACCACCC TGCTGAACGG ACTACGCACC
ACCGACACGA CCATCGGCCT GGAGACAATG TGCGTCGGCG GCGGGCAGGG CATGGCGCTG
GTCGTCGAGC GGCTGTCCTG A
 
Protein sequence
MTEAVVVAAT RSPIGRAFKG SLRGMRADDL AAIIVRAALD QVPALNPADI DDLILGCGLP 
GGEQGHNISR VVAVSLGFDT VPGTTVSRYC ASSLQALRMA AHAVRAGEGD AFIAAGVEVV
SGFVRGNSDS LPDTQNPRYC EAQARTAARA QAGSAPWTNP RAAGELPDVY IAMGQTAENV
AQLAGVSRLE QDEYACRSQN LTERAVADGF FKREIIPVPL PDGGVIDSDD SPRPGTTMRA
LAALRPVFRP DGTVTAGNAC PLNDGAAAVI VMSDTRAREL GITPLARIVS TGVSALSPEI
MGLAPVEASR RALARAGMTI DDVDLVELNE AFAAQVIPTY RALGLDVAKL NVHGGAIALG
HPFGMTGARL LTTLLNGLRT TDTTIGLETM CVGGGQGMAL VVERLS
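
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below shows one way to strip that formatting, compute a GC percentage, and check the start codon. The helper names `clean` and `gc_percent` are hypothetical, and `COMMON_STARTS` is only a subset of the initiation codons that NCBI translation table 11 allows. The gene begins with GTG, which table 11 reads as an initiator, consistent with the protein sequence beginning with M.

```python
# Subset of the start codons permitted by NCBI translation table 11 (assumption:
# only the three most common are listed here; the table defines others as well).
COMMON_STARTS = ("ATG", "GTG", "TTG")


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence, and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First block of 10 bases from the gene sequence in this record.
first_block = "GTGACCGAGG"
print(gc_percent(first_block))
print(clean(first_block)[:3] in COMMON_STARTS)
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field of this record.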