Gene Francci3_3978 details


Gene Information

Locus tag: Francci3_3978
Symbol:
ID: 3906938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4760057
End bp: 4761061
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 70%
IMG OID: 637881306
Product: Acyl-CoA thioesterase-like
Protein accession: YP_483057
Protein GI: 86742657
COG category: [I] Lipid transport and metabolism
COG ID: [COG1946] Acyl-CoA thioesterase
TIGRFAM ID: [TIGR00189] acyl-CoA thioesterase II
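
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 4760057
end_bp = 4761061
gene_length_bp = 1005
protein_length_aa = 334


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1005
print(expected_protein_length(gene_length_bp))  # 334
```

Under these assumptions both values agree with the reported gene length and protein length.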


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGGCGGT TCGCCAGAGA GTGCAGCGAC CGGTCGCGGT CCCCCGACCT GGACGTCCAG 
CGCGGGCGTC CAGCGCGGGC GTCCGCGGCA TCCGCACCTG GCGAGCGTCC ACGGGAAGGG
TGCGCTGTGG GTACACAATC CGCGGAGACA ACGGACGCAC GAGCCGGCGA GGCCCCCAAG
CAACGGGCAA GCACGCAGGA TCAGGTCGAT GCGGTGCCGG CGCCCGGCCA GTGCACTCTG
AGTGAACTAC TGGCAGTTCT CGACCTCGGC GGGGATGTTC CGCCGGTGCC GGCCCGATGC
CTGCCCAGCA CACACGGCGG CGTCCTCGGG GCGCAGCTGC TCGGCCAGCA GATCGTTCTC
GCCGAGCGCA TGACCCCCAA CAAGATCACG CACAGCCTGC AAACGGTGTT CATGCGCCCC
GGCGACTGCC GCCAACCGGT CTGGATCGAC GTCGAGCGGC TGGCCCACGG GCGGTCGATC
GACTCCCTGG CCCTGACCTT CCGGCAGGAC TCGTTGCAGA TCTGCCGGGC GGACGTGATG
CTCCGTTCCC CGGAGCCGGA CTTCCTGCGC CTGCGCTCCA CCGAGTCCGA CTTCCTCGGC
CCGCAGCACG CCAGCCCCCT TGACCGTCCG ATGATGCCCT GGGAAGTTCG GGTGCTTCCC
CGAGCGGACA CCCATCAGCT GGACCAATGG CAGCGCATCC CCGAGGCGCC GGAGGACCGG
TCCCTGTGGA GGGCGTTCAT CGCGCACAGC TGCGAGTTGC TGCCGCTGTC CGACCTGCTG
GCCGTCACCG GCCTGACCCC CACCAAGCGC CTGGCAGTCG CTGTCCTATC GCAGAATGTG
ACCTTCTACG ACGACCTGGA CGTCCGGGAC TGGCATCTGT TCCGGGTGCG TACCCTGCAC
GCGGGCCACG GCCGGGCGAT CGGTCGCGTC GAGGTCTTCG GCCCGGACGG GGAGCTACGC
GCGGGCAGTG AGCTCGTCGG GCTGCTGCGC GGATCCGCGA TCTGA
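
The gene sequence is printed in blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the reported values. A minimal sketch follows, using only the first line of the sequence as sample input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC fraction of one line is not expected to equal the whole-gene figure exactly.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGTGGCGGT TCGCCAGAGA GTGCAGCGAC CGGTCGCGGT CCCCCGACCT GGACGTCCAG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))
```

Passing the full 17-line sequence to the same functions should give a length and GC percentage that can be checked against the Gene Length and GC content fields.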
 
Protein sequence
MWRFARECSD RSRSPDLDVQ RGRPARASAA SAPGERPREG CAVGTQSAET TDARAGEAPK 
QRASTQDQVD AVPAPGQCTL SELLAVLDLG GDVPPVPARC LPSTHGGVLG AQLLGQQIVL
AERMTPNKIT HSLQTVFMRP GDCRQPVWID VERLAHGRSI DSLALTFRQD SLQICRADVM
LRSPEPDFLR LRSTESDFLG PQHASPLDRP MMPWEVRVLP RADTHQLDQW QRIPEAPEDR
SLWRAFIAHS CELLPLSDLL AVTGLTPTKR LAVAVLSQNV TFYDDLDVRD WHLFRVRTLH
AGHGRAIGRV EVFGPDGELR AGSELVGLLR GSAI
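
The protein sequence can be related back to the gene sequence through translation table 11. The sketch below assumes the table 11 codon assignments (the same as the standard code) and its set of alternative start codons, each read as methionine when it is the first codon; this is why the leading GTG of this gene appears as M. The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative. It is a hand-rolled illustration, not the pipeline that produced the record.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumed table 11 initiation codons; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGTGGCGGT TCGCCAGAGA GTGCAGCGAC CGGTCGCGGT CCCCCGACCT GGACGTCCAG"
print(translate(first_line))  # MWRFARECSDRSRSPDLDVQ
```

The printed residues match the first twenty residues of the protein sequence listed above.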