Gene Francci3_4243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4243 
Symbol 
ID3907209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5062978 
End bp5064399 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content68% 
IMG OID637881569 
Productlipase, putative 
Protein accessionYP_483318 
Protein GI86742918 
COG category[R] General function prediction only 
COG ID[COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.48059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCGGT GCGTGGCCGT CGGATATCCC GAGGGAAAGG GACGTCGAGT GGCTCGCCGC 
CGTTACTTCC ACCTGCTGAG CGCGCTTGCC ATCGCCGGAC TGGCGGTCGC CTGTTCCGGC
GCTGGTGGCA CGGAGTCCGG CGCGGGACGC TCCCCGGCGC TCGCGCGGGA GCTGGCGGAG
GCCGCGGTCG CCGCCCCGGA CGCCTCTCGC CTCCCGATCC TGTTCGTGCA CGGCAGCCTT
GGGTCGGGCA GCCTCTTCGA GACACCGGCG CAACGATTCA CCGCAAACGG CTATCCCGTC
GACTACGTCG AGGCCTTCGA CTACGATGCG TCGTCCTCGG CCAACTCCGT CGAGGAGGTC
CTTCCGGACC TCGATGCGAA GATCGCTAGT ATCCTCAAGG CAACCGGGGC GAAGAAGCTC
GATCTGGTGG CACATCTCGA GGGCGCCGCG CTAAGCGAGC GGTACCTGAC CAGCTCATCC
GACCGGGCGA ACCGGGTCGC GCACTACGTC ACCGTCGGAG GCTCGCTCGC GGGTAGACCC
CCGGCCGGAG TGCCCACCCT GGCGGTCTGG GGCGAGAACG ATCCGACCCG GCGCGTCGAG
GGGGCCGTCA ACGTGCGGTT CGTCGACCAG GACCGCGCCC AGCTCCTGAC CTCGACGGCC
ACCTTCAGGG AGATGTACCG GTTTCTCACC GGAACCACTC CGCACACGAC CGACGTGGTG
CCGAAGCAGG GCCCGATCGA GATCTCCGGC CGGGCCCTGC GCTATCCGAT GAATGTCGGT
GTGCGGGACG CCAGGCTGGA GGTTCACCGG GTCGACCCAC GGACGGGTCG GCGGGGGGCG
AACAGCCTCA AAGGTTATTT CCACCTCTCC GGCGACGGAT CCTGGGGTCC GTTCCGAGGT
GACGGCGCCT CCACCTACGA GTTCGTGATC GTACGCGGCG GGGAACCCGA CCACCACTTC
TACTACCAGC CGTTCCGCCG CGACGATCAC CTCGTACGTC TGCTGACGAG CCAGCCGGGC
CAGGAGATCG GCGGCGGGAT GGAGACCGGT CCCGGGCACA CCGATCTGGT CGTCGAACGC
CCGCTCGAAT GGTGGGGGGA CCAGGGGAAC GCCGACGACG TCCTCAGCAT CGACGGGAAG
AACGTGCTGA ACGCCGCCGA CGCGCCCCGG TCCAAACTGG CCCTGGGCGT CTACCTGTTC
GACCGGAATT CCGACGGCGT CACCCACCTC ACAATGCCGA TTCCGGCGTA TTTCGGGATC
GGCTTCATCA CCGGCGTAGA TGTGTTCATC CCGGCGGGTG AGCGCTCGGT CGAAGTCGCG
ATGACGCCGC GGGGAAAGAC AGCCCACCGG AGCGTTCTCC AGGTGCCCGC CTGGCCGTCG
GACCGGCATC GCGTCACTCT GCACTTCACC GGCGAGGGCT GA
 
Protein sequence
MARCVAVGYP EGKGRRVARR RYFHLLSALA IAGLAVACSG AGGTESGAGR SPALARELAE 
AAVAAPDASR LPILFVHGSL GSGSLFETPA QRFTANGYPV DYVEAFDYDA SSSANSVEEV
LPDLDAKIAS ILKATGAKKL DLVAHLEGAA LSERYLTSSS DRANRVAHYV TVGGSLAGRP
PAGVPTLAVW GENDPTRRVE GAVNVRFVDQ DRAQLLTSTA TFREMYRFLT GTTPHTTDVV
PKQGPIEISG RALRYPMNVG VRDARLEVHR VDPRTGRRGA NSLKGYFHLS GDGSWGPFRG
DGASTYEFVI VRGGEPDHHF YYQPFRRDDH LVRLLTSQPG QEIGGGMETG PGHTDLVVER
PLEWWGDQGN ADDVLSIDGK NVLNAADAPR SKLALGVYLF DRNSDGVTHL TMPIPAYFGI
GFITGVDVFI PAGERSVEVA MTPRGKTAHR SVLQVPAWPS DRHRVTLHFT GEG