Gene Francci3_2218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2218 
Symbol 
ID3906357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2593154 
End bp2594209 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content72% 
IMG OID637879550 
Producthypothetical protein 
Protein accessionYP_481316 
Protein GI86740916 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3315] O-Methyltransferase involved in polyketide biosynthesis 
TIGRFAM ID[TIGR00027] methyltransferase, putative, TIGR00027 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.432005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG ATGGCGAGCG CGACGGGCCG ATGATCACCT TCTTGCCTGT GGGTGCCCCT 
GCGGTAGTGG CATGCAATCG TGGAAGAGTG CGGGGACATC GGCCCATGGG CGGTCACGGG
GGTGCGAGCC GGACCGCGGT GCTGGTGTGC CAGGGGCGGG CCGTCGCGCA CGGGCGCATC
GCTGTCGACC GGTTCGACGA TCCGACAGCG ATGATGTTTC TGCGCGCCGA CGAACGGGGC
GTTGTCGAGT GGGTGCGCGG TGGTGTCCCG CCGCGGGGGT GGGGGGCGCG GCTTGAGTTT
GAGATGGTGC GGGCCTGCGG TGAGGTGATG GTGCCGCGTA CCGTCGCCAT CGACGAGGCC
CTCCGGGAGG CCGTCGGTGG GGCCGACGGC GATCGGGAGG TGCGGCGGGC GGAGCAACTG
GTGGTTCTCG GCGCGGGCCT GGACGGACGG CCCTGGCGGA TGCCGGAGCT CGCCGGCGTC
GCCGTATTCG AGGTCGATCA TCCGGCGTCC CAGCGGGACA AGCTCGACCG GCTCGGGGAC
GCGGATCGGC TCGGGGACGC GGGGCCGCCG GGAACGGCAC CGCGCTTTGC GCCGGTGGAC
TTCAGCCGTG ACGACCTCGA CGCGGCCCTG ACGTCGGTGG GGCATCGGCC GACGGTGCCG
ACGGTCTGGA TCTGGGAGGG TGTCGTACCC TACCTCACCC GTGGCCAAGT CGCCGCCACG
ACCCGGGTCG TGGCGGGGCG CTCGACCCCG GGAAGCCGCC TGATCGTTCA TTACCACGCG
CCGGCGCTCT CGGCGCTTCT CGGCCGGCTG GCGGGACGGG TGCTGACTAT TGTGTCACGG
CGCCCCGATC CCATGGCCCG CGAACCCAAC CGCTCGGCAT GGACCCCCGC CGCGATGCGT
CGGATGCTCG CCGCCCACGG CTTCACCGTC CGCCGCGACG ACGACCTGCT CACGCTCGCC
CGGCGATTGG CGGTGCCGGT TCGGCACCGC CAATCGCTGC GCAACGGCCA CGTCGTCGTA
GCCGACCTCC TCGTGGCCGA CTTCGACACG GTCTGA
 
Protein sequence
MAADGERDGP MITFLPVGAP AVVACNRGRV RGHRPMGGHG GASRTAVLVC QGRAVAHGRI 
AVDRFDDPTA MMFLRADERG VVEWVRGGVP PRGWGARLEF EMVRACGEVM VPRTVAIDEA
LREAVGGADG DREVRRAEQL VVLGAGLDGR PWRMPELAGV AVFEVDHPAS QRDKLDRLGD
ADRLGDAGPP GTAPRFAPVD FSRDDLDAAL TSVGHRPTVP TVWIWEGVVP YLTRGQVAAT
TRVVAGRSTP GSRLIVHYHA PALSALLGRL AGRVLTIVSR RPDPMAREPN RSAWTPAAMR
RMLAAHGFTV RRDDDLLTLA RRLAVPVRHR QSLRNGHVVV ADLLVADFDT V