Gene Francci3_2590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2590 
Symbol 
ID3906496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3054649 
End bp3056298 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content68% 
IMG OID637879915 
Productputative phosphoesterase 
Protein accessionYP_481681 
Protein GI86741281 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.671216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.163709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGG ACATGACAAT CGCCGAGCAG TACGAATGGC TCCAGACCTT TCTCCGGCGG 
CGGCCGGTCA GCCGACGCGG GATTCTGCTC GGCGGTCTGG CTGGTGCCGG CGCGGCGAGC
CTGTTCGGCA CGCCGTTCGG CCGGGCCGCC TACGCCGCCA CGAACGACAG TCCCCTGCTC
GTGGGCGGCA GGCACACCGC CTTCGGCGCC GATCCGTCCC GGCAGCTTCG GCTGGCGGCG
CAGTTGTCGC GTAACCCGGG CCGCGGCAAG GTTCTGGTCG ATATCGGGCC GGGACGCGGT
CTCGGCGGCA CGGTCGAGGC GGAGATCCGA CCGCTGCTGT CCCAGGTCCC CCAGGCCGAC
GGCTCGATTC TCGCCGTCGA ACAGTACTAC GTGCACGCGG CGTTCGACAA TCTGACGCCG
GGCCGGGAAT ACTACTACCG TTTCCGGATT CCCGGTGGGG AAGCCACCCC CGTGCAGGCG
GTCCGCACCG GACACCGGAA GGGACGCCAT GGCGGCCCGT TCCGGTTCAC GATGATGGGC
GACCACGGTT CGAACACCAC CCCGCCGGGC GATCCGAAAG GTGTGTTCGA TGACAATTAC
TACAAGGCCG ACAACGCCCC GATGGTCGCC CACGCGTCGG CTGTCACCGC GGCCATCGCC
CGCCAGGACC CGGTGTTCCA CCTGCTCGCC GGGGACATCA GCTACGCGGA CCCATCGGGT
CAGGGAAGGC CGCCCAAGCG GACGGCCGCC GGCGTATCTC CTACCGGTTT CGACAATTAT
GACCCGACCG TCTGGGACGT CTACCTGGCC GACATCGAGG TCAGCAGTGC CCGCACGCCG
TGGATGTTCG CCACCGGCAA CCATGACATG GAGGCGCTCT ACTCCCCGCA CGGCTACGGC
GGCCACCTCG CCCGCCTGGA CCTGCCAGGC GGGGGGCCTC AGGGCTGCCC GAGCGTGTAC
TCCTTCGTCC ACGGCAACGT GGCCGTGCTC TCGCTGGACG CCAACGACCT CAGTTATGAG
ATCAAGGCGA ACACCGGATA TTCGAACGGT GCGCAGACGT CCTGGGTCGA GAGCACGCTG
AAGGGCTACC GGAACGACCC GGATATCGAT TTCATCGTCT GTTTCTTCCA CCACTGCGCC
TATTCGACGA CCGCGCAGCA CGCCAGCGAC GGCGGCGTCC GCGGCGCCTG GGCGCCGTTG
TTCGACTGTT ACCAGGTCGA CCTCGTGCTG CAGGGTCACA ACCACCTGTA CGAGCGCACC
GACCCGATCC GCGCGAACGC ACCGACCCGC GAGGCGCCCG ACGGCTCGAC GATCGAACCC
GCGAAGGACG GCACCACCTA CATCGTCGCG GGCAGCGCGG GACGGCCCCG CTACCAGTTC
CAGACCGGTG AGCCGGAGTC CTACCGAGGA AAAATCGTAC CGGGCTCGAA CGTTGTGCCG
AACAGCTATG TCTGGCAGGC CGACGGGACG AAGGCACCGG AGTCCATCAA CTGGTCGCGG
ACCCGTTTCG ACGACTACGC CTTCGTGGCT GCCGAGTCCG ATCCGGGCCG GCCCGGCGGG
TACTCCACCC TGACCATCCG GGGCCTGGAC GAGCATGGTC AGGAGTTCGA CCGGGTGGTC
CTGCGCCGCG CCGTTGCGCA CGGCCACTGA
 
Protein sequence
MAEDMTIAEQ YEWLQTFLRR RPVSRRGILL GGLAGAGAAS LFGTPFGRAA YAATNDSPLL 
VGGRHTAFGA DPSRQLRLAA QLSRNPGRGK VLVDIGPGRG LGGTVEAEIR PLLSQVPQAD
GSILAVEQYY VHAAFDNLTP GREYYYRFRI PGGEATPVQA VRTGHRKGRH GGPFRFTMMG
DHGSNTTPPG DPKGVFDDNY YKADNAPMVA HASAVTAAIA RQDPVFHLLA GDISYADPSG
QGRPPKRTAA GVSPTGFDNY DPTVWDVYLA DIEVSSARTP WMFATGNHDM EALYSPHGYG
GHLARLDLPG GGPQGCPSVY SFVHGNVAVL SLDANDLSYE IKANTGYSNG AQTSWVESTL
KGYRNDPDID FIVCFFHHCA YSTTAQHASD GGVRGAWAPL FDCYQVDLVL QGHNHLYERT
DPIRANAPTR EAPDGSTIEP AKDGTTYIVA GSAGRPRYQF QTGEPESYRG KIVPGSNVVP
NSYVWQADGT KAPESINWSR TRFDDYAFVA AESDPGRPGG YSTLTIRGLD EHGQEFDRVV
LRRAVAHGH