Gene Francci3_3546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3546 
Symbol 
ID3904485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4238578 
End bp4240986 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content69% 
IMG OID637880867 
Productputative phosphoketolase 
Protein accessionYP_482627 
Protein GI86742227 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.252555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.409021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCG TCGTCGACGC CGAGCGGAGC GGGGCGGATC CGGCCGCGGG GCCGTTGTCC 
CCGGCGGAGC TTGACCTCAT CCACGCCTAC TGGCGAGCGG CGAACTATCT GTCCGTCGGA
CAGATCTACC TGCTCGACAA TCCCCTGCTC ACCGAGCCGC TGCGTCCGGA GCACGTCAAA
CCGCGCCTGC TCGGGCACTG GGGCACGACT CCCGGCCTGA ACCTCATCTA CGCCCATCTG
AACCGCGTGA TCCGGGCAAA GGACCTGAAC GCCATCTACA TCATCGGTCC GGGTCACGGC
GGTCCCGGCA TCGTGGCGAA CGCCTACCTG GAAGGCACCT ACAGCGAGGT CTACCCCTCG
ATCGGCCGGG ATGTGGCGGG GATGCGGCGG CTGTTCCGGC AGTTTTCCTT CCCCGGCGGA
ATTCCCAGCC ACGTCGCGCC GGAGACTCCC GGGTCCATTC ATGAGGGCGG CGAGCTCGGT
TACGCGCTGA CCCACGCCTA CGGCGCGGCC TTCGACAATC CCGACCTGGT GGTCTGCGCC
GTCGTGGGGG ACGGGGAGGC CGAGACGGGC CCGCTCGCGG CGAGCTGGCA CTCCAACAAG
TTCCTCGACC CGGGTCGCGA CGGCGCGGTG CTGCCGGTCC TGCACCTCAA CGGCTACAAG
ATCGCCAATC CGACGATCCT GGCCCGCATC CCCGAGCAGG AGCTGACCGC GCTCCTGCGT
GGCTACGGGT ACCATCCGTA CTTCGTGGTC GGGGACGACC CGGCCGCGGT GCACCAGCGG
ATGGCGGGCG TCCTGGACGC CGCCGTCGGC GAGATCCACG AGATCCAGGA CCGGGCCCGC
CGCCACGGCC GCACCGAGCG CCCCACCTGG CCGATGATCG TGCTGCGGAC CCCCAAGGGG
TGGACCGGGC CGGCCACTGT TGACGGCCTG CCGGTCGAGG GCACGTGGCG GTCCCACCAG
GTGCCCGTCG CGCAGTTGCA CGACAACCCC GGGCACTTGG CCCAGCTGGA GGCGTGGCTG
CGCTCCTACC GGCCGGAGGA GCTCTTCGAT GCCGGCGGCG CCCCGCGGGC CGAACTCGCC
GCGCTCGCCC CCCGCGGTCC CCGGCGGATG AGTGCGAACC CGCATGCCAA CGGGGGCGAG
CTGCTGCGTG ATCTGACTCT TCCCGACTTC CGTGACTACG CGGTCGACGT CACCGCCCCC
GGGACCACCC TCGCCGAGCC GACCCGGATC CTGGGCGCGT TCCTACGGGA CGTGGTGGCC
GCCAATCCGA GCACCTTCCG GGTGATGGGC CCGGATGAGA CCGCCTCCAA TCGGCTGGGT
GCGCTCTTCG AGGTGACCGA CCGCGTCTGG AACGCCGGGA CGGAACCGAC GGACGAGCAC
CTGGCCGTGG ACGGTCGGGT CATGGAGGTG CTTTCCGAGC ATCTGTGCCA GGGCTGGCTG
GAGGGCTATC TGCTCACCGG CCGGCACGGG TTCTTCTCCT GCTACGAGGC GTTCATCCAC
ATCGTCGACT CGATGGTCAA TCAGCACGCG AAATGGCTGA AGACCACCCG GCACATCCCG
TGGCGGCGCC CGATCGCATC GTTGAACTAC CTGTTGACCA GCCACGTGTG GCGGCAGGAC
CACAACGGCT TCTCCCATCA GGACCCCGGA TTCATCGACC ACGTCGTGAA CAAGAAGGCC
GAGGTCGTCC GGGTGTACCT GCCCCCGGAC GCGAACACCC TGCTCTCCGT CGGCAACCAC
TGCCTGCGCA GCCGGCACTA CATCAACGTG ATCGTGGCCG GCAAGCAGCC GGCGTTGAAC
TACCTGTCGA TGCCGGCGGC GATCGCCCAC TGCACCCGTG GCCTGGGCAT CTGGGACTGG
GCCAGCAACG ACCACGACGC GCCCGGCGAC GGGCTGCCGG ATGTGGTGAT GGCCTGTGCC
GGGGACATCC CGACCCTGGA GACCCTCGCC GCGGTGGACC TCCTGCGCCA GCACTTCCCC
GATCTGAAGG TGCGCGTCGT CAACGTCGTC GACCTGATGT CCCTGCAGGA CGAGGCGGAA
CATCCGCACG GCATCTCGCA CGCCCGCTTC GACGCGCTGT TCACCACCGA CCGGCCGATC
ATCTTCGCCT ACCACGGCTA TCCCTGGCTG ATCCACCGGC TGACCTACCG GCGGCACAAC
CATCACAACC TGCATGTCCG TGGCTACATC GAGGAGGGGA CGACCACCAC CCCGTTCGAC
ATGGTCATGC TGAACAACCT CGACCGGTTC CACCTGGTGG CGGACGTCAT CGACCGGGTA
CCGAGCCTGG GCGCCCGCGC CGCGCACGTC CGGCAGCAGA TGATCGACGC GCGGCTCGCG
GCCCGGACCT ACACCCGGGA CCACGGCGAG GACGCCCCGG AGATCCGGAA CTGGACTTGG
CCCTACTGA
 
Protein sequence
MTSVVDAERS GADPAAGPLS PAELDLIHAY WRAANYLSVG QIYLLDNPLL TEPLRPEHVK 
PRLLGHWGTT PGLNLIYAHL NRVIRAKDLN AIYIIGPGHG GPGIVANAYL EGTYSEVYPS
IGRDVAGMRR LFRQFSFPGG IPSHVAPETP GSIHEGGELG YALTHAYGAA FDNPDLVVCA
VVGDGEAETG PLAASWHSNK FLDPGRDGAV LPVLHLNGYK IANPTILARI PEQELTALLR
GYGYHPYFVV GDDPAAVHQR MAGVLDAAVG EIHEIQDRAR RHGRTERPTW PMIVLRTPKG
WTGPATVDGL PVEGTWRSHQ VPVAQLHDNP GHLAQLEAWL RSYRPEELFD AGGAPRAELA
ALAPRGPRRM SANPHANGGE LLRDLTLPDF RDYAVDVTAP GTTLAEPTRI LGAFLRDVVA
ANPSTFRVMG PDETASNRLG ALFEVTDRVW NAGTEPTDEH LAVDGRVMEV LSEHLCQGWL
EGYLLTGRHG FFSCYEAFIH IVDSMVNQHA KWLKTTRHIP WRRPIASLNY LLTSHVWRQD
HNGFSHQDPG FIDHVVNKKA EVVRVYLPPD ANTLLSVGNH CLRSRHYINV IVAGKQPALN
YLSMPAAIAH CTRGLGIWDW ASNDHDAPGD GLPDVVMACA GDIPTLETLA AVDLLRQHFP
DLKVRVVNVV DLMSLQDEAE HPHGISHARF DALFTTDRPI IFAYHGYPWL IHRLTYRRHN
HHNLHVRGYI EEGTTTTPFD MVMLNNLDRF HLVADVIDRV PSLGARAAHV RQQMIDARLA
ARTYTRDHGE DAPEIRNWTW PY