Gene Francci3_3773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3773 
Symbol 
ID3906057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4521983 
End bp4522933 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content71% 
IMG OID637881099 
Producthistidinol-phosphate phosphatase 
Protein accessionYP_482853 
Protein GI86742453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family 
TIGRFAM ID[TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0738114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGCAA TAGATCGGGA GATTGGTCAC ACTAACCCCG CCACCTGGTG TCCGGCGCCC 
GGGACCACCG ACGCGCGGTG GGGCTCGGCT CGTAGTCTCG CAGGCGTGAC CCCGCCTAAC
CCCGCATCCG TGAGCATCGA ACCGACCGCG GACTCCGACA CCGTCGCCAT CGCCGACGAT
CTGGCCCTCG CGCTCAGCCT CGCGGACGCC GCCGACCGGA TCACTCTTTC CCGATTCCAG
GCAGTAGACC TGCACGTCGA GTCGAAGCCG GACAACACCC CGGTCTCCGA CGCCGACACC
GCCGTCGAGT CCATGATCCG CAAACGGCTC GCCGTGGCCC GGCCGGGCGA CGCCGTGCTC
GGCGAGGAGG AGGGTCTCGT CGGATCGGGC GCCCGCCGGC GCTGGATCCT GGACCCCGTT
GACGGCACCA AGAACTTCGT GCGCGGCGTG CCCGTCTGGG GCACCCTGCT CGGACTCGAG
GTGGACGGCG AGATGGTCGT CGGGGTCGCG AGCGCCCCGG CGATGAGCCG ACGGTGGTGG
GGAGCCCGGG GCACCGGCGC CTTCACTCGC GACGCCACCG GCGACACCCG TGCCCTGAAG
GTCTCCTCGG TCACCAGCCT GGGTGACGCC TTCCTGTCCT TCGCCTCCGT CGAGGGCTGG
CAGACCGCCG ACCGGCTGGA GGAGTTCCTC CACCTCGCCG GCCAGGTCTG GCGGACCCGC
GCCTACGGCG ACTTCTGGTC GCACATGATG GTCGCCGAGG GCGCCGTGGA CCTCGCCTGC
GAGCCCGAGG TGTCGCTCTG GGACATGGCC GCGCTCCAGG TCATCGTCGA GGAGGCCGGT
GGCCGGTTCA CCGACCTGGG CGGGCGCCGG GGGCCGGGAC ACGGCACGGT GCTCACCACC
AACGGGCATC TCCACAACGT CGCGCTGCAG TCGTTCAACG GCTCTTCATG A
 
Protein sequence
MIAIDREIGH TNPATWCPAP GTTDARWGSA RSLAGVTPPN PASVSIEPTA DSDTVAIADD 
LALALSLADA ADRITLSRFQ AVDLHVESKP DNTPVSDADT AVESMIRKRL AVARPGDAVL
GEEEGLVGSG ARRRWILDPV DGTKNFVRGV PVWGTLLGLE VDGEMVVGVA SAPAMSRRWW
GARGTGAFTR DATGDTRALK VSSVTSLGDA FLSFASVEGW QTADRLEEFL HLAGQVWRTR
AYGDFWSHMM VAEGAVDLAC EPEVSLWDMA ALQVIVEEAG GRFTDLGGRR GPGHGTVLTT
NGHLHNVALQ SFNGSS