Gene Franean1_5201 details


Gene Information

Locus tag: Franean1_5201
Symbol:
ID: 5673535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6243695
End bp: 6244609
Gene Length: 915 bp
Protein Length: 304 aa
Translation table: 11
GC content: 76%
IMG OID: 641244055
Product: inositol-phosphate phosphatase
Protein accession: YP_001509465
Protein GI: 158316957
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family
TIGRFAM ID:
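
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6243695
end_bp = 6244609

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both end points.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 915 bp
print(protein_length, "aa")   # 304 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.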


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.289429
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG TGGACGCGCC CCCCAGCACC CTCCGCCCGG CCCCGGCCGC CCTGCTCGAC 
CTCGGCCTGG ACGTCGCCCG CGAGGCCGGG GCCCTGCTCG TCACCGGCCG CGCCGGCACG
GTCGCCGCCG AGGCGACGAA ATCCTCGCCG ACCGACGTCG TCACCGCGCT GGACCGGGCG
TCGGAGGCCC TCGTCGCCCG CCGCCTGCGC GAAGCCCGCC CGGACGACGG CCTGCTCGGC
GAGGAGGGCT CCGACACAGC GGGCACCAGC GGCGTCCGCT GGATCGTCGA CCCCCTCGAC
GGGACGGTCA ACTTCCTCTA CCGCCTGCCC AACTGGGCGG TGTCGATCGC GGCCGAGCTG
GACGGCGAGA TCGTGGCGGG CGTGGTGCAC GCGCCCGCGA TGGGAGTCAC ATACACCGCT
GTCCGCGGCG GCGGCGCCTT CCGCTGGGAG ACACCGGTGG GCACGGATCG CGGGGGCGAC
CAGGGCGCTG GCACCGGCGC GGGCGCGCCG CTCGGGCCGA CGGCCGGGGA GCCGACGAAG
CTGACCGGTT CGGCGGTGAC CGAGCTGGGC GGCGCACTCG TCGCCACCGG CTTCGGATAC
ACCGAGCGCC GCCGGACGAC CCAGGCCGCG GTGCTGACCC GGGTCGTGCC CAGGGTCCGC
GACATCCGCC GGATGGGCGC GGCCTCCCTC GACCTCTGCG CCGCCGCGGC GGGCATCGTC
GACGCCTACT ACGAACGCGG ACTACACCCC TGGGACCACG CGGCGGGCGC ACTGATCGCC
GCCGAGGCGG GCCTGCGGGT CGGCGGCCTG GACGGCCGGG AAGTCAGCGA GGACCTCGTC
ATAGCCGCTC CCCCCTCCCT GTTCGCCAAC CTCACCGCCC TGCTGGCCGA ACACCCCCGC
GCCGACACCG ACTAG
 
Protein sequence
MSTVDAPPST LRPAPAALLD LGLDVAREAG ALLVTGRAGT VAAEATKSSP TDVVTALDRA 
SEALVARRLR EARPDDGLLG EEGSDTAGTS GVRWIVDPLD GTVNFLYRLP NWAVSIAAEL
DGEIVAGVVH APAMGVTYTA VRGGGAFRWE TPVGTDRGGD QGAGTGAGAP LGPTAGEPTK
LTGSAVTELG GALVATGFGY TERRRTTQAA VLTRVVPRVR DIRRMGAASL DLCAAAAGIV
DAYYERGLHP WDHAAGALIA AEAGLRVGGL DGREVSEDLV IAAPPSLFAN LTALLAEHPR
ADTD
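
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built by hand from the standard amino-acid assignments, which translation table 11 shares with the standard code for non-start codons. Only the first line of the gene sequence is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases on the page."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First line of the gene sequence above.
first_line = "ATGAGCACCG TGGACGCGCC CCCCAGCACC CTCCGCCCGG CCCCGGCCGC CCTGCTCGAC"
print(translate(first_line))  # MSTVDAPPSTLRPAPAALLD
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison against the listed protein sequence and GC content.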