Gene Franean1_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0189 
Symbol 
ID5668614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp231593 
End bp233140 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content73% 
IMG OID641239118 
Productmetallophosphoesterase 
Protein accessionYP_001504562 
Protein GI158312054 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.326449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCACA GCCTTCGCGG TCCACGACGT TCCCGGTCAC GATCCTCCGA GTCGCCGGCC 
TCCGCACCAC GCGACGCCGC ACCACGCGAC GCCCCGCCGC GGCCCGGCGG AGGGCCGTCG
CACGCCGAGC ACGGGGTGCA CCTCGCCTTC GGCGCGGATC CGGCGACGTC GATGGTGGTC
TCCTGGATCA CCCGGGAGCC CGTCGTCCGG CCGCTGGCCC GGGTGGTCAC GGGCACCGCC
GAGGCAGTCC GCGAGGTCGA GGCCGGCACC AGGTCGTACA CGGACGCGGC CACCGGGTGG
GAGATCTACG CGCACCACGC GCTGCTGGAC GAGCTGGCGC CGGACACCGA GTACACCTAC
GAGATCACGT ACCAGACCAC GGCGGCCGGG GTCGTCCGCG AGGTGGGCCG GGCGTCGTTC
CGGACGGCCC CCCGCGGCCG GGCCGCCTTC ACCTTCGCCT GCTTCGGCGA TCACGGCACC
GACGCGTCCG ACAACCCGTT CGGCACGCCG GCCTCCGGCG CGCTCGTCGC CGGCGTCGAG
CGGGTGGACC CGCTGTTCAC CCTGGTCGAC GGCGATCTGG CCTATTCGAA CGTCAGCGAC
GTCCCGCCGC GGGCCTGGGC GGACTGGTTC GCGATGATCA GCACCTCGGC CGCGCGCCGC
CCGTGGATGC CGAGTGTCGG CAATCACGAG ACCGAGCGGG GAAACGGAGC GCTGGGCCTC
GCCGCCTACC AGACCTACTT CCAGCCGCCG GACAACGGTG AGGAGCCTTA CCTGGCCGGC
CTCTGGTACG CCTTCACAGT GGGTGGCGTA CGGTTCGTCG TGCTCAGCGG CGACGACGTC
TGCTACCAGG ACGCCGGCCG CGTCTACCTG CACGGCTACA GCTCGGGTCG GCAGACCGCC
TGGCTCGAGC GGCAGTTGGC CGAGGCCCGG GCGGACCAGG CGGTCGACTG GATCATCGTG
GCCCTGCACC AGGCAGCAGT CTCCACAGCG GAGTTCCACA ACGGCGCGGA CCTCGGCCTG
CGCGAGGCCT GGCTGCCGTT GTTCGACCAG TACGGCGTCG ACCTGGTGAT CTCCGGGCAC
GAACACCACT ACGAGCGCAC ACACCCGCTA CGGGGGGTTG TGGACGGCAG CACGACGCTG
ACCCCGCGGC CGGTCCCGGG CTCGGTGTCC GTCGCGGGGG GCGGCGGGGG CGGTACTGCC
ACGCTCGACA CGTCCGCCGG GACGGTGCAC ATGCTGATCG GCACCGGCGG CTCGTCCACG
CCGTCGGCCG GGCAGCTGTT CGACCCGCCG GCCTGCCGGG TGGTCGTCGG GGTGCGGGAG
CGGGAGCCCG GGCAGCGGCA GCGCTCCTCG ATCCGTGCGG TCGAGCCGGC TCCGTGGCTG
GCGGCCCGCT TCCCCGAGCA TCCGTACGCG TTCGCCGCGC TCACGGTCGA TCCGGGCGAG
CCGGGCGGGA CGACCCGCAT CCAGGTCACC GTCTACGACT CGGCGGACGC CGTGCCCGTG
CCCTTCGACA CCTTCACCCT CGCCCGCCCG CGCGCCGACG CGACCTGA
 
Protein sequence
MPHSLRGPRR SRSRSSESPA SAPRDAAPRD APPRPGGGPS HAEHGVHLAF GADPATSMVV 
SWITREPVVR PLARVVTGTA EAVREVEAGT RSYTDAATGW EIYAHHALLD ELAPDTEYTY
EITYQTTAAG VVREVGRASF RTAPRGRAAF TFACFGDHGT DASDNPFGTP ASGALVAGVE
RVDPLFTLVD GDLAYSNVSD VPPRAWADWF AMISTSAARR PWMPSVGNHE TERGNGALGL
AAYQTYFQPP DNGEEPYLAG LWYAFTVGGV RFVVLSGDDV CYQDAGRVYL HGYSSGRQTA
WLERQLAEAR ADQAVDWIIV ALHQAAVSTA EFHNGADLGL REAWLPLFDQ YGVDLVISGH
EHHYERTHPL RGVVDGSTTL TPRPVPGSVS VAGGGGGGTA TLDTSAGTVH MLIGTGGSST
PSAGQLFDPP ACRVVVGVRE REPGQRQRSS IRAVEPAPWL AARFPEHPYA FAALTVDPGE
PGGTTRIQVT VYDSADAVPV PFDTFTLARP RADAT