Gene Franean1_4801 details


Gene Information

Locus tag: Franean1_4801
Symbol:
ID: 5673142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5731955
End bp: 5733463
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 71%
IMG OID: 641243657
Product: alkaline phosphatase
Protein accession: YP_001509073
Protein GI: 158316565
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3540] Phosphodiesterase/alkaline phosphatase D
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
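
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5731955
end_bp = 5733463

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in consecutive codons of three bases.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = codons - 1

print(gene_length, "bp")     # matches the listed Gene Length
print(protein_length, "aa")  # matches the listed Protein Length
```

Under these assumptions the computed values are 1509 bp and 502 aa, in agreement with the record.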


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.447394
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGG TGAGCCGACG GTCCGTCGTT CTCGGCGGAA TTGCGGGCGT GGGAACGGTG 
CTGGCGGGCG CCGCGGCGGC CCGTGCCGCG TCCTATCCCT TCACGCTGGG CGTTGCCTCG
GGCGAGCCCA GCGCGGACGG ATTCGTGCTG TGGACCCGCC TCGCGCCCAG TCCCCTCGCC
GCGGACGGGC TCGGCGGCAT GTCGAGCGGC GCCGTCACCG TGGAGTGGCA GGTCGCCACC
GACCAGTACT TCACCCAGAT CGCCGCCAGC GGCTCGGTCT CCGCCGTCCA GGCCTGGGCG
CACAGCGTGC ATGTCGAGGT CGGCGGCCTG CAGCCCAACC GGGAGTACTG GTACCGCTTC
CGCGCCTCCG GCCAGATCTC GCCGGTCGGC CGGGCCCGGA CCGCCCCGGC CGTCGGCTCC
AGCCCCGTCC TGAAGATGCT GTTCACCTCG TGCTCGCACT ACGAGGCCGG CTACTTCACC
GCCTACCGCC GGATGGCCGA GGAGAACCCG GACCTCATCC TGCACCTCGG GGACTACATC
TACGAGGGCG GGGCCGGGTC CGGCGTGCGC TCGCACGTGC CCAGCGCCGA GATCAGCTCG
CTGGCCGACT ACCGCGTCCG GCACGCTCTC TACAAATCCG ACGCCGACCT GCAGGCCGCG
CACGCCGCCG CGCCGTGGAT ACCGGTCTGG GACGACCACG AGGTCGAGAA CAACTACGCC
GACCTCGTCC GCAACGACAC CAGTCCGGCC GGCGACTTCA CCGCCCGCCG GGCGGCCGCC
TACAAGGCGT ACTACGAGCA CATGCCGCTG CGGTCGGCGC AGGTTCCCGT CAACGAGAAC
CTGCAGCTCT ACCGGCGCCT GCGCTGGGGC AGCCTGGCCA CCTTCCACAT GCTCGACACC
CGGCAGCACC GGGACGACCA GGCGTGCGGT GACGGCACGA AGGTCTGCGC CGCGGCCGAC
GACCCGGCAC GCACGCTGAC CGGGGCGACG CAGGAGGCCT GGCTGCTCGA CGGCCTCGGC
CAGCGCCTGG GTACCTGGGA CATCATCGGC CAGCAGGTGT TCTTCGCCCA GCGCCTCGCC
GCCTCCGACG GCTCGAAAAG CATGGACGCC TGGGACGGTT ACACCGCCAA CCGCGGCCGG
ATCCAGGCGG GCTGGCAGGC CAGCGGCAAC ACCAGCACGG TCGTGCTCAC CGGAGACGTC
CACCAGCACT GGGCGGCCGA CATCATGGAC AACTACGCGA CCCAGAACAA GGTGATCGGC
ACCGAGCTGG TGTCCACCTC GATCACCTCA GGCGGGGACG GCGCCGGTGC CGGGACCGGC
CTGTCCAGCC TCAACCCGCA TGTGAAGTTC AACTGGAACC GGCGCGGCTA CGTCCGCACC
GTCACCACAC CCACCCAGAT GACGGTGGAC TTCCGCGCGC TCAACCAGGT CACGGTCCGT
GGCAGCGCGG CCACCACCGT GCAGAGCTAC GTGATCGAGG CCGGCAACCC CGGTCTCCAG
ACGGTGTGA
 
Protein sequence
MNQVSRRSVV LGGIAGVGTV LAGAAAARAA SYPFTLGVAS GEPSADGFVL WTRLAPSPLA 
ADGLGGMSSG AVTVEWQVAT DQYFTQIAAS GSVSAVQAWA HSVHVEVGGL QPNREYWYRF
RASGQISPVG RARTAPAVGS SPVLKMLFTS CSHYEAGYFT AYRRMAEENP DLILHLGDYI
YEGGAGSGVR SHVPSAEISS LADYRVRHAL YKSDADLQAA HAAAPWIPVW DDHEVENNYA
DLVRNDTSPA GDFTARRAAA YKAYYEHMPL RSAQVPVNEN LQLYRRLRWG SLATFHMLDT
RQHRDDQACG DGTKVCAAAD DPARTLTGAT QEAWLLDGLG QRLGTWDIIG QQVFFAQRLA
ASDGSKSMDA WDGYTANRGR IQAGWQASGN TSTVVLTGDV HQHWAADIMD NYATQNKVIG
TELVSTSITS GGDGAGAGTG LSSLNPHVKF NWNRRGYVRT VTTPTQMTVD FRALNQVTVR
GSAATTVQSY VIEAGNPGLQ TV
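
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both steps, not the method used by the database. It assumes the sequence is pasted as shown here (blocks of ten bases separated by whitespace) and relies on the fact that table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in this record.
first_line = (
    "ATGAACCAGG TGAGCCGACG GTCCGTCGTT CTCGGCGGAA TTGCGGGCGT GGGAACGGTG"
)
print(translate(first_line))  # first 20 residues of the protein
```

Translating the first displayed line yields MNQVSRRSVVLGGIAGVGTV, the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the listed GC content of 71%.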