Gene Franean1_4926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4926 
Symbol 
ID5673266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5914807 
End bp5916411 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content71% 
IMG OID641243781 
Productsulfatase 
Protein accessionYP_001509197 
Protein GI158316689 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGAC GACATCGGCC GCTCGGAACA CGCGCCCGGA TCGGGACGGC CAGCCTCCTG 
GTCGCGGCCC TCGGCGCGGC GGTGTTCGCC GCCGCGGGCG GCCCGGAGCC ATCGACGCCG
CCCGGGCTGT CGGCCACGCC GTTGGCAGCC GACACGCAGC GGCCCAACTT CGTCTTCATC
CCCGCCGACG ACCTCGACGC GACCACCTCG CCCTACTGGG AGGCGATGCC GAGGACGGCC
GCGCTCATCC GCGACGCCGG CCTGACCTTC ACCGAGAGCT TCGCGCCCAC CCCGATCTGC
TGCCCGGCCC GCGGGTCGCT GCTCACCGGA AAGTACGGGC ACAACACCGG GGTGCTCACC
AACAGCGGCG ACGAGGGCGG CTGGGCGACG TTCGCGGCGA ACGGCAACGA GGAGCGCACC
TTCGCGAAGT ACCTGCAGGA CAGCGGCTAC AACACCGCGC TCGTCGGCAA GTACATGAAC
GGCATCGAGG ACGCGCCCGA CCACGTTCCT CCGGGCTGGA CGGAGTGGTA CGGCAGCGTC
GACAACTTCT TCTACACCGG CTACAACTAC GCGCTGAACG AGAACGGGAC GATCGTGCAC
TACGGCGGCC CGTCCGATCC CGCGAACTAC TCCACCGACG TCGTCGCCGC GAAGTCGGTG
GACTTCCTCG AGCGGGCGGC GGCGAAGGAC GAGCCGTTCA TGCTCTACAC CGCCTCGACC
GCCCCGCACC TGCCGCTGCC GCCAGCGCCG CGCGACAGCA ACAATCCGTT CACGGACGAT
CTCGCGCCAC GCTCGCCCAA CTACCAGGAG CCGGACGTCA GCGACAAGCC CGCGTGGCTG
CGGACGAGCG CCGGGGTCCG CAGCGCCCAG GTGAACCTGA TCAACGACAA CGACTACCGG
AACAGGATGG GATCGCTCCT CGCGCTCGAC GACATGGTCG GCGACATCGT CACGACGTTG
CGCGACACCG GCGAGCTCGA CCACACCTAC CTGGTCTTCA CCTCGGACAA CGGCTACAAC
CTCGGCGCGC ACCGGCTGAT CCACAAGATG GCGCCGTACG AGGAGTCGCT GCGGGTCCCG
CTGGTCGTCG CCGGGCCTGG GGTGACCAGG GGAACCGACG ACCACATGGT CGCCGCGATC
GACATCGCGC CGACGTTCCT GGAGCTGGCC GGGGTGCCCG TCCCGGCGGA CGTCGACGGC
ATGTCACTCG CGCCGCTGCT GCGCGGACAG GACCCGGCGC AGTGGCGCTC GGACCTGCTG
GGCCAGTACG CCGGCCCGGG CGGCCAGGGT GACGACGGCA TCGCCGCCGA GCAGGTGCCC
GGCCAGCCGA TCGTGGCGGC CGCCACCGAC CCGGTCGCCC ACTACCTGGA CATCCCAGCC
TGGAGCGGGC TGCGCACCGA CCGGTACACG TATGTGCGCT GGTACGACAC GGACCGGACC
GTGGTCCACG AGCGCGAGCT CTACGACCTG TCCAACGATC CTTACGAGCT CACGAACCTG
CTGGCGACCC CGGCGGGACG GGCGGCGAAC GCCGAGCTCG TCGCACGCCT CGACAGCCGT
CTGGACACGC TCGCCGCATG CGCCGGAGCG ACCTGCCGGA CGTAA
 
Protein sequence
MPGRHRPLGT RARIGTASLL VAALGAAVFA AAGGPEPSTP PGLSATPLAA DTQRPNFVFI 
PADDLDATTS PYWEAMPRTA ALIRDAGLTF TESFAPTPIC CPARGSLLTG KYGHNTGVLT
NSGDEGGWAT FAANGNEERT FAKYLQDSGY NTALVGKYMN GIEDAPDHVP PGWTEWYGSV
DNFFYTGYNY ALNENGTIVH YGGPSDPANY STDVVAAKSV DFLERAAAKD EPFMLYTAST
APHLPLPPAP RDSNNPFTDD LAPRSPNYQE PDVSDKPAWL RTSAGVRSAQ VNLINDNDYR
NRMGSLLALD DMVGDIVTTL RDTGELDHTY LVFTSDNGYN LGAHRLIHKM APYEESLRVP
LVVAGPGVTR GTDDHMVAAI DIAPTFLELA GVPVPADVDG MSLAPLLRGQ DPAQWRSDLL
GQYAGPGGQG DDGIAAEQVP GQPIVAAATD PVAHYLDIPA WSGLRTDRYT YVRWYDTDRT
VVHERELYDL SNDPYELTNL LATPAGRAAN AELVARLDSR LDTLAACAGA TCRT