Gene Franean1_3166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3166 
Symbol 
ID5671543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3731508 
End bp3732956 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content65% 
IMG OID641242061 
Productsulfatase 
Protein accessionYP_001507481 
Protein GI158314973 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATCC GTCCGAATTT TGTGATCTTC ATTCCCGACC AGCTCCGTGC GGACGCGCTC 
GGCGCGTTCG GTAACCCGCA CATCCGCACT CCGAATCTCG ACGCTCTGGC GGCCCGGGGT
ACCCGGTTCA CCAACGCCTA CGTGCAGCAT CCGGTGTGCG TGCCCAGCCG CGCGTCGTTC
CTCACCGGCT GGTACCCCCA TACCTCAGGT CATCGCAGCC AGAACCATCT GCTGCGGCCG
CACGAGCCAA ACCTGCTGCG CATCCTGAAG GACGCCGGCT ACCACGTCAC CTGGGCCGGG
CGCCGCGGTG ATACGTTCGC CCCGGGGGTA ACCGAGACCA GCGTGCACGA GTACGGCTAC
ACCGAGCCGC CCCCGGCCAG CGCCTACCGG CCCGAGTTGG CAACCTGGCC CGGCGGGGAC
CTGTGGGCAC GCCTGTTCTA CTTCGGCCGG ACAGCAGGCA ACCTCGACCA GGACGAAGCC
ACGATCCGTA CTGCCGAGCA GAGGTTAGCC GCAATCCCGG ACGCGCCGTG GACGCTGTTC
GTTCCTATCA TCGCTCCACA CTGCCCATTT CGCGCGCCCG AGCCATGGTT CTCGATGTAC
GACCGCGACA CGATGCCCGA CCCGATTCCG CCAGGCGAAA TCGAACCCCG GTACGTTCCG
GCGCTCCGCA ACCTTCACAG ACTGGAACGG GTAGCCCCGG AAATATGGCG CGAAGTCATC
GCCACTTATT ACGCCATGGT ATCTCGCATG GACGACCATC TCGGGCGGGT CTTGTCTGCT
GTTGAGCGGA CCGGGCAGGC CGGAAACACA GTCACGATGT TCTTCGCTGA CCACGGCGAG
TACCTCGGAG ACTTCGGGCT GATCGAGAAG TGGCCCTCGG CAATGCACCC CTGCATCACC
CGCGACCCTC TCGTCATCGC CGGTGGAGGG CTCCCTGAGG GCCAGGTCTA CGACGGCATG
GTGGAACTCG TCGACGTCCT GCCTACTGTG CTCGAACTGG CCGGCGTACC CGCACCACAC
CGGCACTTCG GGCGCAGCCT GCTAACCGTC TTGCATGACC CCGGCTCCGA GCACCGCGAG
TACGCATTCA CCGAGGGGGG CTTCACCGTC GAGGAGGAGT CGCAGCTGGA GGACTCGCCC
TTCCCCTACG ACCTGAAGAC CGCATTGCAA CACCATCAAC CCGACCTAGT CGGCAAGGCC
ACCGCGATAC GTGACCGGGA GTGGACCTAT GTGTGGCGGC TGTACGATCC CCCGGAGCTC
TACCACCGGG TTACCGATCC CGACGAACGA CACAATATCG CCGGACGCTC AGAACATTCC
GAAGTGGAGC GCCGCTTAAG CCAGGCACTG CTGCGCTGGC TGATGACCAC CACCGACATC
ATTCCCACCG ATTCTGACCC ACGCATGCCC GACGTCGACC TGCCCACCCC ACAACCCGTC
AGCCCGTAG
 
Protein sequence
MDIRPNFVIF IPDQLRADAL GAFGNPHIRT PNLDALAARG TRFTNAYVQH PVCVPSRASF 
LTGWYPHTSG HRSQNHLLRP HEPNLLRILK DAGYHVTWAG RRGDTFAPGV TETSVHEYGY
TEPPPASAYR PELATWPGGD LWARLFYFGR TAGNLDQDEA TIRTAEQRLA AIPDAPWTLF
VPIIAPHCPF RAPEPWFSMY DRDTMPDPIP PGEIEPRYVP ALRNLHRLER VAPEIWREVI
ATYYAMVSRM DDHLGRVLSA VERTGQAGNT VTMFFADHGE YLGDFGLIEK WPSAMHPCIT
RDPLVIAGGG LPEGQVYDGM VELVDVLPTV LELAGVPAPH RHFGRSLLTV LHDPGSEHRE
YAFTEGGFTV EEESQLEDSP FPYDLKTALQ HHQPDLVGKA TAIRDREWTY VWRLYDPPEL
YHRVTDPDER HNIAGRSEHS EVERRLSQAL LRWLMTTTDI IPTDSDPRMP DVDLPTPQPV
SP