Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3166 |
Symbol | |
ID | 5671543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3731508 |
End bp | 3732956 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242061 |
Product | sulfatase |
Protein accession | YP_001507481 |
Protein GI | 158314973 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATATCC GTCCGAATTT TGTGATCTTC ATTCCCGACC AGCTCCGTGC GGACGCGCTC GGCGCGTTCG GTAACCCGCA CATCCGCACT CCGAATCTCG ACGCTCTGGC GGCCCGGGGT ACCCGGTTCA CCAACGCCTA CGTGCAGCAT CCGGTGTGCG TGCCCAGCCG CGCGTCGTTC CTCACCGGCT GGTACCCCCA TACCTCAGGT CATCGCAGCC AGAACCATCT GCTGCGGCCG CACGAGCCAA ACCTGCTGCG CATCCTGAAG GACGCCGGCT ACCACGTCAC CTGGGCCGGG CGCCGCGGTG ATACGTTCGC CCCGGGGGTA ACCGAGACCA GCGTGCACGA GTACGGCTAC ACCGAGCCGC CCCCGGCCAG CGCCTACCGG CCCGAGTTGG CAACCTGGCC CGGCGGGGAC CTGTGGGCAC GCCTGTTCTA CTTCGGCCGG ACAGCAGGCA ACCTCGACCA GGACGAAGCC ACGATCCGTA CTGCCGAGCA GAGGTTAGCC GCAATCCCGG ACGCGCCGTG GACGCTGTTC GTTCCTATCA TCGCTCCACA CTGCCCATTT CGCGCGCCCG AGCCATGGTT CTCGATGTAC GACCGCGACA CGATGCCCGA CCCGATTCCG CCAGGCGAAA TCGAACCCCG GTACGTTCCG GCGCTCCGCA ACCTTCACAG ACTGGAACGG GTAGCCCCGG AAATATGGCG CGAAGTCATC GCCACTTATT ACGCCATGGT ATCTCGCATG GACGACCATC TCGGGCGGGT CTTGTCTGCT GTTGAGCGGA CCGGGCAGGC CGGAAACACA GTCACGATGT TCTTCGCTGA CCACGGCGAG TACCTCGGAG ACTTCGGGCT GATCGAGAAG TGGCCCTCGG CAATGCACCC CTGCATCACC CGCGACCCTC TCGTCATCGC CGGTGGAGGG CTCCCTGAGG GCCAGGTCTA CGACGGCATG GTGGAACTCG TCGACGTCCT GCCTACTGTG CTCGAACTGG CCGGCGTACC CGCACCACAC CGGCACTTCG GGCGCAGCCT GCTAACCGTC TTGCATGACC CCGGCTCCGA GCACCGCGAG TACGCATTCA CCGAGGGGGG CTTCACCGTC GAGGAGGAGT CGCAGCTGGA GGACTCGCCC TTCCCCTACG ACCTGAAGAC CGCATTGCAA CACCATCAAC CCGACCTAGT CGGCAAGGCC ACCGCGATAC GTGACCGGGA GTGGACCTAT GTGTGGCGGC TGTACGATCC CCCGGAGCTC TACCACCGGG TTACCGATCC CGACGAACGA CACAATATCG CCGGACGCTC AGAACATTCC GAAGTGGAGC GCCGCTTAAG CCAGGCACTG CTGCGCTGGC TGATGACCAC CACCGACATC ATTCCCACCG ATTCTGACCC ACGCATGCCC GACGTCGACC TGCCCACCCC ACAACCCGTC AGCCCGTAG
|
Protein sequence | MDIRPNFVIF IPDQLRADAL GAFGNPHIRT PNLDALAARG TRFTNAYVQH PVCVPSRASF LTGWYPHTSG HRSQNHLLRP HEPNLLRILK DAGYHVTWAG RRGDTFAPGV TETSVHEYGY TEPPPASAYR PELATWPGGD LWARLFYFGR TAGNLDQDEA TIRTAEQRLA AIPDAPWTLF VPIIAPHCPF RAPEPWFSMY DRDTMPDPIP PGEIEPRYVP ALRNLHRLER VAPEIWREVI ATYYAMVSRM DDHLGRVLSA VERTGQAGNT VTMFFADHGE YLGDFGLIEK WPSAMHPCIT RDPLVIAGGG LPEGQVYDGM VELVDVLPTV LELAGVPAPH RHFGRSLLTV LHDPGSEHRE YAFTEGGFTV EEESQLEDSP FPYDLKTALQ HHQPDLVGKA TAIRDREWTY VWRLYDPPEL YHRVTDPDER HNIAGRSEHS EVERRLSQAL LRWLMTTTDI IPTDSDPRMP DVDLPTPQPV SP
|
| |