Gene Information |
Locus tag | Veis_2294 |
Symbol | |
ID | 4694561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 2603032 |
End bp | 2604360 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639850061 |
Product | sulfatase |
Protein accession | YP_997060 |
Protein GI | 121609253 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
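The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that Gene Length counts the stop codon while Protein Length does not. Neither assumption is stated in the record. The helper names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Veis_2294.
length_bp = gene_length(2603032, 2604360)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the sketch gives 1329 bp and 442 aa, matching the Gene Length and Protein Length fields. Strand does not affect the calculation, because the coordinates are given with Start bp smaller than End bp even for this minus-strand gene.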
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.434198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.499589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCCA GACCGAACAT CATCTTCATC GTGGCCGACG ATCTCGGCTA TGCCGACCTG GGCTGCTACG GCGCCCGCGC GGCCGGCTTC GGCCCGGTCT CGCCCACGCT CGACGCGCTG GCCGCAGGCG GGCTCAGACT CACCCAGGGC TACTCGAACT CGCCCGTGTG CTCGCCCACA CGCTTTGCGC TGATGACGGC GCGCTACCAG TACCGTTTGC GCGGGGCGGC CGAAGAGCCG ATCAACAGCC AGAGCCGGGG CAGCACCACC CTGGGCCTGC CGCCCGAGCA CCCGACGCTG CCGTCGCTAC TGCGCGGCGC GGGCTACCGC ACGGCGCTGA TGGGCAAGTG GCACCTGGGC TACCCGCCCG CTTTCGGGCC GTTGCGCTCG GGGTATGAGG AATTCTTCGG CCCGATGTCG GGCGGAGTGG ACTATTTCAC CCACTGCAGT TCCAGCGGCC AGCATGATCT GTACTTGGGC GCCCAGGAAC AGCCGCAGGA CGGCTACCTC ACCGATCTGA TCACCGAGCA CGCGCTCGAC TATGTGGCGC GCATGGCACC CGGCGCCAAG GCCGGGACGC CCTTTTTTCT GAGCCTGCAC TACACGGCGC CGCATTGGCC CTGGGAGACG CGCGACGACC AGGCGCTGGC GCCGCAACTG CACCGCCACC TGTTCCATCT GCACGGCGGC AGCATCCACA GCTACCGCCG CATGATCCAC CACATGGACG AGGGCATCGG CCGCCTGATG GCGTTGCTTG CGCAGCATGG CCTGACGCGC GACACGCTGC TGGTCTTTAC CAGCGACAAC GGCGGTGAGC GCTTCTCGGA CAACTGGCCG CTGGTGGGTG GCAAGATGGA CCTGACCGAA GGTGGCATCC GCGTGCCCTG GATAGCGCAC TGGCCGGCGG TGATCGCCCC CGGTGGTGTC AGCGCCCAGA CCTGCATGAC CATGGACTGG TCGGCCACCC TGCTCGACGC CGCCACTGTG GCGCCTGATG CCGACTACCC GCTCGACGGC AAATCCCTGA TGCCGCTGCT GCGCGACGCC ACCACATGGT ATGGGCCGGC GCTGTTTTGG CGCATGAACC ATCGCGGCCA ACGCGCCATG CGCCAGGGCG CGTGGAAGTA CCTGCGCGTG GACGGCCATG ACTACCTGTT CGACCTGTCG CAAGACGAAC GCGAGCGCGC CAACCAGGCC GCCATCGACC CCGAACGCCT GTCGGCCATG CGCGCCGCGT GGGAACGATG GAATGCCGGC ATGCCAGCGA TACCGCAGGA TGCCACGGTG AGTCTGGGCT ACTCCACCCG GGACATGCCA CAGCGCTGA
|
Protein sequence | MSSRPNIIFI VADDLGYADL GCYGARAAGF GPVSPTLDAL AAGGLRLTQG YSNSPVCSPT RFALMTARYQ YRLRGAAEEP INSQSRGSTT LGLPPEHPTL PSLLRGAGYR TALMGKWHLG YPPAFGPLRS GYEEFFGPMS GGVDYFTHCS SSGQHDLYLG AQEQPQDGYL TDLITEHALD YVARMAPGAK AGTPFFLSLH YTAPHWPWET RDDQALAPQL HRHLFHLHGG SIHSYRRMIH HMDEGIGRLM ALLAQHGLTR DTLLVFTSDN GGERFSDNWP LVGGKMDLTE GGIRVPWIAH WPAVIAPGGV SAQTCMTMDW SATLLDAATV APDADYPLDG KSLMPLLRDA TTWYGPALFW RMNHRGQRAM RQGAWKYLRV DGHDYLFDLS QDERERANQA AIDPERLSAM RAAWERWNAG MPAIPQDATV SLGYSTRDMP QR
|
| |