Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_2242 |
Symbol | |
ID | 4690354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 2538669 |
End bp | 2540207 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639850004 |
Product | sulfatase |
Protein accession | YP_997008 |
Protein GI | 121609201 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAA AGAACGTGCT GCTGATCGTC GTGGACCAGT GGCGGGGGGA CACCCTGCCG ATGCTCGGGC ACCCGGTGGT CAAGACGCCC CATATCGCGG CCCTGGCCGA TGAAGGCGTC ACCTTCAGGC GCCACTACAC CCAGGCGGTG CCTTGCGGCC CGGGCCGTGC CAGTTTGCTC ACCGGGCTGT ACATGATGAA CCACCGGGCG GTTCAGAACA CCATACCGCT GGATGCGCGC CACACCAACA TCGCGTTCGA AGTGCGCAAA GCCGGCTACG ATCCGGCCCT GGTCGGGTAC ACCACCACCA CCCCCGACCC GCGCGTGAAC GCGCATTCAG ACCCGCGCTT TACCGTGCTC GGCGCCGACA TGGAGGGCTG GCGGCCCGTC GGCTCCTGGG GCCTGAAGAT GGAGGCTTAC TTCGCGTGGC TCGCTGCACA GGGCTACGCC TTGCCCGAGA ACCCCTGGGA CATCTGGCTG CCCCAAGACA TGGGCCCCGG TGAAATCGGC GCGAGCCGGC AAGCCAGCCG CGTTCCTGCG CATCTGTCCG ATACGGTCTG GTTCACCGAC CGGGGGCTGG ATTACCTGCA CGGCGCCCGG CGCAAGCCCT GGTTTCTGCA CCTGGGCTAC TGGCGGCCGC ACCCGCCATT CATTGCCCCG GCGCCGTACC ATGAGATGTA CGACCCCGCG CAATGCCCTG CGCCCGTGCG CGCGGCCAGT GCGCAGGCCG AAGCGCAGCA GCACCCTTTG CTGCGTTATT ACCTGGAAAA CATCCAGCGC AGCAGCTTCT TTCAAAATGG CAAGGGTCTG GGCAGCCAGA TGGACGACGC CGAAGTGCGC CAGATGCGCG CGACCTACTA TGGACTGATG ACCGAAATCG ATGCCCAGCT CGGGCGTGTC TTTGCTTACC TCAAGGAGAC GGGGCAGTGG GACGACACCC TGATTGTCCT GACCAGCGAC CATGGCGAGC AACTGGGCGA TCACCATCTG CTGGGCAAGA TCGGGTACTT CGACCAGAGC TACCACATTC CGATGCTGAT ACGGGACCCG TCCCGCGCCG CCGATGGCAC GCGCGGCACG CAGGTCGATC ACTTCACCGA GACCATCGAC ACCATGCCGA CATTGCTCGA CTGGCTCGGA CAGCCCGCAC CCCGGGCCTG CGACGGCCGC TCGCTGCTGC CCTTCGTTCA TGCCGGCGCA GCCCCTGCGG ACTGGCGCAC CGAAGTCCAC TACGAATACG ACTTTCGCGA CATCTTCTAC TCCCGCCCCG AGACCGCACT GGGCCTGAAA ATGGACGAAT GCGCGCTGGC GGTGGTGCAG GACGAAAACT GGAAATATGT GCACTTCGCA GCCTCGGCGC CCCTGTTCTT CGATCTGCGG CGTGACCCGT CGCAACTCGA CAGCGTCGCG GGTCAGCCGG AATACGCCGC GCAGCAACTG GTCTATGCAC AAAAAATGCT CAATTGGCGC TTGCAGCACG CGGAGCGGAC ATTGACCGGC TACGCCGCAT CGCCGCAAGG GCTGCGCTGC CGGCGGTGA
|
Protein sequence | MRKKNVLLIV VDQWRGDTLP MLGHPVVKTP HIAALADEGV TFRRHYTQAV PCGPGRASLL TGLYMMNHRA VQNTIPLDAR HTNIAFEVRK AGYDPALVGY TTTTPDPRVN AHSDPRFTVL GADMEGWRPV GSWGLKMEAY FAWLAAQGYA LPENPWDIWL PQDMGPGEIG ASRQASRVPA HLSDTVWFTD RGLDYLHGAR RKPWFLHLGY WRPHPPFIAP APYHEMYDPA QCPAPVRAAS AQAEAQQHPL LRYYLENIQR SSFFQNGKGL GSQMDDAEVR QMRATYYGLM TEIDAQLGRV FAYLKETGQW DDTLIVLTSD HGEQLGDHHL LGKIGYFDQS YHIPMLIRDP SRAADGTRGT QVDHFTETID TMPTLLDWLG QPAPRACDGR SLLPFVHAGA APADWRTEVH YEYDFRDIFY SRPETALGLK MDECALAVVQ DENWKYVHFA ASAPLFFDLR RDPSQLDSVA GQPEYAAQQL VYAQKMLNWR LQHAERTLTG YAASPQGLRC RR
|
| |