Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_3401 |
Symbol | |
ID | 7970639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 3581781 |
End bp | 3583625 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644793985 |
Product | sulfatase |
Protein accession | YP_002945284 |
Protein GI | 239816374 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGCG ATGCCGGGCG AGCCCGTCTG GGCCTGCGAG GCGCAATAAG TTCCGTGGGG GATATCCCGC AGCGCCGCAG CAAAGACCCC GTTGCAGCCC GGACACTGGC AGGCAGCGCA CACGCCGCGG GGCAAAATGC GCCCGCCCTC TCGACCTCCC GCTTTGCACG ATTGACCACC CTCCACTCCG AAGCGTCCGC TGTCCGGAAC GCCTCGCACC CACTGCTCGC CTGCGGCGCG GTCGGTGCCA CGCTGCTGGC ATTCATCGTG CTGGGCCATG ACGGCCGGCG GATCGCGCAG CTCGCGGTGC TCGCCGCGCC GATGGTGCTG TGGCTCGCGT GGCCGCTGCG CAGCGCACGA CTGCGGCGCC TGCGCGCGGC GCTCGTCTGG CTCTGGGTGA TGGGCTTCGC GCTCGACGGC GTGGCGCGCG CCTACCTGCT CGACACCTAC CAGGCCGCGC CCGACGGCGC GATGGTGCTG GGCGCCGCGG CCAACACCAG CGTGCGCGAA AGCACCGAAT ACCTGAGGAT GCACTGGCGG TCCGCCGTCG TCTGGTCGGC GGCGCTGGCG GGGGCCGCGG TGCTCGCGGG CATGTTCGTT CGGCGCGGTG CGGCGGCGGT TTCAGCTTCG TTGGCGCCGG GCATGCGCCC GCGCCTACCG TTCTGGCTGA GATCGCTGGT GCTCGCGCTG CTGCTGCTCG CCTGCCTGGC CTATGCGAGC AAGCCGTGGC GCCGGCTGCA TCCCGCGCTG TTCTGGTCCC AGTGGTCGCA TTCGGTCCAC ACGCTGCGCG CCGCATGGGC CGACCAGCAG CAGGTGCGCG ACCGCATGAT GGCGCAGGCC AAGGCGATCG CGCCCGTGCC GCTGGAGGCC GGCCCGTCGA CGGTGGTGCT GGTCATCACC GACAGCATCA ACCGCGACAA CATGGCGCTC TACGGCTACG GTCGCCCGAC CACGCCGCGT CTGCTGGCGC ACAAGGCACA GGCCGGCGAC CAGATGGCCG TGCTGCGCCA CGCGTGGTCG GCGGATGCGA GCACCCTGCC CGCGCTGCGC AACCTGTTCC ACTTCGGCTT GCCCCACACG GAGAACGGCG AGAACCCGCC GCACCTGCTG GCGCTGGCAC GCGCCGCGGG CTACAAGGTC TGGTGGATCA GCAACCATGA CGACCTGGCC ATCGAGCAGC AGCATGCGCG CTTCGCCGAT GTGGTGGACA TGGTGAACCG CACGCCCGGC CGCGCCAGCG CCTCGCTAGA CGGCGAGATC CTCGACTGCG TGCAGGAGGC GCTGCAGGAT GCGGGCACCG ACCGCAAGCT GATCGTGGTG CACCTCATGG GCGCGCACCC GCACTACAGC CTGCGCTTTC CCGAGAACGC CAATCCCTTC GACGACGACG TGGACGCGGT CGAGACCGGC CTGGTGAAGA ACGGCCGGTC GGCCTGGGTG CGCCGCTTCC GGCAGGAGTA CGACGCCGCG CTGCTGTACC ACGACTTCGT GGTGTCCGAG CTTTTGCAGC AGACGCGCAG CGCGGGCAAC CCGCATGACT ATCGCGCCTG GATCTACCTG TCGGACCACG GCCAGGAAGT GGGGCATGGC AGCGACCGTG CGGGCCACAG CCCTTCGACC GCTTCGGGCT ACCGCATTCC GGCGGTGATC TGGCGCAACC GGCAGCCGCT GCCGGACGGC GCCGCGCAGC AGCAGCCCTT TCGCGCCGAC TGGGCGGCCT GGACGCTGAT GGACCTGCTC AAGATCCAGT GGCGCGGCCA GGTGCCCGAG CGCAACGTGC TGGACCCGGC CTACCGCTGG CAGGCCCCGA AGATCCCGGT GGCCGTCGAA TCGTTCTCGC GCTGA
|
Protein sequence | MSGDAGRARL GLRGAISSVG DIPQRRSKDP VAARTLAGSA HAAGQNAPAL STSRFARLTT LHSEASAVRN ASHPLLACGA VGATLLAFIV LGHDGRRIAQ LAVLAAPMVL WLAWPLRSAR LRRLRAALVW LWVMGFALDG VARAYLLDTY QAAPDGAMVL GAAANTSVRE STEYLRMHWR SAVVWSAALA GAAVLAGMFV RRGAAAVSAS LAPGMRPRLP FWLRSLVLAL LLLACLAYAS KPWRRLHPAL FWSQWSHSVH TLRAAWADQQ QVRDRMMAQA KAIAPVPLEA GPSTVVLVIT DSINRDNMAL YGYGRPTTPR LLAHKAQAGD QMAVLRHAWS ADASTLPALR NLFHFGLPHT ENGENPPHLL ALARAAGYKV WWISNHDDLA IEQQHARFAD VVDMVNRTPG RASASLDGEI LDCVQEALQD AGTDRKLIVV HLMGAHPHYS LRFPENANPF DDDVDAVETG LVKNGRSAWV RRFRQEYDAA LLYHDFVVSE LLQQTRSAGN PHDYRAWIYL SDHGQEVGHG SDRAGHSPST ASGYRIPAVI WRNRQPLPDG AAQQQPFRAD WAAWTLMDLL KIQWRGQVPE RNVLDPAYRW QAPKIPVAVE SFSR
|
| |