Gene Vapar_3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3401 
Symbol 
ID7970639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3581781 
End bp3583625 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content71% 
IMG OID644793985 
Productsulfatase 
Protein accessionYP_002945284 
Protein GI239816374 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGCG ATGCCGGGCG AGCCCGTCTG GGCCTGCGAG GCGCAATAAG TTCCGTGGGG 
GATATCCCGC AGCGCCGCAG CAAAGACCCC GTTGCAGCCC GGACACTGGC AGGCAGCGCA
CACGCCGCGG GGCAAAATGC GCCCGCCCTC TCGACCTCCC GCTTTGCACG ATTGACCACC
CTCCACTCCG AAGCGTCCGC TGTCCGGAAC GCCTCGCACC CACTGCTCGC CTGCGGCGCG
GTCGGTGCCA CGCTGCTGGC ATTCATCGTG CTGGGCCATG ACGGCCGGCG GATCGCGCAG
CTCGCGGTGC TCGCCGCGCC GATGGTGCTG TGGCTCGCGT GGCCGCTGCG CAGCGCACGA
CTGCGGCGCC TGCGCGCGGC GCTCGTCTGG CTCTGGGTGA TGGGCTTCGC GCTCGACGGC
GTGGCGCGCG CCTACCTGCT CGACACCTAC CAGGCCGCGC CCGACGGCGC GATGGTGCTG
GGCGCCGCGG CCAACACCAG CGTGCGCGAA AGCACCGAAT ACCTGAGGAT GCACTGGCGG
TCCGCCGTCG TCTGGTCGGC GGCGCTGGCG GGGGCCGCGG TGCTCGCGGG CATGTTCGTT
CGGCGCGGTG CGGCGGCGGT TTCAGCTTCG TTGGCGCCGG GCATGCGCCC GCGCCTACCG
TTCTGGCTGA GATCGCTGGT GCTCGCGCTG CTGCTGCTCG CCTGCCTGGC CTATGCGAGC
AAGCCGTGGC GCCGGCTGCA TCCCGCGCTG TTCTGGTCCC AGTGGTCGCA TTCGGTCCAC
ACGCTGCGCG CCGCATGGGC CGACCAGCAG CAGGTGCGCG ACCGCATGAT GGCGCAGGCC
AAGGCGATCG CGCCCGTGCC GCTGGAGGCC GGCCCGTCGA CGGTGGTGCT GGTCATCACC
GACAGCATCA ACCGCGACAA CATGGCGCTC TACGGCTACG GTCGCCCGAC CACGCCGCGT
CTGCTGGCGC ACAAGGCACA GGCCGGCGAC CAGATGGCCG TGCTGCGCCA CGCGTGGTCG
GCGGATGCGA GCACCCTGCC CGCGCTGCGC AACCTGTTCC ACTTCGGCTT GCCCCACACG
GAGAACGGCG AGAACCCGCC GCACCTGCTG GCGCTGGCAC GCGCCGCGGG CTACAAGGTC
TGGTGGATCA GCAACCATGA CGACCTGGCC ATCGAGCAGC AGCATGCGCG CTTCGCCGAT
GTGGTGGACA TGGTGAACCG CACGCCCGGC CGCGCCAGCG CCTCGCTAGA CGGCGAGATC
CTCGACTGCG TGCAGGAGGC GCTGCAGGAT GCGGGCACCG ACCGCAAGCT GATCGTGGTG
CACCTCATGG GCGCGCACCC GCACTACAGC CTGCGCTTTC CCGAGAACGC CAATCCCTTC
GACGACGACG TGGACGCGGT CGAGACCGGC CTGGTGAAGA ACGGCCGGTC GGCCTGGGTG
CGCCGCTTCC GGCAGGAGTA CGACGCCGCG CTGCTGTACC ACGACTTCGT GGTGTCCGAG
CTTTTGCAGC AGACGCGCAG CGCGGGCAAC CCGCATGACT ATCGCGCCTG GATCTACCTG
TCGGACCACG GCCAGGAAGT GGGGCATGGC AGCGACCGTG CGGGCCACAG CCCTTCGACC
GCTTCGGGCT ACCGCATTCC GGCGGTGATC TGGCGCAACC GGCAGCCGCT GCCGGACGGC
GCCGCGCAGC AGCAGCCCTT TCGCGCCGAC TGGGCGGCCT GGACGCTGAT GGACCTGCTC
AAGATCCAGT GGCGCGGCCA GGTGCCCGAG CGCAACGTGC TGGACCCGGC CTACCGCTGG
CAGGCCCCGA AGATCCCGGT GGCCGTCGAA TCGTTCTCGC GCTGA
 
Protein sequence
MSGDAGRARL GLRGAISSVG DIPQRRSKDP VAARTLAGSA HAAGQNAPAL STSRFARLTT 
LHSEASAVRN ASHPLLACGA VGATLLAFIV LGHDGRRIAQ LAVLAAPMVL WLAWPLRSAR
LRRLRAALVW LWVMGFALDG VARAYLLDTY QAAPDGAMVL GAAANTSVRE STEYLRMHWR
SAVVWSAALA GAAVLAGMFV RRGAAAVSAS LAPGMRPRLP FWLRSLVLAL LLLACLAYAS
KPWRRLHPAL FWSQWSHSVH TLRAAWADQQ QVRDRMMAQA KAIAPVPLEA GPSTVVLVIT
DSINRDNMAL YGYGRPTTPR LLAHKAQAGD QMAVLRHAWS ADASTLPALR NLFHFGLPHT
ENGENPPHLL ALARAAGYKV WWISNHDDLA IEQQHARFAD VVDMVNRTPG RASASLDGEI
LDCVQEALQD AGTDRKLIVV HLMGAHPHYS LRFPENANPF DDDVDAVETG LVKNGRSAWV
RRFRQEYDAA LLYHDFVVSE LLQQTRSAGN PHDYRAWIYL SDHGQEVGHG SDRAGHSPST
ASGYRIPAVI WRNRQPLPDG AAQQQPFRAD WAAWTLMDLL KIQWRGQVPE RNVLDPAYRW
QAPKIPVAVE SFSR