Gene Veis_0934 details

Gene Information

Locus tag: Veis_0934
Symbol:
ID: 4693530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 1043888
End bp: 1045444
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 69%
IMG OID: 639848712
Product: sulfatase
Protein accession: YP_995730
Protein GI: 121607923
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
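
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length_bp(1043888, 1045444)
print(length, protein_length_aa(length))  # 1557 518
```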


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGC ACCCGGCTGC ATCGCTGACC CGGCCGCGCA ACGCCGTGGT CATTTTGCTC 
GACAGCCTGA ACCGCCACCT GCTGGGCGCC TATGGCGCCA CCGAGTTCGA GACCCCGCAG
ATCGACCGCT TTTGCGCCAG CGCCCTGCGC TTTGACCGGC ACTATGCCGG CTCGCTGCCG
TGCATGCCGG CGCGCCACGA TATCTTGTGC GGTGCGCTGG ACTTTCTCTG GCGTCCCTGG
GGATCGATCG AAGTCTGGGA GGACGCGATC ACCTACTGGC TGCGCAACGC CGGCGTCGTC
ACCCAACTGA TCTCGGACCA CCCGCACCTG TTCGAGAGCG GCGGCGAAAA CTACCACGCC
GACTTTCAGG GCTGGGACTA TCTGCGCGGC CACGAAAGCG ACCCGTGGAA AACGGCGCAA
AGCGAGTGCG CCATCGGCGC CCCGCTGCAC CAGGTGCTGC CCGGCCCCTT CCCGCACGAG
TACGACACCA ATCGCACCTG GTTCAAGCGC GAAGAAGACT TTCCCGGCCC GCAGACCATG
GCCAGCGCCG CGCGCTGGAT CGACGAGAAC GCCGGACGGC ATCAGCGTTT CTTCCTGATG
ATCGACGAGT TCGATCCGCA CGAACCCTTC GACACGCCAC AGCCCTGGGC CTGCCGGTAC
CGGCAGGCCC AGGGGGCCGA TGAGCACCAG CCGCTGCTGG TATGGCCGCC CTACGCGGTG
GATGCGATCG AGCGCGGCGT GCTCACGGCC GCCCAGGCGC AGGAACTGCG CAACAACTAC
GGCGCCAAGC TGTCGATGAT CGACCATTGG CTGGGCCGGG TGCTCGACGC GATCGAGCGC
AATCGGCTGG CCGCCGACAC CGCCGTGATC CTGTGCACCG ACCACGGCCA CTACCTCGGC
GAGCGCGACA TCTTCGGCAA ACCGGGCGTG CCGCTGTACC AGCCGATGGC CCATATCCCG
CTGATGATCC GCTGGCCCGG CATGGCGCCG GGCCGCCGCG ACATGCTGAC AACGAGCGTG
GACATCCACG CCACCATTGC CGACATCTTC GGCGTGTCGG CCGCGCACCG CACGCATGGG
CGCTCGCTGC TGCCCGCCAT CGCCGACCCG GGCCAGCAGG TGCGCGAGCA TTTGCTGGCC
GGCGTCTGGG GCCGCGAGGT GCATTACATC GACCGCAGCC ACAAATACGT TCGCGCCCCG
GCGCAGGCCA ACGCGCCGCT ATCGATGTGG TCCAACCGCT GGTCGACGAT GCCGCAGCAC
CATGTGCCGG GCCGGCGTCT GCTGCCGCCC GACCGCCGCG CGCGCATCGA CTTCATGCCC
GGCAGCCAGG TGCCGGTGCT GCGCCAGCCC TTCGTCGAGG GCGACCTGCT GCCGCTATGG
GCGCGGAACC TGCGGTTCAG CGGCAACCAC CTGTGGAACC TCGACGCCGA CCCCCGCGAG
CAGACCGATC TGGCCGGCAG CGCGCTGGAG GCCGAGTACG CGCACAAATT GCACGCCGCG
CTGCGGGCCA TCGAGGCGCC GGATGACCAG GCCATCCGGC TCGGGCTTGG GGTTTGA
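
The page does not say how the GC content figure was derived. A common definition is the count of G and C bases divided by the total number of bases, and the sketch below assumes that definition. It strips the spaces and line breaks used in the 10-base grouping before counting. The function name gc_fraction is a hypothetical helper; to check the reported value, pass it the full gene sequence rather than the single line used here.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace from the grouped layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence only; the full sequence is needed
# to compare against the GC content reported for the gene.
first_line = "ATGAACCAGC ACCCGGCTGC ATCGCTGACC CGGCCGCGCA ACGCCGTGGT CATTTTGCTC"
print(f"{gc_fraction(first_line):.0%}")
```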
 
Protein sequence
MNQHPAASLT RPRNAVVILL DSLNRHLLGA YGATEFETPQ IDRFCASALR FDRHYAGSLP 
CMPARHDILC GALDFLWRPW GSIEVWEDAI TYWLRNAGVV TQLISDHPHL FESGGENYHA
DFQGWDYLRG HESDPWKTAQ SECAIGAPLH QVLPGPFPHE YDTNRTWFKR EEDFPGPQTM
ASAARWIDEN AGRHQRFFLM IDEFDPHEPF DTPQPWACRY RQAQGADEHQ PLLVWPPYAV
DAIERGVLTA AQAQELRNNY GAKLSMIDHW LGRVLDAIER NRLAADTAVI LCTDHGHYLG
ERDIFGKPGV PLYQPMAHIP LMIRWPGMAP GRRDMLTTSV DIHATIADIF GVSAAHRTHG
RSLLPAIADP GQQVREHLLA GVWGREVHYI DRSHKYVRAP AQANAPLSMW SNRWSTMPQH
HVPGRRLLPP DRRARIDFMP GSQVPVLRQP FVEGDLLPLW ARNLRFSGNH LWNLDADPRE
QTDLAGSALE AEYAHKLHAA LRAIEAPDDQ AIRLGLGV
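
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch using only the Python standard library, the codon table can be built from the compact NCBI-style amino acid string, which is identical for tables 1 and 11 (the two differ only in permitted start codons). This gene starts with ATG, so no special start-codon handling is included. The translate function is a hypothetical helper that stops at the first stop codon and marks unrecognised codons as X.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAACCAGC ACCCGGCTGC A"))  # MNQHPAA
```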