Gene Veis_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_2294 
Symbol 
ID4694561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp2603032 
End bp2604360 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content68% 
IMG OID639850061 
Productsulfatase 
Protein accessionYP_997060 
Protein GI121609253 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.434198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.499589 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCA GACCGAACAT CATCTTCATC GTGGCCGACG ATCTCGGCTA TGCCGACCTG 
GGCTGCTACG GCGCCCGCGC GGCCGGCTTC GGCCCGGTCT CGCCCACGCT CGACGCGCTG
GCCGCAGGCG GGCTCAGACT CACCCAGGGC TACTCGAACT CGCCCGTGTG CTCGCCCACA
CGCTTTGCGC TGATGACGGC GCGCTACCAG TACCGTTTGC GCGGGGCGGC CGAAGAGCCG
ATCAACAGCC AGAGCCGGGG CAGCACCACC CTGGGCCTGC CGCCCGAGCA CCCGACGCTG
CCGTCGCTAC TGCGCGGCGC GGGCTACCGC ACGGCGCTGA TGGGCAAGTG GCACCTGGGC
TACCCGCCCG CTTTCGGGCC GTTGCGCTCG GGGTATGAGG AATTCTTCGG CCCGATGTCG
GGCGGAGTGG ACTATTTCAC CCACTGCAGT TCCAGCGGCC AGCATGATCT GTACTTGGGC
GCCCAGGAAC AGCCGCAGGA CGGCTACCTC ACCGATCTGA TCACCGAGCA CGCGCTCGAC
TATGTGGCGC GCATGGCACC CGGCGCCAAG GCCGGGACGC CCTTTTTTCT GAGCCTGCAC
TACACGGCGC CGCATTGGCC CTGGGAGACG CGCGACGACC AGGCGCTGGC GCCGCAACTG
CACCGCCACC TGTTCCATCT GCACGGCGGC AGCATCCACA GCTACCGCCG CATGATCCAC
CACATGGACG AGGGCATCGG CCGCCTGATG GCGTTGCTTG CGCAGCATGG CCTGACGCGC
GACACGCTGC TGGTCTTTAC CAGCGACAAC GGCGGTGAGC GCTTCTCGGA CAACTGGCCG
CTGGTGGGTG GCAAGATGGA CCTGACCGAA GGTGGCATCC GCGTGCCCTG GATAGCGCAC
TGGCCGGCGG TGATCGCCCC CGGTGGTGTC AGCGCCCAGA CCTGCATGAC CATGGACTGG
TCGGCCACCC TGCTCGACGC CGCCACTGTG GCGCCTGATG CCGACTACCC GCTCGACGGC
AAATCCCTGA TGCCGCTGCT GCGCGACGCC ACCACATGGT ATGGGCCGGC GCTGTTTTGG
CGCATGAACC ATCGCGGCCA ACGCGCCATG CGCCAGGGCG CGTGGAAGTA CCTGCGCGTG
GACGGCCATG ACTACCTGTT CGACCTGTCG CAAGACGAAC GCGAGCGCGC CAACCAGGCC
GCCATCGACC CCGAACGCCT GTCGGCCATG CGCGCCGCGT GGGAACGATG GAATGCCGGC
ATGCCAGCGA TACCGCAGGA TGCCACGGTG AGTCTGGGCT ACTCCACCCG GGACATGCCA
CAGCGCTGA
 
Protein sequence
MSSRPNIIFI VADDLGYADL GCYGARAAGF GPVSPTLDAL AAGGLRLTQG YSNSPVCSPT 
RFALMTARYQ YRLRGAAEEP INSQSRGSTT LGLPPEHPTL PSLLRGAGYR TALMGKWHLG
YPPAFGPLRS GYEEFFGPMS GGVDYFTHCS SSGQHDLYLG AQEQPQDGYL TDLITEHALD
YVARMAPGAK AGTPFFLSLH YTAPHWPWET RDDQALAPQL HRHLFHLHGG SIHSYRRMIH
HMDEGIGRLM ALLAQHGLTR DTLLVFTSDN GGERFSDNWP LVGGKMDLTE GGIRVPWIAH
WPAVIAPGGV SAQTCMTMDW SATLLDAATV APDADYPLDG KSLMPLLRDA TTWYGPALFW
RMNHRGQRAM RQGAWKYLRV DGHDYLFDLS QDERERANQA AIDPERLSAM RAAWERWNAG
MPAIPQDATV SLGYSTRDMP QR