Gene Veis_4349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4349 
Symbol 
ID4691514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4790901 
End bp4792640 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content70% 
IMG OID639852094 
Productsulfatase 
Protein accessionYP_999070 
Protein GI121611263 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.398112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.728931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCATGC TGTCGACCCT GCGGCGCCTG CTGCCGATGG CCCTGCGCAG CGCGTTTGCA 
CGCCGCCCCG GCGACCCCGC CAGCCCCTCT GGCCGCCGTC CGATCCACCC GGCCCAGGTG
GTGCTGCTGA CCAGCGCCTG GCTGGCCAGC GCCTGCAATC TGCCGCTGTG GCAGGCCGTG
GCCCGGCTGC CCGGGCAGGG CAGCCTGCGC GGCTGGGGCT TTGCGCTGGC CTTCTGGCTC
ATCGTGATGG CCGGCAACAC CGCCCTGCTG AGCCTGCTGG CCTGGCGCTG GACGCTCAAG
CCCGCCGTCG TGCTGCTGCT GCTGATGGCG GCGTTTGGCG CGTATTTCAT GCTCGCCTAT
GGCATCGTGA TCGACGCCGG CATGCTGGTC AATGTGCAGC AGACCGACCC GCGCGAAGCC
CGCGACCTGC TCAGCGGGCG CATGGCGGTG ACCGTCTCGG CGCTGGCCCT GCCCCCGCTG
CTGTGGCTGC GCCGCCGCCC GTTGCAGCGC CTGGGCGCGC TGCGCCAACT GCGCAGCAAC
AGCCTGCTGC TCGGCGGCTC GATCACGGTC GGCCTGCTCA GCCTGCTGCT GGTCTTTCAG
GACTTCGCCT CCGCGATGCG CAACCACAGC CAGATGCGCT ACCTGATCAA CCCGCTCAAC
AGCGTGTACG CGCTGGGCCA TCTGGCCGCC CAACCGCTGC GCATGGACAC CAGCGTGCTG
CTGCCCCTCG GGCGCGACGC CCGGCTCGGC GCCAGCTATG CCGGCCAGAC CCAGGCGCCG
CTGCTGATCC TGGTGCTCGG CGAAACCGGC CGCAGCCAGA ACTTCGGCAT CAACGGCTAC
GAGCGCGACA CCACTGCGCT GCTCGCGGCG CGCAAAGACC TGATCAGCGC GCGCAACGCC
TGGTCTTGCG GCACCAGCAC CGCCGCATCG CTGCCGTGCA TGTTCTCGCA CCTGGGGCGC
GCAGGCTATG CCGGGCGCTC GGCCAACCAT GAGAACCTGC TCGACGTGCT GCAACACGCG
GGCCTGGCCC TGCTGTGGGT GGACAACCAG GCCGGCTGCA AAGGCGTGTG CGCGCGCATC
GCGCAAACCC GCCCGGCCAC CGATCCGGCG CTCTGCCCCG ACGGCGAATG CCTGGACCGC
GCGATGCTCG ACGGCCTGTC CGCCCAAATC GCCGCGCTGC CCGCCGCGCG GCGCCAGCGC
GGCACCGTGG TCGTGCTGCA CCAGATCGGC AGCCACGGCC CGGCCTACTA CAAGCGCTCG
GCGCCACAGA ACAAGAAGTT CATGCCCGAA TGCCACTCGG CCGCGCTGCA AGAATGCGCG
CGCCAGCAGG TGGTCAACGC CTACGACAAC AGCATCGTCG AGACCGACCA GTTTCTCGCT
GCGCTGCTGC AATGGCTGGC AGCACCGGGC CACGCGCAGG ACCATGCCCA GGCCGCGATG
ATCTATGTCT CCGACCATGG CGAATCGCTC GGCGAAAACA ACCTGTACCT GCACGGCCTG
CCCTACGCCA TCGCCCCCGA CGTGCAAAAG CATGTGCCCT GGATCACCTG GCTATCCCCC
GCGATGCAGG CGCGCACCGG CCTTGCCACC GGCTGCCTGC AGCGCGACCT GGGCCAGCGG
CAGATCAGCC ACGACAACTA CTTCCACTCG GTGCTCGGCC TGATGGATGT GCAAACCAGC
GCCTACGACC CGGCGCTGGA CATGTTTGCG CGCTGCAAGG CCAGGGGCGA AAAGGAATAG
 
Protein sequence
MAMLSTLRRL LPMALRSAFA RRPGDPASPS GRRPIHPAQV VLLTSAWLAS ACNLPLWQAV 
ARLPGQGSLR GWGFALAFWL IVMAGNTALL SLLAWRWTLK PAVVLLLLMA AFGAYFMLAY
GIVIDAGMLV NVQQTDPREA RDLLSGRMAV TVSALALPPL LWLRRRPLQR LGALRQLRSN
SLLLGGSITV GLLSLLLVFQ DFASAMRNHS QMRYLINPLN SVYALGHLAA QPLRMDTSVL
LPLGRDARLG ASYAGQTQAP LLILVLGETG RSQNFGINGY ERDTTALLAA RKDLISARNA
WSCGTSTAAS LPCMFSHLGR AGYAGRSANH ENLLDVLQHA GLALLWVDNQ AGCKGVCARI
AQTRPATDPA LCPDGECLDR AMLDGLSAQI AALPAARRQR GTVVVLHQIG SHGPAYYKRS
APQNKKFMPE CHSAALQECA RQQVVNAYDN SIVETDQFLA ALLQWLAAPG HAQDHAQAAM
IYVSDHGESL GENNLYLHGL PYAIAPDVQK HVPWITWLSP AMQARTGLAT GCLQRDLGQR
QISHDNYFHS VLGLMDVQTS AYDPALDMFA RCKARGEKE