Gene Ent638_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2784 
Symbol 
ID5112735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3007937 
End bp3009697 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content54% 
IMG OID640492971 
Productsulfatase 
Protein accessionYP_001177500 
Protein GI146312426 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.879948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACGA ATCGTCAGCG CTACCGCGAA AAAGTCTCCC AGATGGTTAG CTGGGGGCAC 
TGGTTTGCCT TGTTCAACAT TTTGTTGGGC ATGGTGCTGG GCAGCCGTTA TCTGTTCGTG
GCCGACTGGC CAACGACGCT AACCGGGCGT ATTTACTCCT GGATAAGCCT GGTGGGGCAT
TTTAGCTTTT TAGTCTTCGC CACGTACCTG CTGATTCTTT TCCCCCTGAC GTTTATCGTC
ATGTCGCAGC GGCTGATGCG GTTTTTATCC GCCATCCTTG CCACTGCGGG CATGACGCTG
CTGCTTATCG ACAGCGAAGT CTTCACACGT TTCCACCTGC ATCTCAATCC CATCGTCTGG
GAACTGGTGA TTAACCCCGA CCAGAACGAA ACCGCACGTG ACTGGCAGCT GATGTTTATC
AGCGTGCCCA TTATTCTGCT GATTGAGATG CTGTTTGCCA CCTGGAGCTG GCAAAAACTC
CGTAGCCTGA CGCGTCGTCG TCACTACGCC AAACCTGTCG CCACGCTATT TTTCGTGGCG
TTTATCAGCT CACATGTCAT GTATATTTGG GCTGATGCTA ATTTCTACCG CCCAATCACC
ATGCAGCGCG CCAACTTGCC GCTTTCCTAC CCGATGACAG CGCGTCGCTT CCTGGAAAAA
CACGGGCTGC TTGATGCGCA GGAATATCAG CGTCGCCTGG TCGAGCAAGG CAATCCAGAG
GCCGTGTCGG TACAGTATCC ACTAAGCGAC TTACGCTATC GCGATATGGG ACAAGGCCAG
AACGTCTTAC TCATTACCGT CGACGGTCTG AATTATTCTC GCTATGAGAA ACAGATGCCT
GCGCTGGCAG AATTCGCCGA GCAAAACATT AACTTCACGC AGCATATGAG CTCCGGGAAT
AGCACCGATG CAGGCATTTT CGGTCTGTTC TATGGCATTT CTGCGGGTTA TATGGATGGC
GTGTTGTCTT CTCGTACGCC TGCGGCCCTG ATAACCTCGC TTAATCAGCA AGGTTATCAA
CTGGGGCTGT TCTCATCGGA CGGTTTCAGT AGCCCGCTTT ATCGACAGGC GTTGTTGTCC
GACTTCTCCC TGCCTGCAGC GCAGAGTCAG TCCGACGATA AAACGGCGGA TCAGTGGGTG
AACTGGCTGA ATCGCTATGC GCAGGAAGAT AACCGCTGGT TCTCCTGGGT AGCGTTTAAC
GGCACGACGC TGGATGACAG CAGTCAGAAA GGCTTTGCCC GTCGTTACGG CCGTGCGGCT
GGCGATGTTG ACGCGCAGAT CGCGCGCGTC ATTAACGCAC TCCGTGAATC CGGCAAACTG
GATAATACTG TAGTGATCAT TACCGCAGGC CACGGTGTGC CGCTGGGTGA TGAGACAAAA
GAAATGGAAT GGTCGCGCCC TAACCTGCAC GTCCCACTTG TTGTTCACTG GCCGGGCACG
CCTGCACAAC GCATCAGCAT GTTGACCGAT CACAAAGACG TCATGACAAC GTTGATGCAG
CGGTTGCTGC ACGTCAGCAC ACCGGCGAAT GAATACTCAC AGGGTCAGGA TATTTTCAGC
GCCACGCGAC GCCACAACTG GGTGACGGCA GCCAGCGGAA ATACTCTGGC AGTCACGACA
CCAGCGCTAA CATTGGTGCT TGGCAGCAAC GGTAATTACC AGACGTATAA TCTGCAGGGT
GAGAAAATAC ATGACCAGAA ACCGCAGTTG AGTCTGCTGC TTCAGGTGCT GACGGACGAG
AAACGGTTCA TCGCTAACTG A
 
Protein sequence
MVTNRQRYRE KVSQMVSWGH WFALFNILLG MVLGSRYLFV ADWPTTLTGR IYSWISLVGH 
FSFLVFATYL LILFPLTFIV MSQRLMRFLS AILATAGMTL LLIDSEVFTR FHLHLNPIVW
ELVINPDQNE TARDWQLMFI SVPIILLIEM LFATWSWQKL RSLTRRRHYA KPVATLFFVA
FISSHVMYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLVEQGNPE
AVSVQYPLSD LRYRDMGQGQ NVLLITVDGL NYSRYEKQMP ALAEFAEQNI NFTQHMSSGN
STDAGIFGLF YGISAGYMDG VLSSRTPAAL ITSLNQQGYQ LGLFSSDGFS SPLYRQALLS
DFSLPAAQSQ SDDKTADQWV NWLNRYAQED NRWFSWVAFN GTTLDDSSQK GFARRYGRAA
GDVDAQIARV INALRESGKL DNTVVIITAG HGVPLGDETK EMEWSRPNLH VPLVVHWPGT
PAQRISMLTD HKDVMTTLMQ RLLHVSTPAN EYSQGQDIFS ATRRHNWVTA ASGNTLAVTT
PALTLVLGSN GNYQTYNLQG EKIHDQKPQL SLLLQVLTDE KRFIAN