Gene Ent638_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1506 
Symbol 
ID5114474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1660471 
End bp1661976 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content48% 
IMG OID640491694 
Productsulfatase 
Protein accessionYP_001176237 
Protein GI146311163 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.265184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGA CAACGTTAGC AGTGTTTGTC AGCGCGGTCA TTGGCATGAC AGGGGGAATG 
GGAAATGTTC TGGCGGCAGA GCAATCAGCG AATCAATTAA ATAAACCTAA CGTCGTGATT
ATTTTAGCCG ACGACCTGGG CTATGGCGAT TTAGGCATAT ATGGGCATCC CATCGTTAAA
ACCCCGAATA TCGATAAACT CGCGCAGGAA GGGGTGAGGT TTTCGCAATA TTACGCACCC
GCGCCACTGT GTTCGCCTTC ACGTGCAGGT TTACTGACAG GACGCACCCC CTTCAGAACG
GGGATTCGAT CCTGGATCCC GACCAATAAA AATATCGCAC TGGGGCGTAA CGAAAAGACC
ATAGCCAGTT ACCTGAAAGA CCAGGGTTAC GACACGGCAA TGATGGGGAA ATGGCATCTT
AATGCCGGTG TTGACCGCCA CGATCAGCCC CAGGCTGAAG ATGCTGGTTT CGACTATACG
TTGGTCAATG CTGCTGGTTT TGTCACCAGC GATCTGGATA AGGCGAAAGA GCGTCCGCGT
AATGGCGTGG TGTACCCGAA TGGGTTTTAT CGAAACGGTA AAGCGCTGGG GACCGTTAAC
CAAATCAGCG GTGAATTTGT CAGTCAGGAA GCCATTAACT GGCTAAACGA TAAAAAAGAT
AACAAACCTT TCTTTATGTA TGTGGCTTTC ACAGAGGTCC ATACGCCGCT GGCGTCACCC
AAAAAATACC TCGAAATTTA TAAAAATTAT ATGAGCGAGT ATGAAAAGCA GCATCCCGAT
ATGTTTTATG CCGACTGGGT GGATAAGCCT TATCGTGGTC CGGGAGAATA CTACGCCAAT
ATCAGTTACA TGGATGAACA GGTTGGTAAA GTCCTCGCAA AAATCAAATC AATGGGGCAG
GAGGACAACA CGATAATTAT CTTTACCAGC GATAACGGTC CTGTCACGCG CGAAGCGCGT
AAGTGGTACG AACTTAATAT GGCAGGTGAA ACGGATGGCT TACGGGGTCG CAAAGATAAT
TTGTGGGAGG GGGGAATACG CGTGCCAGCG ATCATTAAAT ATGGTCAGCA TTTACACGCC
GGCACGGTAA CCGACACGCC TGTAAGCGGT CTGGATATAT TACCCACTCT TGCAGAACTG
ACGCATTTTA ACTTGCCGAC CGACCGGATT ATTGATGGGG AATCTATTGT GCCCGTACTT
GAGGGACAAA CGATGAACCG CCAGCAACCC TTGTTATTCG CGATTGATAT GCCGTTCCAG
GATGATCCGA CGGATATGTG GGCACTACGC GACGGCGACT GGAAGATGAT ATTTGACCGC
AATAGCAAAC CTAAATATCT CTATAACCTC AAGCTGGATC GTGGCGAGAC AATGAATCAA
CTGGGTAAAC AACCCGTGCT GGAGCAAAAA ATGATAGCCG CGTTAGCACG TTATCAGTCC
AGTATTGAAA ATGATTCACT TATGAAGGCT AGGGGCGATA AACCGACACC AGTAGACTGG
AACTAA
 
Protein sequence
MRKTTLAVFV SAVIGMTGGM GNVLAAEQSA NQLNKPNVVI ILADDLGYGD LGIYGHPIVK 
TPNIDKLAQE GVRFSQYYAP APLCSPSRAG LLTGRTPFRT GIRSWIPTNK NIALGRNEKT
IASYLKDQGY DTAMMGKWHL NAGVDRHDQP QAEDAGFDYT LVNAAGFVTS DLDKAKERPR
NGVVYPNGFY RNGKALGTVN QISGEFVSQE AINWLNDKKD NKPFFMYVAF TEVHTPLASP
KKYLEIYKNY MSEYEKQHPD MFYADWVDKP YRGPGEYYAN ISYMDEQVGK VLAKIKSMGQ
EDNTIIIFTS DNGPVTREAR KWYELNMAGE TDGLRGRKDN LWEGGIRVPA IIKYGQHLHA
GTVTDTPVSG LDILPTLAEL THFNLPTDRI IDGESIVPVL EGQTMNRQQP LLFAIDMPFQ
DDPTDMWALR DGDWKMIFDR NSKPKYLYNL KLDRGETMNQ LGKQPVLEQK MIAALARYQS
SIENDSLMKA RGDKPTPVDW N