Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1509 |
Symbol | |
ID | 5114477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1664503 |
End bp | 1666257 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640491697 |
Product | arylsulfotransferase |
Protein accession | YP_001176240 |
Protein GI | 146311166 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.407507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCT CTCATGTCAA CGCTGCGCCC GCGCCCCGTC CGTCAATTGC TCAGGGGATT AATCCTGACG ATCCTATTGT GGTGAGCCAT AACCAGATTT ACAGCGCTTT AATGGAACAA GTGACGACTT CGCATACCCA GGATAATGCC CTCGTATTTT TGGATCCCTT TACCACGTCA CCGTTAAGCT TTTATATTGG CCTGTGGTCT GATGCTAGCG ATACGGCCAC CATTAGCGTA TCTGATGCAC AGGGTAAATT CCCTATCTTG ACGTTTACCC AACCCGTTGT GGCGGGTGCC AATTTAATTC CCGCTGCCGG ATTACTACCT GGCATTCAAA ATAAGATTAC GGTGATTTCT GCATCAGGAT CAACGCTTTT ACCATTAATT GAAACCGCGC CTTTACCCCC GACCGATGCA GAAGTCAGCG ATCCAACCGA CCCCGCCAAC TATAATTTAT TCCCGCAGAT TACGGTGAAT TCACTCGCAA CCGATGAATC ACTTCTGGCC GACGGGCTTT ATTTTATCTC GTATTTTGAT CGTAATAATC TTGCGCTGGA TAACAAAGGA AATGTGCGCT GGTATACGGT GAAATCGATG CCATCGAATA ATTTATTGCG TCTGGAAAAT GGGCATTTCG TCTCTTCCGC TGTCGCCCAA AGCGGTTATC TGAAGATGTA TGAATTCGAT ATGGTTGGCC GCGTTCATGC CATGTACGAT CTTGATAACG CCTGTCATCA TTCGTTATAT CAGCAGTCTT CCACCTACGC CTATAAAGGC GTGAATAACT GTTTGGTTGC TGCATCGGAA TATATGCCGG GCATGCGGCC TGACGGTGGA TTAAGCATTG AGGATGGCGT ATCCATCATC AGTCTCGAAA CTGGTGAGGA GATTGATTAC TACGATATGG TGCAGGTTTT AGGGTTGAGT CGCGCAACAC GTCCATCTAA CCCACCGGAT ACGGCCGGTG GCACGCTTGA CTGGCTGCAT ATTAACCAGG CGTATATCAA CGAAACGAAT AATATGTTGA TCACATCCGG TCGTAATCAG AGTGCCGTCT TTGGCCTGAA AGTGGGGACA TACGACCTGA GCTTCATTAT GGGGACTCAC GGTGACTGGC CTGAAGAACT GAGCCGTTAT CTGCTGACAC CTTTGAGAGC CGACGGGACG CCATACGACC TTACCGATCC TCAACAGGCG CAAGAAGCGG ATGCCGTGTT CTGGAACTGG GGTCAACACA ATGTACTGGA AATCCCCAAT GCGACGCCGG GCATCATTGA TATTTCGCTG TTCAATAACA GCAACTATCG CTCGCGATCT GACGCCAACA GCGTGTTGCC GCAGGACAAT GAGAGCCGAA TTGGCCACTA CCGCATTAAT CTGAATACCA TGACGGTACA AATGCTGGCT GAGTACACCT CTGGCGCAGA GGGCTACAGC AGCTTGTGCG GCTGTAAGCA GGAAATGCCC AACGGGAATA TCGTCGTCAG CTTCGGCGGC GCGCTGTTCG ACAGCAACGG ACTGCCGCTC ACCTGCGATC CGGGCTACAG CGATGTGGCT TTAGAACCAG GAAACGGTGA CGTGGAAGGG CGGCTGCCGC TTCGTGAAAT GAATGCAGAG GGCATAATTC TGCAGGATAT TACGATCAGC TCAGGGCTTT ATAGAAATAG CGGAAATATT CCACCTTCAC AGACCGGATT TTATCGCTAT AACATCACGT GCTTCCGCAT GTATAAACTG CCGCTATTTG GCTAA
|
Protein sequence | MTLSHVNAAP APRPSIAQGI NPDDPIVVSH NQIYSALMEQ VTTSHTQDNA LVFLDPFTTS PLSFYIGLWS DASDTATISV SDAQGKFPIL TFTQPVVAGA NLIPAAGLLP GIQNKITVIS ASGSTLLPLI ETAPLPPTDA EVSDPTDPAN YNLFPQITVN SLATDESLLA DGLYFISYFD RNNLALDNKG NVRWYTVKSM PSNNLLRLEN GHFVSSAVAQ SGYLKMYEFD MVGRVHAMYD LDNACHHSLY QQSSTYAYKG VNNCLVAASE YMPGMRPDGG LSIEDGVSII SLETGEEIDY YDMVQVLGLS RATRPSNPPD TAGGTLDWLH INQAYINETN NMLITSGRNQ SAVFGLKVGT YDLSFIMGTH GDWPEELSRY LLTPLRADGT PYDLTDPQQA QEADAVFWNW GQHNVLEIPN ATPGIIDISL FNNSNYRSRS DANSVLPQDN ESRIGHYRIN LNTMTVQMLA EYTSGAEGYS SLCGCKQEMP NGNIVVSFGG ALFDSNGLPL TCDPGYSDVA LEPGNGDVEG RLPLREMNAE GIILQDITIS SGLYRNSGNI PPSQTGFYRY NITCFRMYKL PLFG
|
| |