Gene Ent638_1509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1509 
Symbol 
ID5114477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1664503 
End bp1666257 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content50% 
IMG OID640491697 
Productarylsulfotransferase 
Protein accessionYP_001176240 
Protein GI146311166 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.407507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCT CTCATGTCAA CGCTGCGCCC GCGCCCCGTC CGTCAATTGC TCAGGGGATT 
AATCCTGACG ATCCTATTGT GGTGAGCCAT AACCAGATTT ACAGCGCTTT AATGGAACAA
GTGACGACTT CGCATACCCA GGATAATGCC CTCGTATTTT TGGATCCCTT TACCACGTCA
CCGTTAAGCT TTTATATTGG CCTGTGGTCT GATGCTAGCG ATACGGCCAC CATTAGCGTA
TCTGATGCAC AGGGTAAATT CCCTATCTTG ACGTTTACCC AACCCGTTGT GGCGGGTGCC
AATTTAATTC CCGCTGCCGG ATTACTACCT GGCATTCAAA ATAAGATTAC GGTGATTTCT
GCATCAGGAT CAACGCTTTT ACCATTAATT GAAACCGCGC CTTTACCCCC GACCGATGCA
GAAGTCAGCG ATCCAACCGA CCCCGCCAAC TATAATTTAT TCCCGCAGAT TACGGTGAAT
TCACTCGCAA CCGATGAATC ACTTCTGGCC GACGGGCTTT ATTTTATCTC GTATTTTGAT
CGTAATAATC TTGCGCTGGA TAACAAAGGA AATGTGCGCT GGTATACGGT GAAATCGATG
CCATCGAATA ATTTATTGCG TCTGGAAAAT GGGCATTTCG TCTCTTCCGC TGTCGCCCAA
AGCGGTTATC TGAAGATGTA TGAATTCGAT ATGGTTGGCC GCGTTCATGC CATGTACGAT
CTTGATAACG CCTGTCATCA TTCGTTATAT CAGCAGTCTT CCACCTACGC CTATAAAGGC
GTGAATAACT GTTTGGTTGC TGCATCGGAA TATATGCCGG GCATGCGGCC TGACGGTGGA
TTAAGCATTG AGGATGGCGT ATCCATCATC AGTCTCGAAA CTGGTGAGGA GATTGATTAC
TACGATATGG TGCAGGTTTT AGGGTTGAGT CGCGCAACAC GTCCATCTAA CCCACCGGAT
ACGGCCGGTG GCACGCTTGA CTGGCTGCAT ATTAACCAGG CGTATATCAA CGAAACGAAT
AATATGTTGA TCACATCCGG TCGTAATCAG AGTGCCGTCT TTGGCCTGAA AGTGGGGACA
TACGACCTGA GCTTCATTAT GGGGACTCAC GGTGACTGGC CTGAAGAACT GAGCCGTTAT
CTGCTGACAC CTTTGAGAGC CGACGGGACG CCATACGACC TTACCGATCC TCAACAGGCG
CAAGAAGCGG ATGCCGTGTT CTGGAACTGG GGTCAACACA ATGTACTGGA AATCCCCAAT
GCGACGCCGG GCATCATTGA TATTTCGCTG TTCAATAACA GCAACTATCG CTCGCGATCT
GACGCCAACA GCGTGTTGCC GCAGGACAAT GAGAGCCGAA TTGGCCACTA CCGCATTAAT
CTGAATACCA TGACGGTACA AATGCTGGCT GAGTACACCT CTGGCGCAGA GGGCTACAGC
AGCTTGTGCG GCTGTAAGCA GGAAATGCCC AACGGGAATA TCGTCGTCAG CTTCGGCGGC
GCGCTGTTCG ACAGCAACGG ACTGCCGCTC ACCTGCGATC CGGGCTACAG CGATGTGGCT
TTAGAACCAG GAAACGGTGA CGTGGAAGGG CGGCTGCCGC TTCGTGAAAT GAATGCAGAG
GGCATAATTC TGCAGGATAT TACGATCAGC TCAGGGCTTT ATAGAAATAG CGGAAATATT
CCACCTTCAC AGACCGGATT TTATCGCTAT AACATCACGT GCTTCCGCAT GTATAAACTG
CCGCTATTTG GCTAA
 
Protein sequence
MTLSHVNAAP APRPSIAQGI NPDDPIVVSH NQIYSALMEQ VTTSHTQDNA LVFLDPFTTS 
PLSFYIGLWS DASDTATISV SDAQGKFPIL TFTQPVVAGA NLIPAAGLLP GIQNKITVIS
ASGSTLLPLI ETAPLPPTDA EVSDPTDPAN YNLFPQITVN SLATDESLLA DGLYFISYFD
RNNLALDNKG NVRWYTVKSM PSNNLLRLEN GHFVSSAVAQ SGYLKMYEFD MVGRVHAMYD
LDNACHHSLY QQSSTYAYKG VNNCLVAASE YMPGMRPDGG LSIEDGVSII SLETGEEIDY
YDMVQVLGLS RATRPSNPPD TAGGTLDWLH INQAYINETN NMLITSGRNQ SAVFGLKVGT
YDLSFIMGTH GDWPEELSRY LLTPLRADGT PYDLTDPQQA QEADAVFWNW GQHNVLEIPN
ATPGIIDISL FNNSNYRSRS DANSVLPQDN ESRIGHYRIN LNTMTVQMLA EYTSGAEGYS
SLCGCKQEMP NGNIVVSFGG ALFDSNGLPL TCDPGYSDVA LEPGNGDVEG RLPLREMNAE
GIILQDITIS SGLYRNSGNI PPSQTGFYRY NITCFRMYKL PLFG