Gene Swit_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3141 
Symbol 
ID5199550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3453473 
End bp3455350 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content72% 
IMG OID640582688 
Productsulfotransferase 
Protein accessionYP_001263627 
Protein GI148556045 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.423755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGC CGCCGCCGCA TCTTTCGCCG ATCTTCGCCC GCATCAAGAT CGGCGACCTG 
CCCGGCGCCC GCGCCGCGGC CGAGCAGATA TTGCGCTCGC TGCCCGCCGA TCAGCCGCTG
CTGGCGCTCG CCGGCATGCT CGCCTGCCGG ACCGGCGACC TGCCCGGCGG CATCGCCCGG
CTGCGCGAGG CGCTGAAGCT CGCCCCCGGC GACCAGTCGA CCCGCAACAA CCTGGTCCGC
GCCCTGATCG AGACCGGCGC GCTCGACGAG GCGGCCGAGG CCTGCGCAGG GGGCGAGCGC
GATCCGAAGA TGCTGCGCCT GTCCGCCTAT ATCGAGCATC AGCGCGGCCA TCTCGACCGC
GCGGTCGCCG ACTATGAGGC GGTGGTCGCG GCGCTGCCCG ATGATTTCGA GAGCTGGAAC
AACCTCGGCA ACCTCTACGC CGCGACCGGC CATGGCGAGC GGGCCGAGGC GCCGCTGCGC
ACGGCGATCG CGCTCCGCCC CGACATCGCG CTGCCCTATC TCAACCTCGC CAAGCTGCTC
GCCGGGCTGC AGCGGCCCGA GGACGGCGCC GAACTGCTGC GCGCCGCCGC CGCCGCGATC
CCCGACAATG CCGAGATCCG CGCCGAACTG GGCCTGGCCG AAGCCGCGCT CGGCGCTTTC
GACGCGGCCG AGCGGGCCTA TCGCGCCGCG ATCGACCTCT CGCCGGGCTT CACCCCCGCC
TGGCTCGACC TCGCCATGCA GCTCGACAGC CTCAACAAGG TCGACGCGCT GGCGGCGCTG
TCCGACGCGG CCCGGACGAA GGGCATCGGC GCGGCCGAGG GAGCCGGCTT CATCGAGGCC
GCCGCGCTGC GCCGCCAGGG CGCCCATGCC GAGGCGCTCG CCGTCGCCCG CCAGGTTCCC
GCGACGATCA ATCCGGTCCG CCGCAACCAG CTGATCGGCG AGCTCGCCGA CCGGACCGGC
GACACCGACC TCGCCTTCGA GGCCTTTTCG GCGATGAACG CCGCCGCCCG CGCCCAGGCG
CATCCCGAGG CGCTCGGCCC CGCCTTCATC GACGAGATCG TCGGCAATGG CGAGCGGCTG
ACGCCGGCCC GGATCGCCGC GTGGAGCAAG GTGGAGATCG ATCCCGCCCC ACCCGCGCCG
ATCTTCCTGG TCGGCTTCCC GCGCTCGGGG ACGACGCTGC TCGACACGCT GCTGATGAAC
ATCCCCTCGC TCCATGTGCT CGAGGAACTG CCCATCGTCC GCACGGTGCA GCGCGCGATG
GGCGATCCCG ATCGGCTGGA CAGCCTGACC GATGCCGAGG CCAATGCGCT GCGCCGCACC
TATTTCGAGA CGCTCGACGG CCTCGCGCCG CCCGCGCCGG GGCAGCGGAT CGTCGACAAA
TTCCCGCTGC ATATGGCGCG GATGGCCATG ATCCACCGGC TGTTCCCCGA CGCCAAGGTG
ATCTTCGTCG AACGGCATCC GTGCGATTGC GTGCTGAGCG GGTTCATGTC GAGCTTCGAG
CTCAACCCCG CGATGCTGAG CTTCACCTCG CTGGAGGGCG CCGCGAGGCT CTACGACGCG
GCCGCGACCG CCTGGACGCG GGCCGAGGCG CTGTTGCCGA TCGACGTCTG CCGGATTCGC
TACGAGACGA TGATCGAGGA TCTGGAAGGG GAGATGCGCC GGCTGCTCGA CTTCCTCGGC
ATCGCCTGGG ACGACGCCGT GCTCGACAAT CGCGGCAGCG CGGCCCGGCG CGAGCATATC
CGCACCGCCA GCTACGCGCA GGTCACAGAG CCGATCTACA ACCGCGCGAT CGGCCGCTGG
GAACGCTATC GGTCGCGGAT GGCCGACATC CTGCCGACCC TCGCCCCGTG GGCCGAGAGG
ATGGGCTATC GGGTCTGA
 
Protein sequence
MTLPPPHLSP IFARIKIGDL PGARAAAEQI LRSLPADQPL LALAGMLACR TGDLPGGIAR 
LREALKLAPG DQSTRNNLVR ALIETGALDE AAEACAGGER DPKMLRLSAY IEHQRGHLDR
AVADYEAVVA ALPDDFESWN NLGNLYAATG HGERAEAPLR TAIALRPDIA LPYLNLAKLL
AGLQRPEDGA ELLRAAAAAI PDNAEIRAEL GLAEAALGAF DAAERAYRAA IDLSPGFTPA
WLDLAMQLDS LNKVDALAAL SDAARTKGIG AAEGAGFIEA AALRRQGAHA EALAVARQVP
ATINPVRRNQ LIGELADRTG DTDLAFEAFS AMNAAARAQA HPEALGPAFI DEIVGNGERL
TPARIAAWSK VEIDPAPPAP IFLVGFPRSG TTLLDTLLMN IPSLHVLEEL PIVRTVQRAM
GDPDRLDSLT DAEANALRRT YFETLDGLAP PAPGQRIVDK FPLHMARMAM IHRLFPDAKV
IFVERHPCDC VLSGFMSSFE LNPAMLSFTS LEGAARLYDA AATAWTRAEA LLPIDVCRIR
YETMIEDLEG EMRRLLDFLG IAWDDAVLDN RGSAARREHI RTASYAQVTE PIYNRAIGRW
ERYRSRMADI LPTLAPWAER MGYRV