Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3141 |
Symbol | |
ID | 5199550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 3453473 |
End bp | 3455350 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640582688 |
Product | sulfotransferase |
Protein accession | YP_001263627 |
Protein GI | 148556045 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.423755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGC CGCCGCCGCA TCTTTCGCCG ATCTTCGCCC GCATCAAGAT CGGCGACCTG CCCGGCGCCC GCGCCGCGGC CGAGCAGATA TTGCGCTCGC TGCCCGCCGA TCAGCCGCTG CTGGCGCTCG CCGGCATGCT CGCCTGCCGG ACCGGCGACC TGCCCGGCGG CATCGCCCGG CTGCGCGAGG CGCTGAAGCT CGCCCCCGGC GACCAGTCGA CCCGCAACAA CCTGGTCCGC GCCCTGATCG AGACCGGCGC GCTCGACGAG GCGGCCGAGG CCTGCGCAGG GGGCGAGCGC GATCCGAAGA TGCTGCGCCT GTCCGCCTAT ATCGAGCATC AGCGCGGCCA TCTCGACCGC GCGGTCGCCG ACTATGAGGC GGTGGTCGCG GCGCTGCCCG ATGATTTCGA GAGCTGGAAC AACCTCGGCA ACCTCTACGC CGCGACCGGC CATGGCGAGC GGGCCGAGGC GCCGCTGCGC ACGGCGATCG CGCTCCGCCC CGACATCGCG CTGCCCTATC TCAACCTCGC CAAGCTGCTC GCCGGGCTGC AGCGGCCCGA GGACGGCGCC GAACTGCTGC GCGCCGCCGC CGCCGCGATC CCCGACAATG CCGAGATCCG CGCCGAACTG GGCCTGGCCG AAGCCGCGCT CGGCGCTTTC GACGCGGCCG AGCGGGCCTA TCGCGCCGCG ATCGACCTCT CGCCGGGCTT CACCCCCGCC TGGCTCGACC TCGCCATGCA GCTCGACAGC CTCAACAAGG TCGACGCGCT GGCGGCGCTG TCCGACGCGG CCCGGACGAA GGGCATCGGC GCGGCCGAGG GAGCCGGCTT CATCGAGGCC GCCGCGCTGC GCCGCCAGGG CGCCCATGCC GAGGCGCTCG CCGTCGCCCG CCAGGTTCCC GCGACGATCA ATCCGGTCCG CCGCAACCAG CTGATCGGCG AGCTCGCCGA CCGGACCGGC GACACCGACC TCGCCTTCGA GGCCTTTTCG GCGATGAACG CCGCCGCCCG CGCCCAGGCG CATCCCGAGG CGCTCGGCCC CGCCTTCATC GACGAGATCG TCGGCAATGG CGAGCGGCTG ACGCCGGCCC GGATCGCCGC GTGGAGCAAG GTGGAGATCG ATCCCGCCCC ACCCGCGCCG ATCTTCCTGG TCGGCTTCCC GCGCTCGGGG ACGACGCTGC TCGACACGCT GCTGATGAAC ATCCCCTCGC TCCATGTGCT CGAGGAACTG CCCATCGTCC GCACGGTGCA GCGCGCGATG GGCGATCCCG ATCGGCTGGA CAGCCTGACC GATGCCGAGG CCAATGCGCT GCGCCGCACC TATTTCGAGA CGCTCGACGG CCTCGCGCCG CCCGCGCCGG GGCAGCGGAT CGTCGACAAA TTCCCGCTGC ATATGGCGCG GATGGCCATG ATCCACCGGC TGTTCCCCGA CGCCAAGGTG ATCTTCGTCG AACGGCATCC GTGCGATTGC GTGCTGAGCG GGTTCATGTC GAGCTTCGAG CTCAACCCCG CGATGCTGAG CTTCACCTCG CTGGAGGGCG CCGCGAGGCT CTACGACGCG GCCGCGACCG CCTGGACGCG GGCCGAGGCG CTGTTGCCGA TCGACGTCTG CCGGATTCGC TACGAGACGA TGATCGAGGA TCTGGAAGGG GAGATGCGCC GGCTGCTCGA CTTCCTCGGC ATCGCCTGGG ACGACGCCGT GCTCGACAAT CGCGGCAGCG CGGCCCGGCG CGAGCATATC CGCACCGCCA GCTACGCGCA GGTCACAGAG CCGATCTACA ACCGCGCGAT CGGCCGCTGG GAACGCTATC GGTCGCGGAT GGCCGACATC CTGCCGACCC TCGCCCCGTG GGCCGAGAGG ATGGGCTATC GGGTCTGA
|
Protein sequence | MTLPPPHLSP IFARIKIGDL PGARAAAEQI LRSLPADQPL LALAGMLACR TGDLPGGIAR LREALKLAPG DQSTRNNLVR ALIETGALDE AAEACAGGER DPKMLRLSAY IEHQRGHLDR AVADYEAVVA ALPDDFESWN NLGNLYAATG HGERAEAPLR TAIALRPDIA LPYLNLAKLL AGLQRPEDGA ELLRAAAAAI PDNAEIRAEL GLAEAALGAF DAAERAYRAA IDLSPGFTPA WLDLAMQLDS LNKVDALAAL SDAARTKGIG AAEGAGFIEA AALRRQGAHA EALAVARQVP ATINPVRRNQ LIGELADRTG DTDLAFEAFS AMNAAARAQA HPEALGPAFI DEIVGNGERL TPARIAAWSK VEIDPAPPAP IFLVGFPRSG TTLLDTLLMN IPSLHVLEEL PIVRTVQRAM GDPDRLDSLT DAEANALRRT YFETLDGLAP PAPGQRIVDK FPLHMARMAM IHRLFPDAKV IFVERHPCDC VLSGFMSSFE LNPAMLSFTS LEGAARLYDA AATAWTRAEA LLPIDVCRIR YETMIEDLEG EMRRLLDFLG IAWDDAVLDN RGSAARREHI RTASYAQVTE PIYNRAIGRW ERYRSRMADI LPTLAPWAER MGYRV
|
| |