Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1986 |
Symbol | |
ID | 4027070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2239021 |
End bp | 2243268 |
Gene Length | 4248 bp |
Protein Length | 1415 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637967182 |
Product | sulfotransferase |
Protein accession | YP_574037 |
Protein GI | 92114109 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.634546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAGA AACGCCACTC ATCAAGCACT AACAAGCTCA CGCTGCCGCA AGCACGCAAG CAGGTTCAGC GTCACCCCAC GGATCCTGAT GCCTGGCTTA CGCTAGGTAA GTTGCAAGCT GGCCAAAAAG AATTTTATGA CGCCAAGGCC TCATTGGAAA AAGCACGAGA GCTGCGTCCT AACCATCATG AACAGGAAGA ATGGTTGGGC TATGTCGCAC ACAAACAGCA TCGTCTCCAG GATGCCCTCC AGCATCTCCA GGATGCCTTG GAGCTGGCGC CCAATTCAGC ATTTGGCTTG GCAACCCTCT CTTACCTTTA TCTCGACATG GGAAAGCCAT ACAAATCCAT TGAATACGCT AAAAAAGCTT GGCAATTTTC TCCGAAGAGC TTGCGTGTGC TCGACTCTCT TGCTAATAGT CTAAGTGCCT TATATCGTTA CAATGAAGCT CTCGACATTT ACGACCAACT CATCAAGCTA ACACCAAGCA GCTACATACC TTGGAATAGT GCTGGCAACA TGTATCGAGA GTTGGGGTTG CTGAACAAAG CATATCGCTG TTATCAAAAA GCTAGCGCGC TAGCGCCACA CAATGCCATT CCCTATTCCA ATCATCTGAC TGCGTTGCAC TACGATCCCC GTGCCAGTCG CACCGAGATT GCGGCCTTTG CCAAATCCTG GGAAAAGCGT TTCGCCCCCG AAAAGAGCGC ACTCTCCCCC CGGCCTGCAC GCGTAGATAA ATCTTACCAG CGCCACTTGA AAGTCGGCTT GCTCTCGGAT GGCCTTCGCA ATCACCCCGT GGGCAAGATG ATCGTCCGCT GCCTGGAAAA CATACCACCG AACCAGATGA CACTGGTGGC CTATAGCAGT AGTGAAATCG ATGACGCGCT GACACGGCGC ATCAAGAATC AAACTCATAC ATGGTACCCT ATTCGACACC TGAACGATGA CGACCTCGTA CAGCAAATTC GTGATGACGA AATTGATGTT CTTATTGACC TATCAGGCCA TAATGCCGGT ACTCGAATGC GCGCGATTGC CATGCAGCCG GCACCATTGC TCGTCAAATG GGTCGGCGGG TTGATCAATA CCACGGGTGT GCAGGCCATC GATTACTTGA TCAGCGATCA TGTCGAGACG CCTGGAGGCG AAGACGAGTA CTATACCGAG AAACTCATCC GCCTGCCGGA CGACTATATC GTCTTCGATC CGCCTGCAAA GTTGCCTGCA TTGCGCGAAC TGCCGGCCAA GCGAAACGGC TATATCACAC TAGCGTGCTT CAACAACCCC ACCAAGCTCA ATGATGTCAC GCTCAAGCAG TGGGCAGGCA TCATGCACGA GCTGCCTGAT TCCCGCTTGA TGCTCAAGGG ACGCCCCTAC ACCAGTGAGA GTTTCTGCGA ACGCCTATAT GCCACACTTG AAGCTGCGGG CATTGCTCGC GAGCGTTTGA TCATCGAGGG GCCGGGCAGC AATTACGAGA TGCTGGACGC CTACAACCGA GCGGATATCG CCCTCGATCC CTGGCCCTAC TCGGGTGGTT TGACGACCTG CGAGGCCTTC ATCATGGGGG TCCCCGTGGT GACTCTCCCC GGCCCGACCT TTGCCGGCCG CCATAGCGCG ACACACCTGG TACACGCCGG CATGCCGGAA CTGGTAGTCA ACAGTTGGGA CGAATATCGT GCACGCGTGA TCGAGCTTGC CAGCGATCTG GAAAGCCTGG GCACCATTCG GCAGCATCTG CGTGACGTGT TGTTGCAATC TCCCGTGTGT GACGGCCCAC GCTTCGCCAA ACACTTCACC GATGCCATGC GCGCCATCTG GCAACGTTAC TGCGACGACA AGGCGCCCGC CTCGCTTACC TTCAACAAGG AAGGCGAAGC TCGGTTCGAG GACGAGGATG TCCCGGTGGA AATCCATTAC GCCGAGGCCC CCGAGGATGA CTCGACGTTT CAATGGCAAT TCGACGGCAA GCTCATCGCC GTGGACAACG GCGGGCAACT GCTTGAAAGC GACGTCGTTC GCCAACTGCT GCAAAAGGAA GCGCTCGAGC TGATCGCCTT CGATCCCAGC AGCCAAGCTC CGGACACCTC GCTCAAGCAG CACAAGGGCG TTCATTACTA CCCCAATGCG ACGCTGGGAG ACGGCCAGCC AGGCCAGTTG CATGCCTGTC TCGACCCCAA GTTGAGCGCG AGCCTGGCAC CGCTAGACGA CGAGTATCAG CCTGAGGCCA TCCGCAAAGG CAGCCAGGTG CTCACCCGGC TGCCGCTCAA TACCATCGCG CTGGATAGCA TCCAGGGCCT ACCCGCCATC GACTGGCTGG TATTGGATGG CCTGAATGAC GCCTCTGCGA TTCTCGACAA TGGCACCCAA GCGCTGAAAG ACACCCTGCT GCTTCAGGTT AAGGTCGCGT TTCAGCCCAC CCACGAACGG CAACCCAACC TGGCGGAAAT TCAGCATTGG GCAAGTCGTA ACGGCTTCCG CTTATATCGG CTGCATGAAC CGCAGCACCG CAGCCACCTC CCCGAGGAGG TACCGGAGGC ACAGCGCCAG GCTACAGAGC TGACAAGCGC AGACGCCCTG TTACTGCCCA GTTACGCTCG AATGGAAGCA CTCTCCGATA ATCAGCGCAT GCGGCTCGCG TTTCTGCTGC ATTCAATATA TGGCATTAAA GACATTACAT ATAATCTACT GGAGAATTCA GAAGATCAGA AAAGCCTCTC CTACCTGCAT GCGGAAGGAC TAAAAAAAAT TTTCCCGACA ACAGAAGGAA AAAACGAAAA ATATAGTGCA AACGAAAGCC AGATACTAGA CAAAACTATT TTTGTTGTAG GGTGCGGACA TAGCGGAACT ACTCTGATGG CTTCTTTACT GGGAGCTCAC CCTGAGGTCC ACACAATACC AAGAGAGACT TACTGGTTCT TGAACAACCC AAACCTAAAT AACGAATATT ACCAAGAAAA GCGCAAATCT CGAAGGGAAG GAAAAAGCAT TGTCTGCGAA AAGACGCCAA GGCATATCTA CAAAATAAAA GAAATAAAAG AAAAATTCCC AAATGCTCAA ATAATTGCCA TGACTCGTGA CAGCAAAGAT GTTGTATCAT CTTTGAAAAA ACGCTCTGGG AATTTCGAGC ACAGCGTACA GCGTTGGATA AGCGACAACA AAGCACTTCT GGAATTCAAG AACGAAAGTT GGATAAAGCT AGTAAAATAT GAGAACCTTG TCACGAGGAA GGAATCAGTT GTACATGAAA TACTATCTTT CCTAGACTTG ACTTACACGG AAGAAGTATT TGATTTCCAC AAAAAAAACT ACAAATGGTT TGGTATAGAA GATGCCAAAA AAACGGATGG CCGGGGCGAG GAAAACCATA TCTCGCTACG TTCATGGCAG ATGACACAGC CTATTCATGA CAACCGTGGC GTGTGGAAGA AAGGGCTAAG TAAAAAAGAA GTTGCTATAG TAGATGCCAA ATGCAAAAAT CTAGAGAAAA CCCTAGATTT TACAAAAGAA AGCACCTCAG AGAAAGGGAA GATCAAGGTT TTTGGTGACT CCCACATACA ATCGATACAA AATATTAATA AAAACAGCAA TGAAGAGTCA CTTCTAGTAC ATGTTATTCA TGGCGCGACC ATACTGGGAC TGGGAAAAAG GAAGTCAACA TTAGAAACAA GAAAAAAAAT AACAAACTCC TTAAACCCTA ATGACTTTGT CACCCTTGGG TTTGGCCAAG TAGACCTAGA ATTAGGGTTC TACTATCGAA AGATAATAAA AAAAGAAAGA ATTTCTCCTA ATTACTTTTT CGACTTACTT ATAGAGAACT ATAAAAGATT CATAATTGAA ATAAAAAACA ACTGCGCAGG AGTTGTTATA AAGGGCGTCA ACCTTCCAGT GCTGAAGGAA CATGACGCGG CCGTCAGCTA TGTATCCAGA ATAATAACAG AAAATATAAG CTGCGACACC ACAAAACACT CGCTATTAGC AGAGCTAAGA TCGAAGTACG ATTCTTATGA AAGTAGAGCA ATTTTCGCAC TGGAGTTAAA CAAAAAAATA AAAGGTCTTG CAAAAGACAC ACAATGCGAA TACTTCGACA TAAATAGCTA CATATCCAAA GAAAGCACAA ATGAGGTTGA CAACAAGTAT ATACCAGAAA ATATTGACCA TCATATCTTA GTTTCAGAAG ACCTTCGCAG CTATACAGTC CAGCTCATCG AGGAAAAAGT AAGGCAATTA ATAGCGAAGA AAAAATAA
|
Protein sequence | MSKKRHSSST NKLTLPQARK QVQRHPTDPD AWLTLGKLQA GQKEFYDAKA SLEKARELRP NHHEQEEWLG YVAHKQHRLQ DALQHLQDAL ELAPNSAFGL ATLSYLYLDM GKPYKSIEYA KKAWQFSPKS LRVLDSLANS LSALYRYNEA LDIYDQLIKL TPSSYIPWNS AGNMYRELGL LNKAYRCYQK ASALAPHNAI PYSNHLTALH YDPRASRTEI AAFAKSWEKR FAPEKSALSP RPARVDKSYQ RHLKVGLLSD GLRNHPVGKM IVRCLENIPP NQMTLVAYSS SEIDDALTRR IKNQTHTWYP IRHLNDDDLV QQIRDDEIDV LIDLSGHNAG TRMRAIAMQP APLLVKWVGG LINTTGVQAI DYLISDHVET PGGEDEYYTE KLIRLPDDYI VFDPPAKLPA LRELPAKRNG YITLACFNNP TKLNDVTLKQ WAGIMHELPD SRLMLKGRPY TSESFCERLY ATLEAAGIAR ERLIIEGPGS NYEMLDAYNR ADIALDPWPY SGGLTTCEAF IMGVPVVTLP GPTFAGRHSA THLVHAGMPE LVVNSWDEYR ARVIELASDL ESLGTIRQHL RDVLLQSPVC DGPRFAKHFT DAMRAIWQRY CDDKAPASLT FNKEGEARFE DEDVPVEIHY AEAPEDDSTF QWQFDGKLIA VDNGGQLLES DVVRQLLQKE ALELIAFDPS SQAPDTSLKQ HKGVHYYPNA TLGDGQPGQL HACLDPKLSA SLAPLDDEYQ PEAIRKGSQV LTRLPLNTIA LDSIQGLPAI DWLVLDGLND ASAILDNGTQ ALKDTLLLQV KVAFQPTHER QPNLAEIQHW ASRNGFRLYR LHEPQHRSHL PEEVPEAQRQ ATELTSADAL LLPSYARMEA LSDNQRMRLA FLLHSIYGIK DITYNLLENS EDQKSLSYLH AEGLKKIFPT TEGKNEKYSA NESQILDKTI FVVGCGHSGT TLMASLLGAH PEVHTIPRET YWFLNNPNLN NEYYQEKRKS RREGKSIVCE KTPRHIYKIK EIKEKFPNAQ IIAMTRDSKD VVSSLKKRSG NFEHSVQRWI SDNKALLEFK NESWIKLVKY ENLVTRKESV VHEILSFLDL TYTEEVFDFH KKNYKWFGIE DAKKTDGRGE ENHISLRSWQ MTQPIHDNRG VWKKGLSKKE VAIVDAKCKN LEKTLDFTKE STSEKGKIKV FGDSHIQSIQ NINKNSNEES LLVHVIHGAT ILGLGKRKST LETRKKITNS LNPNDFVTLG FGQVDLELGF YYRKIIKKER ISPNYFFDLL IENYKRFIIE IKNNCAGVVI KGVNLPVLKE HDAAVSYVSR IITENISCDT TKHSLLAELR SKYDSYESRA IFALELNKKI KGLAKDTQCE YFDINSYISK ESTNEVDNKY IPENIDHHIL VSEDLRSYTV QLIEEKVRQL IAKKK
|
| |