Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_4758 |
Symbol | |
ID | 5197121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 5234759 |
End bp | 5238136 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640584314 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001265233 |
Protein GI | 148557651 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.446948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0794526 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAG CTCCAACTCT TCATTGCACC GCACAACGGG ATGTGCTGAT CCTTCCATCC ATGATTCAGG CGGCGCTGCA CCATCAGACG GTCTACCGCT ACGACCACCC GGTCCCGCTG GGCCCCCAGC TCATCCGCCT GCGGCCGGCG CCGCACAGCC GGACGGCGGT GAACAATTAT TCGCTGCGCA TCGCCCCGGA AAACCACTTC ATCAACTGGC AGCAGGACCC GCACGGCAAC TGGCTGGCGC GGCTGGTGTT TCCCGAACGG ACCGACGAAT TCTCGATCAC CGTCGACCTG ATCGCCGACC TGGTGGTGGT CAATCCGTTC GACTTCTTCG TCGAGGATTA TGCCGAGCAG CGGCCCTTCG CCTATGCGCC CGCGATCGCC GCCGACCTCG CCGCCTATTT CGAGATCGAG CCGCAAGGGC CGCTGTTCGA CGGATTCGTC GCGCCCTTCG TCGATCTCCG CGCCCGCACG ATCGACTTCC TCGTCGACCT CAACGTCGCG ATCCATCGGC GGGTCAACTA TGTCATCCGC ATGGAGCCGG GCGTGCAGAC GCCCGAGGAG ACGCTGGCGG CCGGCACCGG AAGCTGCCGC GATTCGGCCT GGCTGCTGGT CCAGGTCGCC CGGCGCCTCG GCTTCGCCGC GCGTTTCGTC TCTGGCTATT CGATCCAGCT CACCCCCGAC ATCGTCCCGA TCGACGGACC CAAGGGCGTC GCCGCCGACG TCTGCGACCT CCATGCCTGG GCGGAGGTCT ATGTGCCCGG CGCCGGCTGG ATCGGAATGG ACGCGACCTC GGGCATGTTC GCGGGGGAGG GGCACATCCC GCTCGCCGCG ACTCCGCACC ACCGGTCGGC GACCCCGATC GAGGGCGCGC TGCTCGAACC CGCGCATGTC GATTTCCATT TCGACATGGA CGTCCAGCGC ATCGCCGAGG CGGTGCGCAT CACCAAGCCG TTCACCGACG CCCGCTGGGA GGCGCTCGAC GCGCTGGGCG AGGCGGTCGA CGCCGATCTG GTCGCGCAGG ACGTGCGGCT GACGACGGGC GGGGAGCCGA CCTTCGTCGC CCTCGACGAT CCGGAGGCGC CCGAGTGGAA CGGCGACGCG GTCGGCCCGA CCAAGGCCGG CTATGCCGAC CGGCTGATCC GCAGGCTGCG CGAGCGCTTC GCGCCGGGCG GCCTGCTCCA CCATGGCCAG GGCAAATGGT ATCCGGGGGA GAGCCTGCCG CGCTGGGGCT ATTCGGTCTA TTGGCGGACC GACGGCGTGC CGGTGTGGAA CGACGCCGCG CTGATCGCGG GCGAGGGGAC CGATGGGGCG GGTGCGACGG TCGGGGCCGA GCAGGCCGAG GCCTTCCTGT CCGCGATGGC CGAGGGGCTG GGCGTCGGCG CGGCGATGGT CGCGCCGGCC TATGAGGACC CGGTCGACTG GTCGGTGAAG GAGGCGCAGC TTCCCGCCAA TGTCGACCCG AGCGATCCGA AGATCGACGA TCCCGAGGCG CGCGCCCGCA TGGTCAAGGC GTTCGAGCGG GGGCTGGGCA GCCCGGTCGG CTATGTCCTG CCGATCCAGC GCTGGAATGC GCGGCAGGCG GCGACGCCGG GGCGCTGGCG GTCGGAACGC TGGAGCCTGC GGCGCGGCAA GCTGTTCGCG GTGGCCGGCG ACAGCGCGCT CGGCTATCGC CTGCCGCTCG GATCGCTGCC GCATGTGCCG GCCGCCGACT ATCCCTATCT CCACCCGCGC GACACCACCG AGCCGCGCGA GCCGATGCCC GACTTCTACC GCCAGCGGAT CGAGGCGGTC GCGCCCGCCG CCGGCGCCGC CGCGCAGGAG CGGCAGGAGC AGGTGTTGAT CGAGGGCGCC GTCCGCACCG CGATCACGGT CGAGGCGAAG GACGGTCATG TCGCGGTGTT CCTGCCCCCG ACCGAGCGGC TCGAAGACTA TCTGGAGCTG ATCGCCGAGA TCGAGGCGGC CGCCGCGCGC ACCCGCATCC CGGTGCGGGT CGAGGGCTAT GCGCCGCCGC CCGATCCGCG CCTCAACCTG CTGAAGGTGA CGCCCGACCC CGGCGTGATC GAGGTCAACG TCCAGCCGGC GGCGAGCTGG CGCGAGACGG TCGAGATCAC CACCGGCCTC TACGATCTGG CGCGCGAGAC CGGGCTGACC GCCGACAAGT TCATGGTCGA CGGCCGCCCG ATCGGCACCG GCGGCGGCAA CCATATCGTG CTCGGTGGCC GCTCGGTGAA CGATTCGCCC TTCATCCGCC GCCCCGACCT GCTCAAGAGC TTCCTGCTCT ACTGGCAGCG GCATCCCTCC CTCAGCTATC TCTTCTCGGG GCTGTTCATC GGCCCGACCA GCCAGGCGCC GCGCATCGAC GAGGCGCGCC ACGACGGCCT CTACGAACTG GAGATCGCGC TGGCGCAGGT GCCCGCGCCG GGCGAGGGCG AGGCGCCGCC GCCCTGGGTG GTCGACCGGC TGTTCCGCAA CCTGCTGGTC GACGTGACGG GCAACACCCA CCGCACCGAG ATCTGCATCG ACAAGATGTT CTCGCCCGAC GGGCCGACCG GCCGGCTCGG CCTGCTCGAG TTCCGCGGCT TCGAGATGCC GCCCGAGCCG AAGATGAGCC TGGCCCAGCA ATTGCTGCTG CGCGCGCTCA CCGCCTGGTT CTGGCGCGAG CCGCAGAAGG GCGGGCTGGT CCGCTGGGGC ACCGCGCTGC ACGACCGCTT CCTGTTGCCG CATTTCGTCT GGGCCGATTT CCTGGAGGTG CTCGGCGACC TGCGCCATGC CGGCTACGGC TTCGATCCCG CCTGGTTCGA GGCGCAGCGC CAGTTCCGCT TCCCGGTCCA CGGCACCGTC TCGGCGGGCG GGGTGACGCT GGAGGTCGCG CATGCGCTGG AGCCGTGGCA CGTGCTGGGG GAGACCGGCG TGATCGGCGG CACCGTGCGC TATGTCGACA GCTCGACCGA GCGGCTCCAG CTGCGCGCCA CCGGGCTGGT GCCGGGCCGC CACGTCGTCG CGGTCAACGG CCGCGCGGTG CCGATGACGC CGACCGGGGT GCCCGGCGAG GCGGTCGGCG GGGTGCGCTA CAAGGCGTGG AAGCCGGCCA ACTGCCTCCA CCCGCTGCTC GACGCCGACG CGCCGCTGAC CATCGACGTG CTCGACAAGT GGAACGAACG TTCGCTAGGC GGGTGCGTCT ATCATGTCGC GCATCCGGGA GGCCGCAACT ATGACACCGT GCCGGTCAAC GACCTGGAGG CCGAGGCGCG GCGGCGGGCG CGGTTCCAGG ACCATGGCCA TACGCCGGGC CCGGTGACGA TCCCGATGCC GGAACGGGCG AGCGAGTTCC CGATGACGCT CGACCTGCGC GTCCCGCCGG GCTGGTAG
|
Protein sequence | MTKAPTLHCT AQRDVLILPS MIQAALHHQT VYRYDHPVPL GPQLIRLRPA PHSRTAVNNY SLRIAPENHF INWQQDPHGN WLARLVFPER TDEFSITVDL IADLVVVNPF DFFVEDYAEQ RPFAYAPAIA ADLAAYFEIE PQGPLFDGFV APFVDLRART IDFLVDLNVA IHRRVNYVIR MEPGVQTPEE TLAAGTGSCR DSAWLLVQVA RRLGFAARFV SGYSIQLTPD IVPIDGPKGV AADVCDLHAW AEVYVPGAGW IGMDATSGMF AGEGHIPLAA TPHHRSATPI EGALLEPAHV DFHFDMDVQR IAEAVRITKP FTDARWEALD ALGEAVDADL VAQDVRLTTG GEPTFVALDD PEAPEWNGDA VGPTKAGYAD RLIRRLRERF APGGLLHHGQ GKWYPGESLP RWGYSVYWRT DGVPVWNDAA LIAGEGTDGA GATVGAEQAE AFLSAMAEGL GVGAAMVAPA YEDPVDWSVK EAQLPANVDP SDPKIDDPEA RARMVKAFER GLGSPVGYVL PIQRWNARQA ATPGRWRSER WSLRRGKLFA VAGDSALGYR LPLGSLPHVP AADYPYLHPR DTTEPREPMP DFYRQRIEAV APAAGAAAQE RQEQVLIEGA VRTAITVEAK DGHVAVFLPP TERLEDYLEL IAEIEAAAAR TRIPVRVEGY APPPDPRLNL LKVTPDPGVI EVNVQPAASW RETVEITTGL YDLARETGLT ADKFMVDGRP IGTGGGNHIV LGGRSVNDSP FIRRPDLLKS FLLYWQRHPS LSYLFSGLFI GPTSQAPRID EARHDGLYEL EIALAQVPAP GEGEAPPPWV VDRLFRNLLV DVTGNTHRTE ICIDKMFSPD GPTGRLGLLE FRGFEMPPEP KMSLAQQLLL RALTAWFWRE PQKGGLVRWG TALHDRFLLP HFVWADFLEV LGDLRHAGYG FDPAWFEAQR QFRFPVHGTV SAGGVTLEVA HALEPWHVLG ETGVIGGTVR YVDSSTERLQ LRATGLVPGR HVVAVNGRAV PMTPTGVPGE AVGGVRYKAW KPANCLHPLL DADAPLTIDV LDKWNERSLG GCVYHVAHPG GRNYDTVPVN DLEAEARRRA RFQDHGHTPG PVTIPMPERA SEFPMTLDLR VPPGW
|
| |