Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_2109 |
Symbol | |
ID | 5078314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 2418988 |
End bp | 2421117 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640499271 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001183629 |
Protein GI | 146293205 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTAT CAAAGCCTAC GGCAGAAGTT ATCACAGCCC CACAAATCGG TGACAATATC AGCCGTCAAA CTCTCTTTTG GCTGCTACTG ACTAACCTTG CCGTATTAAG CCCCTTATTT GATAAAGCGA CCCCTTGGAC ACTTGGGATC TGCGCAATTT GTCTACTTTG GCGCATAGGC ATTTATATAG GCAAAGTGGC GAAACCGCCG CGCTTTTTAG TGACAAGTTT GGCCATTGGT GCTGCCATCA CCTTAGCACT GGTCTCAGGC CAAATGGGAA TTCTCAATGC CTTAATCAAT CTGCTTATTT TAGGTTATGC CTTAAAATAC ATTGAGATGC GTAATCAAAG GGATGTCCGA GCTGTAGTGG TTGTTGGCTA TTTTCTAATC GCGCTAACGT TTATCGATCA CCAATCGATG CTAAGCACTG CGCATTTACT CGCAGTGACA GTGGTGAATA CCTGCGTACT TGTGACGTTA TATCAAAGTC AATACACATG GCGCCATACA TTGTGGTTTG GGGGCAAGTT GCTGTTACAG AGTGTACCGC TCGCGTTATT GCTCTTTTTA GTCTTGCCCC GCTTTACACC GCTCTGGCTC GTCCCCAATA TGAAAGAGGC CAAAACAGGC TTATCCGATA CCCTTGCCAT TGGTGATATT AATAAACTGA CCCGCTCCAC CGAGCTTGCC TTTAGAGCAA GTTTTACAGA ACACACGCCA CATCATGCTG ACCTTTATTG GCGGGCATTG GTGATGGAAG ATTACGACGG AGTGACATGG CGACAAGAGA AAGGGATCAA AAAACTGCAA CAGGATGCCT TGATTTCCCC CCCCTCTAGA CCAAGCCCTG CCCTACAGAT CTCCACGTTA AACATGAAGG ACAATCAAGC AAAAGTTCGC CGTCAACAGG TGCAGGTTTA TCGATATCAG ATCATTGCCG AGCCAAGTCA TCAGCCATGG CTTTTTGGTC TTGATGTGGC CTACAGCGAA GATAAAGCGG TAGTCAATCT ACCCGATTAT CGCATTGTAG CCCTACGTAA TCTCGATCAG CGAATGAGCT ATCAGCTTGA TTCTTGGCCC CATGCCAAAA TGGATCTTGT ACTCTCTTCT CGGCAAAGGG ATATCAATCT GGCGCTACCA GATAAAAGTA ATCCACGTAC CCTGCAACTT GCGCAGCAAT TTAAGCAAAC CTACCCCGAT GCCAAACAAC GTCTAGGGGC CATAATGCAA TATTTCAGCA CTGAGCCCTT TTTCTACACG CTTACACCAC CTACGCTTGG TCCACAGCAA GTGGATGATT TTCTGTTTGA AAATAAAGCG GGTTTTTGCG TGCATTATGC AACTGCATTG ATTTTTATGG CGCGGGCGAC GGGCCTCCCC GCGCGTATGG TCACGGGCTA TCAAGGCGGT GAATATAATC CCAAAGCGGG ATATCTGAGC GTATATCAAT ATATGGCACA CGCGTGGGCC GAAGTTTGGC TTGAAAACGA AGGTTGGGTA CGTTTCGACC CCACAGCTAT GGTGGCGCCT AATCGTATTG AACTCGGTTT TGATGCCGAA TTTACACCAG AGGACAGTTA CCTAAAAGAA AGCCCTTTTA GTAGCTTACG CTTTAAATCC ACACCATGGC TCAATGAGCT AAGACAACGC TTTGCCAGCA TAGATTACTA CTGGAGTGTG TGGGTATTAG GCTTTAATCA AGAAAGACAA AATCAGGTGC TCAGTCATAT TTTAGGGGAT GTCACTAAAA CCAAAGTAGC GGTATTTATG GGGTTGTGCC TAAGTTTGAT TGGACTTTAT ATCGCCTACA GCGTGGGATT ATTTCAGCGT AAGCAGACAA ATGATCCGAT AAGTTCACGC TATCAAAAAA TTTGTTTACG CCTTGCAGCA CATGGGATAA CGCGTCAAGA AGGTCAAGGC CCGAATGATT TTGCCGCGCA AGTGATGAAA GTTTATCAAC ATAAAGCACC TAACTTTTGC ACCCTATTTA ATGCATTAAC CCAAAGCTAT GTCGCGCTTA AATATCAAAA TTTATCCCCC AAAGTATATC AACAACAGTT AAGGCAATTT AACAACACGG CAAAGGCATT ATATTGGCGA TTAATACGGC CGAATAACCC ATTAAAATAG
|
Protein sequence | MSLSKPTAEV ITAPQIGDNI SRQTLFWLLL TNLAVLSPLF DKATPWTLGI CAICLLWRIG IYIGKVAKPP RFLVTSLAIG AAITLALVSG QMGILNALIN LLILGYALKY IEMRNQRDVR AVVVVGYFLI ALTFIDHQSM LSTAHLLAVT VVNTCVLVTL YQSQYTWRHT LWFGGKLLLQ SVPLALLLFL VLPRFTPLWL VPNMKEAKTG LSDTLAIGDI NKLTRSTELA FRASFTEHTP HHADLYWRAL VMEDYDGVTW RQEKGIKKLQ QDALISPPSR PSPALQISTL NMKDNQAKVR RQQVQVYRYQ IIAEPSHQPW LFGLDVAYSE DKAVVNLPDY RIVALRNLDQ RMSYQLDSWP HAKMDLVLSS RQRDINLALP DKSNPRTLQL AQQFKQTYPD AKQRLGAIMQ YFSTEPFFYT LTPPTLGPQQ VDDFLFENKA GFCVHYATAL IFMARATGLP ARMVTGYQGG EYNPKAGYLS VYQYMAHAWA EVWLENEGWV RFDPTAMVAP NRIELGFDAE FTPEDSYLKE SPFSSLRFKS TPWLNELRQR FASIDYYWSV WVLGFNQERQ NQVLSHILGD VTKTKVAVFM GLCLSLIGLY IAYSVGLFQR KQTNDPISSR YQKICLRLAA HGITRQEGQG PNDFAAQVMK VYQHKAPNFC TLFNALTQSY VALKYQNLSP KVYQQQLRQF NNTAKALYWR LIRPNNPLK
|
| |