Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2413 |
Symbol | |
ID | 3785505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2749415 |
End bp | 2750524 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812502 |
Product | transglutaminase-like |
Protein accession | YP_413094 |
Protein GI | 82703528 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.329453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA GGGATTTTAT CAAACTGGCG GGGGTAAGCG CCGCATTATT TCCCGTGGCG CCCTCAGTAT TCGGCCAGCA GTCTGGTCAA CCGGTTATTC CGCCGCGACG CTACACTTAT CGTGTAACTT ATAACATCGA TCTTCCGGGG GATGGAAAAA AAGCACGTTT GTGGCTACCG CTGCCGGATA CGGAGGATTC TCCCCACCAG TTTTCCCAGG GAAGTGTCTG GAGCGGGACC GCAAGCACGG CCAGATTCGA GAACGTTCCT GGAACAACCT CACCCATGTT TTACGCTGAA TGGAACCGTA GCGGCCCCCG CAGCGTAACG GTGAGCAGCG TGATCAAAAC ATCGGACCGC GCTGTCAACC TGGAGCGCTA CAGGGAGGGT AATTCGAGCA CCCTTCCCGC GGATGTGAAA CGTTATCTGC AGCCCACCAA ATTTATCCCG TTGGATGGCA TCGTCCGCAA AACTGCGCTA TCCATTACCA AGGAGGCCAA AGCCCAATCG CAGCTGCAGA AAGCGCGCGC GATATATGAT TGGGTAGTCG AAAATTCCTA TCGCGACCCG TCCACGCGCG GCTGTGGACG GGGTAATATC AAAGCCATGC TGGAAACCGG TCATCTTGGC GGTAAATGCG CCGATCTGAA CGCATTGTTT GTAGGACTGG CGCGCGCGGC GGGCATTCCG GCCAGGGACA ACTATGGAAT CCGCATTGAC GAGTCTGCGG CGCATAAAAC GCTTGGCCAA GCCGACGATA TCACCACTGC CCAGCACTGC CGCCCTGAAT TTTACCTGAC CGGCCTTGGC TGGGTCCCTG TTGATCCCGC GGATGTGCGG CAACTGGCAC TGGATGAAGA ACTTCCCATT GAGCACCCAC GAGTGATCGA GCTACGTGAA AAACTTTTTG GTTCATGGGA AATGAACTGG GTGGCATTCA ACCACGGCAG GGATATCAGG CTGGCGCGAG ACAGCGTCCT GGGTGAACTG CCATTTTTCA TGTACCCTCA GGCCGAAGTA GCGGGACACG AGCGGGACAG CCTTGAACCG GCGGAGTTTG CTTACAAGAT AACCTCAGCC CGGCTGGTGG GTACGGGGAT CAAGTTTTAG
|
Protein sequence | MKRRDFIKLA GVSAALFPVA PSVFGQQSGQ PVIPPRRYTY RVTYNIDLPG DGKKARLWLP LPDTEDSPHQ FSQGSVWSGT ASTARFENVP GTTSPMFYAE WNRSGPRSVT VSSVIKTSDR AVNLERYREG NSSTLPADVK RYLQPTKFIP LDGIVRKTAL SITKEAKAQS QLQKARAIYD WVVENSYRDP STRGCGRGNI KAMLETGHLG GKCADLNALF VGLARAAGIP ARDNYGIRID ESAAHKTLGQ ADDITTAQHC RPEFYLTGLG WVPVDPADVR QLALDEELPI EHPRVIELRE KLFGSWEMNW VAFNHGRDIR LARDSVLGEL PFFMYPQAEV AGHERDSLEP AEFAYKITSA RLVGTGIKF
|
| |