Gene Information |
Locus tag | Nmul_A2032 |
Symbol | |
ID | 3784582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2327271 |
End bp | 2328368 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637812121 |
Product | thiosulphate-binding protein |
Protein accession | YP_412719 |
Protein GI | 82703153 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
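The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record; the coordinate convention
# (1-based, inclusive) is an assumption.
START_BP = 2327271
END_BP = 2328368


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1098 365
```

Under these assumptions the results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.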
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCCG ATAGTTCCCG TTTTGCTGTA GGTATATTTA TAGTGTTGGT GAGTCTCGGC GGCGCCTACG CTACTCTGAA TAATTTTTCC GATCGTCCTG CCGGCTCCGC TGAGCGGTTA CACGAGGAAT TGAACGTCGG TTTTGCCTCT CACTGGAGAG CTCGGACGGG CGTGAACATC AAGGTCGATC AGGCCCGGAG CAGGTCGGGG AAGCCCGTAC ATATTACGCT CGATGGGCTT GATATTCCCG CCCTTGCGCT GTCCTACGAT GTGGACAAGC TGCATGATAA GGAAAGATTT ATTGCACCTG ACTTCCGTCA GCTTTTAGCG CAGGATTTTC GGACTGGCTC TTATCCTTCC CCCTATACCT CGACCATCGT ATTCCTGGTA AGGAAAGGCA ATCCAAAGAA ACTCAAGGAC TGGGGCGATC TGGTACGCTC GGATATAAAA GTAGTTACAC CCAATCCAAG GCATTCTGAA AGCGGCCGCT GGAATTATCT TGCGGCGTGG GGATACGCCG TGAGGCGATC AGGCGGCAGC GAACAAGCTG CGCGTGAATT TGTCAGTCAG TTGTTTGCCA ATGTCCAGAC AGTGGATTAC GAGGGTAAAA AGCCGGGAAA CTTGGGTGCC GCCTTTGTCT TTCGCAACAT CGGCGATGTA CTCCTCACCT GGGAAAATGA AGCGTACCTG ATCGTTCAGA ACAGTGGAGC CGATAAGTTT GAAGTCATCA CCCCGTCCAT ATCAATAGTG GCTGAACCCG CTATAAGTGT GGTGGACGCA GCTGCACGTG GGAAAAGCAC ACGCCGTGTA GCGGCTTCAT ACATCGAATA CTTATATACA CCCCAGGCGC AGCATATCGC CGCCAAGCAC TATTACCGTC CCCGCGATCC GGCCATTACC ACGAAGTACG TGGACAGGTT TCCGCGCCTT GAGTTGTTTA CAGTTGACGA GGTTTCCAGT GGCTGGCAGA AAGCCCAGAA AATACATTTT GCCAGGGGCG GTGTTTTTGA CCAGATCACC GGTGATGTTC CGAATTCTGT CGCCGTGAGG GGCGCTATCG ATAGGGACCA TATTCAAGCC GGCAATGCTA AAGGCTGA
|
Protein sequence | MTPDSSRFAV GIFIVLVSLG GAYATLNNFS DRPAGSAERL HEELNVGFAS HWRARTGVNI KVDQARSRSG KPVHITLDGL DIPALALSYD VDKLHDKERF IAPDFRQLLA QDFRTGSYPS PYTSTIVFLV RKGNPKKLKD WGDLVRSDIK VVTPNPRHSE SGRWNYLAAW GYAVRRSGGS EQAAREFVSQ LFANVQTVDY EGKKPGNLGA AFVFRNIGDV LLTWENEAYL IVQNSGADKF EVITPSISIV AEPAISVVDA AARGKSTRRV AASYIEYLYT PQAQHIAAKH YYRPRDPAIT TKYVDRFPRL ELFTVDEVSS GWQKAQKIHF ARGGVFDQIT GDVPNSVAVR GAIDRDHIQA GNAKG
|
| |
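The relationship between the gene sequence and the protein sequence can be illustrated on the first three 10-base blocks of the gene. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs mainly in which codons may act as start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGACACCCG ATAGTTCCCG TTTTGCTGTA")
print(translate(prefix))  # MTPDSSRFAV
```

The output matches the first block of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would be the natural way to check the Protein Length and GC content fields.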