Gene Information |
Locus tag | Nmul_A0507 |
Symbol | |
ID | 3785636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 572128 |
End bp | 573120 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810589 |
Product | thiosulphate-binding protein |
Protein accession | YP_411207 |
Protein GI | 82701641 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
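The length fields in the table above follow from the coordinates. The sketch below reproduces that arithmetic. It assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 993 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 572128
END_BP = 573120


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 993 330
```

Both results match the Gene Length (993 bp) and Protein Length (330 aa) rows.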
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.347365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAT GGATAACGAC GAGCCTTACC GCAATCCTCC TGTGGGGAGG CAATGCCTGG GGAGATAAAA CAATATTGAA TGTATCCTAT GACCCGACTC GCGAGCTGTA TCAGCAGTTC AATTCGGCAT TCGCGAAGCA TTGGGAGAGC AAGACCGGAG AAAAAATAGC CATAAAACAA TCTCATGGCG GCTCCGGGAA ACAGGCACGC TCTGTCATCG ACGGGCTTGA TGCTGATGTC GTGACGCTCG CGCTGGCGAA GGACATCGAT GAAATCGCGG AAAAAGCCAG CCTGCTGCCG GCGGACTGGC AAAAGCGCTT GCCGCATAAC AGCACGCCTT ACACTTCAAC CATCGTACTG CTGGTCCGGA AGGGCAACCC CAAGAACATC AAAGACTGGG ACGACCTGGT CAAGCCCGGC GTCGCCGTAA TCACCCCCAA TCCGAAAACC TCGGGGGGCG CCCGCTGGAA TTACCTCGCC GCGTGGGAAT ACGGCAAGCG AACGTACGGC GACGATGCCA AGGCGAAGGA GTTCGTAGCA AAGCTCTACC GGAATGTCCC TGTGCTGGAC TCGGGCGCGC GCGGCTCAAC CACGACATTT GTGGAGCGCG GCATCGGCGA TGTCTTCATT TCGTGGGAAA ACGAAGCATT CCTCGCTATC AGGGAACTCG GGCCCGACAA ATTTGAAATT GTGGCACCCT CCCTCAGCAT TCTTGCCGAA CCATCGGTCG CGGTGGTAGA CAAGGTGGCC GACAAGAAGG GTACCCGTAC CATCGCGGAA GCCTATCTCC AATATCTCTA TTCAGACACA GGCCAGGAGA TTGCCGCTAA GAATTTCTAC CGTCCAACCA ATGCCGCAAT CGCAGCGAAA TATGCAAGTC AGTTTCCCGG CCTCAAGCTG TTCAAGATCG ACGATGCGTT TGGGGGCTGG AAGAACGCGC ACAAGCTTCA CTTTGCCGAT GGCGGCACTT TTGATCAGAT TTATCAGAAA TAA
|
Protein sequence | MKAWITTSLT AILLWGGNAW GDKTILNVSY DPTRELYQQF NSAFAKHWES KTGEKIAIKQ SHGGSGKQAR SVIDGLDADV VTLALAKDID EIAEKASLLP ADWQKRLPHN STPYTSTIVL LVRKGNPKNI KDWDDLVKPG VAVITPNPKT SGGARWNYLA AWEYGKRTYG DDAKAKEFVA KLYRNVPVLD SGARGSTTTF VERGIGDVFI SWENEAFLAI RELGPDKFEI VAPSLSILAE PSVAVVDKVA DKKGTRTIAE AYLQYLYSDT GQEIAAKNFY RPTNAAIAAK YASQFPGLKL FKIDDAFGGW KNAHKLHFAD GGTFDQIYQK
|
| |
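The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons; these are not handled here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAAGCAT GGATAACGAC GAGCCTTACC"
print(translate(first_blocks))  # MKAWITTSLT
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content rows.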