Gene Information |
Locus tag | Nmul_A0959 |
Symbol | |
ID | 3785750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1112791 |
End bp | 1114050 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811042 |
Product | hypothetical protein |
Protein accession | YP_411654 |
Protein GI | 82702088 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAC CCTTTTCTGC CCTGCTTGCC TTGGTCTTCG CCACTGGTCT TGGGGCACCG GAGGCTTTTG CACAGTTGCA AACCCGGATC GTTGCAAACG GCCTTGATCG CCCTTTGTTC GCAACATCGC CAGCAGGAGA CTCACGCCTG TTCATTGTCG AACAGGGCGG GTTGATCAAA ATCCTCCAGA ATGGGAGTGT GCAGCCAACA CCATTTCTCG ATCTGTCAGG CTCGGTCAAT ACCGAGGGCG AACGCGGCCT GCTGGGAATG ACATTCGATC CAAACTTTGC CAGCAACCGT CGATTTTATG TCGATTACAT AGACAGGACT TCGCTCAATA CCGTAGTCGC AACGTATCAG GTAAGTGCAA CGCAGCCAAA CGTGGCAGAT ATCACCAGCC GACAAACAGT GCTCACTGTG CAGCAACCTG AGTTCAACAA TCACAAGGCC GGGTGGCTCG GCTTCAGGCC CGGCGAGCCG GGGAATCTTT ACATAGCCAC AGGAGACGGC GGCCTCCGGG ATGATCCTGG TAACCGCGCG CAAAACCTGT CCAGCAATCT CGGCAAAATT TTGCGAATCG ATGTTTCATC TGATCGCCTG CCCAATGATC CCACGCAGTA TGGCTATGCA ATACCGGATG GAAATGCCAC AGGCAGCAAT CCGGAGATAT ATGCATCTGG ATTGCGTAAC CCGTTTCGTG ACAGTTTTGA CCGGGAAAAC GGCACTTTCT ACATCGGTGA CGTCGGGCAG AACGCACGCG AGGAAATCGA TATAGGCGCT GCCGGAGCGA ACTATGGGTG GCGCAGATTC GAGGGAACGC TGGTGAATTT TCCTAACGAT CCACAGATTC CCAACCACAC GCCGCCCATC TTTGAATATA ACCATACCGC GGATGGCGCC TCAGTCATTG GCGGCTATGT CTACCGCGGC TCAGAAATCC CCGGCCTGGA AGGCACCTAC TTCTTTGCGG ACTTCGTCAA CGACAAGGTG ATGTCCTTCC GCTTTACTGG CTCAGGCATT ACCGACCTCA CCGACCGCAC TGCCGAACTG CTCTCCCCAA CGGGCATTTC GGGGAATATC ACCTCATTCG GCGAGGATGC TTCCGGCAAC CTTTATCTGG TAAGCCTCAA TGGGCAAGTC GGACGAATCG CCCTTATACC CGAACCTGCA AGCTACGCAA TGATGCTGGC GGGGATGGGT TTGATCGGGG TGTGGGTTAG GCGAAGAGGG AAAGCGAGGA AAGTCCCAGG GCTCACCTAA
|
Protein sequence | MIKPFSALLA LVFATGLGAP EAFAQLQTRI VANGLDRPLF ATSPAGDSRL FIVEQGGLIK ILQNGSVQPT PFLDLSGSVN TEGERGLLGM TFDPNFASNR RFYVDYIDRT SLNTVVATYQ VSATQPNVAD ITSRQTVLTV QQPEFNNHKA GWLGFRPGEP GNLYIATGDG GLRDDPGNRA QNLSSNLGKI LRIDVSSDRL PNDPTQYGYA IPDGNATGSN PEIYASGLRN PFRDSFDREN GTFYIGDVGQ NAREEIDIGA AGANYGWRRF EGTLVNFPND PQIPNHTPPI FEYNHTADGA SVIGGYVYRG SEIPGLEGTY FFADFVNDKV MSFRFTGSGI TDLTDRTAEL LSPTGISGNI TSFGEDASGN LYLVSLNGQV GRIALIPEPA SYAMMLAGMG LIGVWVRRRG KARKVPGLT
|
| |
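The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Nmul_A0959.
length = gene_length_bp(1112791, 1114050)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Under these assumptions the computed values agree with the listed Gene Length (1260 bp) and Protein Length (419 aa).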