Gene Information |
Locus tag | Nmul_A0972 |
Symbol | |
ID | 3785763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1129676 |
End bp | 1130848 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811055 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_411667 |
Protein GI | 82702101 |
COG category | [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
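The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete, unspliced, and ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
START_BP = 1129676
END_BP = 1130848
GENE_LENGTH_BP = 1173
PROTEIN_LENGTH_AA = 390


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not affect this arithmetic; it only determines which strand of the replicon the coordinates are read from.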
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTAT CACCGCTGGT ACATCCCTCG GAAGAATTGA GTAAAGAAGA AATTTCGCGC TACAGCCGCC ATCTGCTGAT TCCGGATGTC GGTCTGGAAG GGCAGCAGCG TCTCAAGAAC AGCAAGGTCC TGGTAATCGG CGCCGGAGGG CTGGGATCGC CGACGCTCCT CTATCTGGCC GCAGCCGGCG TGGGAACCTT GGGCATTATC GATTTCGATA TTGTTGATGA ATCCAATCTA CAGCGGCAGG TCATTCATCG GCAGCAGGAT ATCGGCAGAC CCAAATGCCG GAGTGCGCAG GACTCGGTCA AGGCCCTGAA TCCCTATATC CAGGTTCGGA TTCACGATGA ACGACTCGAA ACAACAAATG CTATTGACAT CATATCGGAC TACGATCTGG TAATAGACGG AACGGATAAC TTCTCCACCC GCTATCTCGT CAACGATGCC TGCGTGCTGG CTGGAAAGCC CTATGTCTGG GGGTCCATTT TCCGCTTCGA AGGGCAGGCC TCGGTCTTCT GGGAAGATGC GCCAGGAGGA CGCGGCCTGA ACTACCGCGA CCTTTATCCC GAACCCCCGC CGCCCGAAAT GGCCCCGTCG TGTGCGGAAG GCGGCGTGCT GGGCATACTG TGCGCATCCA TCGGCGCGAT CATGGCTACC GAGGCCATCA AATTGATTAC CGGTCTGGGC AATACCCTGC TCGGCAGGCT TGCTGTCTAC GATGCCCTGG ATATGACCTT CAGATTCATC CCGCTGCGGC GAGCCCCGGT CAGGACACCG ATCACCCGGC TGATCGATTA CCAGGCATTT TGCGGCCTCC CGCAGTCAAC GACGGGCGGC AAACCGAACG TACCCACGAT CAGCGCCCTG GAATTGAAAG AGATGCGGGA TTGCAGCGTA GCCATGCAGC TTATCGATGT TCGCGGAATC CAGGAATGGA ACATCGTGCA TATCGAAGGG GCAAATCATA TTCCCAAGGA CAGGATGATG TCCGAGGAAG TTTTAGCCCG ATTGAACAAG GATGAGCTTA TCGTGCTTCA CTGCAAAATG GGCGTGCGGT CCCGGGATAT TCTCATGGAG ATGCGCAAGC GGGGGTTTAC GAATGTCAAA AGCCTGGATG GCGGAATCCT GGCGTGGATC AGGGATGTGG ATCAGACATT GCCAAGTTAT TGA
|
Protein sequence | MQLSPLVHPS EELSKEEISR YSRHLLIPDV GLEGQQRLKN SKVLVIGAGG LGSPTLLYLA AAGVGTLGII DFDIVDESNL QRQVIHRQQD IGRPKCRSAQ DSVKALNPYI QVRIHDERLE TTNAIDIISD YDLVIDGTDN FSTRYLVNDA CVLAGKPYVW GSIFRFEGQA SVFWEDAPGG RGLNYRDLYP EPPPPEMAPS CAEGGVLGIL CASIGAIMAT EAIKLITGLG NTLLGRLAVY DALDMTFRFI PLRRAPVRTP ITRLIDYQAF CGLPQSTTGG KPNVPTISAL ELKEMRDCSV AMQLIDVRGI QEWNIVHIEG ANHIPKDRMM SEEVLARLNK DELIVLHCKM GVRSRDILME MRKRGFTNVK SLDGGILAWI RDVDQTLPSY
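The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so its opening codons can be translated directly and compared with the start of the protein sequence. The sketch below is a minimal illustration, not the database's own method: the helper names are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code, so one lookup table is used for elongation; alternative start codons are not handled.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 uses the same
# amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGCAGCTAT CACCGCTGGT ACATCCCTCG"
print(translate(first_blocks))  # MQLSPLVHPS, the first ten residues of the protein
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information table.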