Gene Information |
Locus tag | Nmul_A1927 |
Symbol | |
ID | 3784223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2216478 |
End bp | 2218331 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812013 |
Product | hypothetical protein |
Protein accession | YP_412614 |
Protein GI | 82703048 |
COG category | [E] Amino acid transport and metabolism; [S] Function unknown |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating); [COG4121] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03197] tRNA U-34 5-methylaminomethyl-2-thiouridine biosynthesis protein MnmC, C-terminal domain |
|
|
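The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2216478
end_bp = 2218331
gene_length_bp = 1854
protein_length_aa = 617

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be the stop.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 1854 0 617
```

Under these assumptions the three fields agree: 1854 bp is 618 codons, that is, 617 amino acids plus one stop codon.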
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.224364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGACT GGCAAAACGG ACAACTGTAT TCCACTCGCT TCGGTGACGT CTACTTCTCA AGAGACTCGG GACTGGAAGA AAAACAGTAC GTCTTCCTGC AGGGCAACCG GCTTGCGGAC CGTTTTGAGT CCTTGCAGCC TGATACCGCG TTTTCCATTG GAGAAACGGG ATTCGGCACA GGATTGAGCT TTTTATGCAC TTGGCGGCTA TTCATCCAGA TTGCACCCCT TCGGACCAGT CTTGATTTTT TCAGTGTTGA AAAATATCCG CTTGATGAAA AAGAACTGAG CGCAGCGCTC GCGCTTTGGC CCGAACTGGG CCCATACGCC GATGAACTTA TGCTGCGCTG GCAGCGGCGT GTACCCGGAT GGAATCGGTG GAGCTTCGCC GGAGGAAGAG TGCGTCTCAC GCTGGCAATA GAGGACGTGA CCCGGGCGCT GCCTGAAACG CACGGTATCG ATGCATGGTT TCTTGACGGC TTTTCACCGG CGCGAAACCC GGAAATGTGG ACACTCCAGA TTTTTCACTG GATTGCGCGG GCGTCGCGAG CAGGCGCAAC CTTTGCGACC TATACCAGTG CCGGCGTTGT TCGTCGCGGT TTGGAACAAG CAGGGTTTCA GGTCAAAAAA ATATCCGGCT TTGGCCATAA GCGTGAAATG CTGCAAGGTG ACCTTCCTGG CCCCCCTCCC GTTCGACTGG CTCCCACCAC CGCGATCGTT ATCGGAGGGG GAATAGCAGG GTGCGCCGCT GCTTCGGCGC TGGCCAGTCG TGGACTTATA GTTGAACTTC TGGAATCACA CACCCTTGGC GCGGGTGCGT CAGGCAACCC GATTGGTATA CTGCACGCCC GCCTGAGTGC AGGAATGAAC GCCCTGCACC GCTTTGTGCT GGCATCCTAC GGACATGCGC TCGCCTTGCT TGACGAAAAA ATACCCGTCG ATGGCGTCAT GCGGAGTGAA TGCGGAGAAC TGCAGCTGTC ATTCTCCGCC GAAGAAGCAA GACGAATCGG GAAGCTTGCG ACCCTCGACT GGCCCGCGCA TGTTTTCCGA CCAGTAGATG CGGCTGAAGC ATCGGCCCTT GCGGGAATTG AGCTTTCATA TGGTGGCCTT TGGTTTCCCG GTAGCGGTTG GCTTGCTCCG CCTCAACTTT GTGTAGCCTT GCTTGGCAGT CAGGCTATCA CCCTGTATAC CGGTCGCACG GTAAAATCAC TTACCCCAAC GAGTCACGGG TGGCGTGTGC AAGCGGAAGA TCAGAGGAAG CAAGCGTGGT CTCTGGAGGC CGAGATAGTT GTGGTTTGCA CCGGATATCA GGTGAAATCG CTTCCAGCAT TGGCAAATCT GCCGCTAACC CCGGTACGGG GACAGCTTAC CTTGATCCCT GCAACAACCG CAAGCCAGAA TCTCCGCACC ATCGTATGCG GGAGTGGCTA TTTCTCCCCT GCTGTTGCAG GACGACATAT GGTGGGAGCA ACCCATCGTT TTAACGATAC ATCGATTAAC CTGAATGTAT CGGAGCATGC GGAAAACTTA TCCAGACTGC GAGAAATTTC TCCTGTCCTC CGCAGGTTGA GTGACGAGGT AAGTCAAGAT ATCAGGCAGC TTGAGCAATT GGATGGACGC ACATCTATCA GGGGGTCTGT TCCAGGCGCC ATGCCGCTCG TCGGCGAACT TTTGCCCGGA CTGTATACCA GCCTCGGCCA TGGAACGCGT GGACTGATTA CCGCGGGAAT TTCAGCCGAA TTGGTCGCGG CAACCGCCTG CGGGCAACTG CTGCCGTTGC CATTGTCCGT TGTCAATGCG 
CTCTCGCCTG TCCGAAGAGC TTCTCCCGCT ATTCCGGTTT CAATCAAGGG ATAG
|
Protein sequence | MLDWQNGQLY STRFGDVYFS RDSGLEEKQY VFLQGNRLAD RFESLQPDTA FSIGETGFGT GLSFLCTWRL FIQIAPLRTS LDFFSVEKYP LDEKELSAAL ALWPELGPYA DELMLRWQRR VPGWNRWSFA GGRVRLTLAI EDVTRALPET HGIDAWFLDG FSPARNPEMW TLQIFHWIAR ASRAGATFAT YTSAGVVRRG LEQAGFQVKK ISGFGHKREM LQGDLPGPPP VRLAPTTAIV IGGGIAGCAA ASALASRGLI VELLESHTLG AGASGNPIGI LHARLSAGMN ALHRFVLASY GHALALLDEK IPVDGVMRSE CGELQLSFSA EEARRIGKLA TLDWPAHVFR PVDAAEASAL AGIELSYGGL WFPGSGWLAP PQLCVALLGS QAITLYTGRT VKSLTPTSHG WRVQAEDQRK QAWSLEAEIV VVCTGYQVKS LPALANLPLT PVRGQLTLIP ATTASQNLRT IVCGSGYFSP AVAGRHMVGA THRFNDTSIN LNVSEHAENL SRLREISPVL RRLSDEVSQD IRQLEQLDGR TSIRGSVPGA MPLVGELLPG LYTSLGHGTR GLITAGISAE LVAATACGQL LPLPLSVVNA LSPVRRASPA IPVSIKG
|
| |
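The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It builds the codon table from the standard NCBI base ordering; the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), so the one table serves here. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 30 and last 54 nucleotides of the record are used as input, not the full sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces used to group the sequence in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 54 nucleotides, copied from the Gene sequence field.
head = "ATGCTTGACT GGCAAAACGG ACAACTGTAT"
tail = "CTCTCGCCTG TCCGAAGAGC TTCTCCCGCT ATTCCGGTTT CAATCAAGGG ATAG"

print(translate(head))  # prints: MLDWQNGQLY
print(translate(tail))  # prints: LSPVRRASPAIPVSIKG
```

Both fragments reproduce the corresponding ends of the protein sequence listed in the record. Passing the complete gene sequence to `gc_fraction` would allow a comparison with the 56% GC content field.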