Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2384 |
Symbol | |
ID | 3784975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2715871 |
End bp | 2717439 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637812473 |
Product | hypothetical protein |
Protein accession | YP_413065 |
Protein GI | 82703499 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCGC TCAAGAACAA TCTATCCACT GCCATTTACT CCACGGCTGA GATCAGAGAA ATCGAACACC TGGCGGCCGG CCTGCCGGGC CGGCCTCAAT TGATGGAAGC AGCCGGGCTC GCCGCCGCTG AAATTGCCCG TGACCGGCTG CTCTCCCCCC ATAAACCCCG CTTGCTGGTA CTGGCGGGGC CCGGGAACAA CGGGGGCGAT GCCTTTGTCG CGGCGCGCCA CCTGCGGGAG TGGGGGTTCA AGGTAACGCT GGTGTTTACC GGCGAGCAGG CAAAGTTATC GGCGGATGCG CTGCGGGCGC TGAATGCCTG GGTTGCCACC GGCGCTGGGA TGGTCTCCGA AATTCCGGAA AATGAAACAT GGGATGCCGT GATCGACGGC TTGTTCGGTA TTGGTCTGGA CCAGCAAGGC GGACGAGAAC TGGGCGGCAA ATATCTGGCT ATGGTGAATA CCGTTAACGC CATGATGCTG CCGGTGCTGT CCATCGATAT TCCCAGCGGT CTGGGCAGCG ATACCGGCGC TGTGTGCGGC GCGGCTATCA TCGCGACCAT GACGGCGACA TTCATTGGCC TGAAACCCGG CCTCTTCACG AATGAGGGCC CCGATTATTG CGGGAAAGTT TTTTTGCACG ACCTTGATCT CGATGTCTCA TCCCTGAAGA AACCCGATGC GTGGCTGATG GACCAGATGC ACATCCGGCG GCTCCTGCCC CCTCCCCGGC GGGCTAACAG TCACAAGGGC ATGTTCGGCA GTATCGGGGT AATCGGAGGA ACAGCCGGCA TGGTCGGGGC TGCGCTTCTG GCGGGTACAG CGGCGCTGAA ACTGGGTGCC GGACGCGTTT ACCTGGGATT GATGGCGCCG GACGCGCCAG CGGTCGACAC CTTCCAGCCG GAACTGATGC TGCGCCCTAT CCAGGATCTG TTCAAGCTGG AGCAGCTGAA CTGTCTGGTG GTTGGTCCGG GTTTGGGAAC GGAAACCGCC GCATACTTCT GGCTCAAGTG CGCGCTGCAA ACCACTCTAC CACTGGTTCT CGATGCAGAT GGCTTGAATC TCGTTGCCTC GCATTCCGAA ATAGCGGGAT TGCTGCGGGA ACGGTTGCGC GAACGCCATG CGCCTTCCAT TCTCACTCCT CATCCGGCCG AAGCCGCCCG CCTCCTGAAA AGCACCACGA CTTCCGTACA ACAGGACCGG ATGGCAGCCG CTGCGGAGCT GGCCCAGCGT TTTAACTGCT GGATTGTGTT GAAAGGCGCA GGCAGCGTGT GCGCGATGCC GGAGGGCAGA CGCTTCATCA ACACAAGTGG AAATCCCGGC TTAAGCAGCG CAGGAACGGG CGATATCCTC TCCGGGATGA TTGGCGCCTT TCTGGCACAG CGATCGAGCC CGGAAAACGC GCTGCTCGCT GCTGTATACC TGCACGGAGC GGCTGCCGAC GTATTGCAGA AGCAGTATGG CGGCGGCATT GGAATGACCG CCTCCGAAAT TCCCAACGTC GCCCGTAACC TGTTGAACCA ATGGATTGCC GTTAATTCTG CCCCGGCCCC TCACCAGCAA GAAGGCTGA
|
Protein sequence | MNALKNNLST AIYSTAEIRE IEHLAAGLPG RPQLMEAAGL AAAEIARDRL LSPHKPRLLV LAGPGNNGGD AFVAARHLRE WGFKVTLVFT GEQAKLSADA LRALNAWVAT GAGMVSEIPE NETWDAVIDG LFGIGLDQQG GRELGGKYLA MVNTVNAMML PVLSIDIPSG LGSDTGAVCG AAIIATMTAT FIGLKPGLFT NEGPDYCGKV FLHDLDLDVS SLKKPDAWLM DQMHIRRLLP PPRRANSHKG MFGSIGVIGG TAGMVGAALL AGTAALKLGA GRVYLGLMAP DAPAVDTFQP ELMLRPIQDL FKLEQLNCLV VGPGLGTETA AYFWLKCALQ TTLPLVLDAD GLNLVASHSE IAGLLRERLR ERHAPSILTP HPAEAARLLK STTTSVQQDR MAAAAELAQR FNCWIVLKGA GSVCAMPEGR RFINTSGNPG LSSAGTGDIL SGMIGAFLAQ RSSPENALLA AVYLHGAAAD VLQKQYGGGI GMTASEIPNV ARNLLNQWIA VNSAPAPHQQ EG
|
| |