Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2203 |
Symbol | |
ID | 3786228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2502644 |
End bp | 2503648 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637812290 |
Product | oxidoreductase-like |
Protein accession | YP_412887 |
Protein GI | 82703321 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATC TGATGAATGC ACCCGTGCCG CTATCCCCTG CCCTCGAATT AGCGTTGCCT GTTGATACTG CTGTCGTCGG CGTCGGATAT TTCGGCAGCT TGCATGCATT ATGCTACTCC CGCCTGCCCG GAAGCCGCCT CCAGGCATTG ATCGACCCCG ATCCGATTAC GCAATTCCTG GCTGAGCATC TAGGCGTCCC CTGGTTTCCA GGTGTTGCAG AACTTCCGTC CACAATCCGC GCGGTATCGG TTGCCACACC GGTTGCCATG CATTTCGGGC TGACCAGATC GCTGCTCCAG CAAGGGCTGG ACGTGCTGCT GGAAAAGCCG ATTGCGGAAA CCGCGGCACA GGCGACGGAA CTGCGAATGG TAGCGGAGGC AAACCAATGC ATCCTGCAGA TCGGACATAT CGAACGCTTC AATCCGGCCT ATACCGCTGG CGGAACGCTC CTCCCGTTCG CCCGGACCGT CCGCTCGGTG CGGACCACAC GACATCCTCC GCGATCCAGC GCACTGGACG TGGTCATCGA CCTGATGATC CATGATCTTG ACCTGATTCT GCATAGCCTG GATTCCTCCG TGGTGGAACT GCGCGCCTCC GGTAGAAGCT GCGGTTTAAC AGCCATAGAT GAAGCAGAAG TTGAACTGAC TTTCTCCAAT GGGTGCCGGG TATACCTGGA TGCACACTGG GGGCGAGATA CGGAGCAGGA CGCGCGCTGC ATGGTTGCGG AACTGGAAAA CGATGAAACC TGGGTCATCG ATTTCAGGCG CCGGATGACC TATCGCAAGG AACCCGGCAG CTCGACGGGT CCTTTGCCTG CGGACGGCCA CACGCTTCCG TTTCCCATGC AACGGATACA GGAAGATACG TTGAGCCTGC AACTCGCGGC CTTCCTCGAT GCCTGCCGCA ACCGTTCACT ACCCCGGGTT ACGCCGGCGG AAGGCAATGC TGCCCTGGAG CTGGCGCACT GCATCCGGCA GCAGATACTC AGGCCCTGCC CGTGA
|
Protein sequence | MMNLMNAPVP LSPALELALP VDTAVVGVGY FGSLHALCYS RLPGSRLQAL IDPDPITQFL AEHLGVPWFP GVAELPSTIR AVSVATPVAM HFGLTRSLLQ QGLDVLLEKP IAETAAQATE LRMVAEANQC ILQIGHIERF NPAYTAGGTL LPFARTVRSV RTTRHPPRSS ALDVVIDLMI HDLDLILHSL DSSVVELRAS GRSCGLTAID EAEVELTFSN GCRVYLDAHW GRDTEQDARC MVAELENDET WVIDFRRRMT YRKEPGSSTG PLPADGHTLP FPMQRIQEDT LSLQLAAFLD ACRNRSLPRV TPAEGNAALE LAHCIRQQIL RPCP
|
| |