Gene Information |
Locus tag | Nmul_A0903 |
Symbol | |
ID | 3784950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1027803 |
End bp | 1029230 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637810985 |
Product | hypothetical protein |
Protein accession | YP_411598 |
Protein GI | 82702032 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
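The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1027803, 1029230)
print(length)                     # 1428
print(protein_length_aa(length))  # 475
```

Both results match the Gene Length and Protein Length fields of the record.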
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.354068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTGG AGCGCCTACG TCAACTGCAA CCCTGCCTGT GGACATTGCC GCGCGCACCG GGCGAAGAAC GCGCGCAGGT CCTCCTGTAT GGCAGCGCGC CCTTGCTCGT CAGCATGGAC GACAAGGTAC TCGAGCAGAT CGCCAACGTT GCTTCCCTGC CGGGGCTGGT TGGAGCGGCA ATGACCATGC CGGATGCGCA TTGGGGATAT GGTTTTCCCA TAGGGGGTGT TGCTGCTTTC GATGCTGAGC AGGGGGGGGT GATTTCCGCA GGCGGGGTCG GCTTCGATAT TTCGTGCGGC ATACGCTGTC TGCGCAGCAA TCTGAATCTG GAGGATGCGG TAGAGCATTT TCCCCAGCTC GGAAAAGCCT TGTTCCGCGC TATCCCTGCT GGAGTGGGCG AGGAGGGCGA GATCAAGCTG AACCCGGAGC AGCTCGACCA GGTGATGCAC GGCGGTGCAC ACTGGGCGGT ACAGCAAGGC TACGGCACTC CAGCAGATCT GGATTATGTT GAAGAACAGG GGCGGGTAGC AGGAGCGATC CCGGAAAACG TATCCGAACT TGCCAAAAAG CGCCAGCGCG GCGAGATGGG CACGCTCGGC TCAGGCAATC ATTACCTGGA AGTACAGGTG GTGGACCGCA TCTTCGATCC TGGCGTCGCG CTGGCGCTTG GCCTGCATGA AGGACAAATC CTGATTTCGA TACATTGCGG TTCGCGTGGG CTGGGGCACC AGATCGGTAC CGATTATCTG GTGCTATTGG CAAAAGCAGC CAGCCGCTCG GGCATTCATT TACCCGATCG TGAACTCGCT TGCGCGCCTG TCAAATCCCC CGAAGGCCAG CAATATATCG GCGCGATGAA TGCTGCGATC AATTGCGCGC TTGCCAACCG GCAAATCCTG ACGCATCTTA CGCGCTCCGT ATTTACGGAA ATTTATCCCC AGGCCGAGCT TGAAACCTTG TTTGATGTTT CGCACAATAC CTGCAAGGCC GAAACCCATC AGATCGACGG TGAATCGAGG TTGCTGTATG TGCACCGCAA GGGCGCTACA CGCGCATTCG GCCCGGGCCA CCCCATGCTG CCGGAACGCT ACCGCCAGGT AGGACAACCC GTCGTTATCG GTGGAAGCAT GGGAACAGGC TCCTACATTC TCGTTGGTGA CAGCGAAAAT CCCGCCTTCG CTTCTTCAAG CCACGGTGCA GGCCGGGCCA TGAGCCGGCA CCAGGCACTC GCGCGATGGA AAGGACGTGC GCTGGTGGAC GAGTTGGCGC AACAAGGTAT TCTGATCCAT ACCCGCTCCA TGCGAGGTGT GGCGGAAGAA GCCCCGGGCG CCTATAAGGA TGTCGATCTG GTGGCGGAGG CCACGGAAGA AGCCGGGCTC GCCCGGCGCG TCGCGTTTCT CCGACCCAAA GTCTGCGTTA AGGGTTAA
Protein sequence | MNLERLRQLQ PCLWTLPRAP GEERAQVLLY GSAPLLVSMD DKVLEQIANV ASLPGLVGAA MTMPDAHWGY GFPIGGVAAF DAEQGGVISA GGVGFDISCG IRCLRSNLNL EDAVEHFPQL GKALFRAIPA GVGEEGEIKL NPEQLDQVMH GGAHWAVQQG YGTPADLDYV EEQGRVAGAI PENVSELAKK RQRGEMGTLG SGNHYLEVQV VDRIFDPGVA LALGLHEGQI LISIHCGSRG LGHQIGTDYL VLLAKAASRS GIHLPDRELA CAPVKSPEGQ QYIGAMNAAI NCALANRQIL THLTRSVFTE IYPQAELETL FDVSHNTCKA ETHQIDGESR LLYVHRKGAT RAFGPGHPML PERYRQVGQP VVIGGSMGTG SYILVGDSEN PAFASSSHGA GRAMSRHQAL ARWKGRALVD ELAQQGILIH TRSMRGVAEE APGAYKDVDL VAEATEEAGL ARRVAFLRPK VCVKG
| |
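The sequences above are stored in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, plus a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11. Only short excerpts from the start and end of the gene sequence are used here; the full sequence can be passed in the same way.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence in this record.
head = clean_sequence("ATGAATCTGG AGCGCCTACG")
tail = clean_sequence("GTCTGCGTTA AGGGTTAA")

print(head)                  # ATGAATCTGGAGCGCCTACG
print(gc_fraction(head))     # 0.55
print(ends_with_stop(tail))  # True
```

The 0.55 figure applies only to the 20-base excerpt, not to the whole gene, whose GC content the record lists as 60%.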