Gene Information |
Locus tag | Nmul_A1424 |
Symbol | |
ID | 3786622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1634947 |
End bp | 1636146 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637811512 |
Product | hypothetical protein |
Protein accession | YP_412119 |
Protein GI | 82702553 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
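The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon, which is not counted in the protein length. The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 1634947
END_BP = 1636146


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1200
print(protein_length(length_bp))  # 399
```

Under these assumptions the results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.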
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0154311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAATAA TTCTTCTTGC TGCCTTTGTC GTTCTGGAAA TAGGCTTTGC CCCGCCCCTT CGCGCCGCCG AATCAGCAGG TTTTGCTCTC AATTATATTG GCCAGCAAAT CGTACCCAAC AAGACAAGAT TCAAAGGGAC AACGGTCGGG GGGTTGTCAT CTCTGGACTA TAGCGCAAGC ACTGACCGTT ACCTATCCAT CAGCGACGAT CGCAGCAGAA CCAATCCAGC CCGGTTTTAT GAATTGTCTC TGGATCTCGC CAAATTCCAG TGCTCGGCCA AGCCGGGTAT GGCAGGCGTG ACCTTTCAGG CTGTAACCAC GATCCAGCAA GCCGGTGGAG GGGCATTCGA AAAAAACTCC GTGGATCCGG AAGGTCTCCG TTTTGACGGC AGCCGCAACA AGATTTATTG GAGTGAGGAA GGCCGCCGGG AGATATCGGG TTTTCGAAGC CCTGCGGTGC GGGAAATGAA TGCTGATGGC AGACATTCCC GCGATTTCGT TGTTCCTATT TACTACTCTC CCAGTGGCTC CCGTCTTTGG ACATTTACCG GCAGTAAGGG TGTTTACGAC AATTTGGGAT TTGAGAGTCT GACACTCTCC ACCGACGGTA CAACCCTGTA TACCGCCACC GAAAACGGCC TGGTTCAGGA TTCTCCCCCT GCCAATGCCT ATAGAGGCTC ACGCGCACGT ATTCTTGCCT TCGACATTGC CACCGGGAAA TCAGTCGCGG AATATGCTTA CGATGTTGAA CCTGTTACAT CCGTACCATC CTTGCTCGGC GGTTTCACCA TCATCGGCGT GAGCGACTTC CTCGCCATCG GCGACCGCCA ATTCATCACT ATAGAGCGCG CGTTATCCCC CGGCACGATC ACGCCTGGCC GTATTAACAC CGGATATACC GTCCGGCTTT ATTACGCAGA TGCAAGGGAC GCCACCAACA TTTCCGGAAT GGAATCAATC GCGGACAAGA ACATCTCTCC GGTAAGAAAA ATTCTCCTGC TTGATATGTC AGACCTGAAA AATGCGGATG GCTCGGCTCT GGCTATTGGT AACATAGAAG GCATCACCTT TGGTCCCGAA TTCAGGGGCA AACGCACTAT CTTGCTGGTG GCTGACAACA ATTTCTCCAG AATGCAATTC ACCCAATTTG TCGCATTGGA AATTGCCTCT GAATCGGAGC TAGTGGAGCG GTTACAATAA
Protein sequence | MKIILLAAFV VLEIGFAPPL RAAESAGFAL NYIGQQIVPN KTRFKGTTVG GLSSLDYSAS TDRYLSISDD RSRTNPARFY ELSLDLAKFQ CSAKPGMAGV TFQAVTTIQQ AGGGAFEKNS VDPEGLRFDG SRNKIYWSEE GRREISGFRS PAVREMNADG RHSRDFVVPI YYSPSGSRLW TFTGSKGVYD NLGFESLTLS TDGTTLYTAT ENGLVQDSPP ANAYRGSRAR ILAFDIATGK SVAEYAYDVE PVTSVPSLLG GFTIIGVSDF LAIGDRQFIT IERALSPGTI TPGRINTGYT VRLYYADARD ATNISGMESI ADKNISPVRK ILLLDMSDLK NADGSALAIG NIEGITFGPE FRGKRTILLV ADNNFSRMQF TQFVALEIAS ESELVERLQ
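The sequences above are printed in space-separated groups of 10 characters. The following is a minimal sketch of how such text could be ungrouped, its GC fraction computed, and its codons translated. It is an illustration, not the method used to produce this record. All function names are hypothetical. The codon table is the standard amino-acid assignment, which as far as I know is also what translation table 11 uses (table 11 differs in which start codons are permitted). Only the first three groups of the gene sequence are used, to keep the example short.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = ungroup(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = ungroup(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three groups of the gene sequence above.
prefix = "ATGAAAATAA TTCTTCTTGC TGCCTTTGTC"
print(translate(prefix))              # MKIILLAAFV
print(round(gc_fraction(prefix), 3))  # 0.333
```

The translated prefix matches the first group of the protein sequence. Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field.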