Gene Information |
Locus tag | Nmul_A1731 |
Symbol | |
ID | 3786208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1978141 |
End bp | 1979511 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811817 |
Product | hypothetical protein |
Protein accession | YP_412420 |
Protein GI | 82702854 |
COG category | [S] Function unknown |
COG ID | [COG4267] Predicted membrane protein |
TIGRFAM ID | |
|
|
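The coordinates and lengths listed above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1978141, 1979511)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both values match the Gene Length and Protein Length fields in the record.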
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGTA TCGGTTTCGA AATCCGCAAA ATCCTGGAGC GCGACAATTA CTGGTCGGTG TTGCGAGCCT ATGGTTATGC GGGCCTGGTC AGCGGTGGTC CATGGGTGCT CTCGATTTTG AGCATCATGA TGATAGGCGT CCTTGCCGTC ATGCTGGGTG TGGAACAGCG CGAGGTCAAT GCCTTCCAGA TCTGCGTTAC CTATATGATG GCGAGTTCAC TGATCTGGAC CGGGGGTATG CAGCTCATGT TCACTCGGTT CGTCGCGGAC CGGACTTATG CCGGTGACAA GGCAGAGGTG CTGCCCAATC TGTTCGGCGT CCTGATGTTG ACCATGGCGG GTGGCTTTGC GTGGGCAGCG CCCTTTATCT TCTCGCTTTT CACGGGACCA TTTTTCCAGC AACTGTTGCT GCTGTGCAAT TTTGTTGTGC TGAGCGGGAT GTGGATTGTC CTCATTTTCC TGTCGGGGAT GAAGGCGTAT GGGCGAATCA TCTGGACTTT GGCCAAGGGT TACTCCCTGG GAATTGCAGT CGGACTCCTT GCATCTCCAT GGGAACTCAA CGGCCTGTTG CTCGGTGTGC TGACCGGTCA CGGCTATCTG CTGTTTTCCT TCCTGCATCA TATCGTGCGG GAATATCCCG GAAATTCTCT GCTCAAGTTT GACTTTCTCG ATCGAAGCCG GAGCTTTTAC AGCCTGTTTG CGGTCGGCAC GCTCTATTAC CTGGCGGTGT GGATCGACAA ATTCATCTTC TGGTTCGTGC CCTATACTTC CGAGGTGGTC ATCGGTCCCT TGCGCGCATC CATCATCTAC GATCTTCCCA TTTTTCTGGC ATATCTATTC ATCCTGCCGG GGATGGCCGT ATTTCTGGTA AGCATAGAAG CGGATTTTGC CGAACAGCAC GAACGTTTTT ACCGTGCCGT GCGTGAAGGC GACACGCTCA TGCATATCGA ATATAGGCGT GACCGGATGG TGTACGCTGC CCGGCAAGGT ATCTACGAAA TCTTCAAAGT GCAGGGTCTG ACGGTGGTGT TGTGCCTTCT GTGGGGCAGG GGCCTGCTGC AAACCATTGG CATATCCCCG CTCTACATTC ATTTGTTTTA TATCGATGTG GTCGCAGTCA GTGTGCAGGT GCTGCTGATG GCGATACTCA ACATACTTTT TTATCTCGAC GCCCGGCGGG AAGTATTGAT CGTTACGGCA TGTTTTTTTA TCACCAACCT GCTGTTCACC GTTGCCACGC TGCAACTAGG AGCCGAATCC TTCGGATACG GCTTTGCAGT ATCCGTTACG CTGTCCGCAT TTCTCGGTCT CTTCATCCTC TCGCATAAGT TCAACCGGCT GGAGTATGAG ACATTCATGC TGCAGGGTTA G
|
Protein sequence | MAGIGFEIRK ILERDNYWSV LRAYGYAGLV SGGPWVLSIL SIMMIGVLAV MLGVEQREVN AFQICVTYMM ASSLIWTGGM QLMFTRFVAD RTYAGDKAEV LPNLFGVLML TMAGGFAWAA PFIFSLFTGP FFQQLLLLCN FVVLSGMWIV LIFLSGMKAY GRIIWTLAKG YSLGIAVGLL ASPWELNGLL LGVLTGHGYL LFSFLHHIVR EYPGNSLLKF DFLDRSRSFY SLFAVGTLYY LAVWIDKFIF WFVPYTSEVV IGPLRASIIY DLPIFLAYLF ILPGMAVFLV SIEADFAEQH ERFYRAVREG DTLMHIEYRR DRMVYAARQG IYEIFKVQGL TVVLCLLWGR GLLQTIGISP LYIHLFYIDV VAVSVQVLLM AILNILFYLD ARREVLIVTA CFFITNLLFT VATLQLGAES FGYGFAVSVT LSAFLGLFIL SHKFNRLEYE TFMLQG
|
| |
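The gene sequence above is shown in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content and the protein sequence could be recomputed from it. It assumes the displayed sequence is already the coding strand (it starts with ATG although the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which coincide with the standard code. All helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG) would need to be
    read as M; that case is not handled here because this gene starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGCCGGTA TCGGTTTCGA AATCCGCAAA"
print(translate(prefix))    # MAGIGFEIRK
print(gc_fraction(prefix))  # 0.5
```

The translated prefix agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would be one way to cross-check the GC content and Protein Length fields.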