Gene Information |
Locus tag | Nmul_A2432 |
Symbol | |
ID | 3784127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2770980 |
End bp | 2773799 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637812522 |
Product | putative signal peptide protein |
Protein accession | YP_413113 |
Protein GI | 82703547 |
COG category | [S] Function unknown |
COG ID | [COG3868] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type |
|
|
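The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Both assumptions fit the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2770980, 2773799)
print(length, protein_length(length))  # 2820 939
```

The strand does not affect these lengths; it only determines which orientation of the replicon is read.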
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0164247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATGA CGGTCATCAA TGGGATAACG CCAATGATGA AGTACCAGTC GACTTTCGGG AATCATTATG CTTTTTTAAG GCTCATCTCT AGCATGCTTT TATCCTCTAT CTGCTTGCCC GCATACGCGA GTACCGCGAT TTTCTACGGC CGACCTGCCC CGGTGGATCT ACTTTCACAT TTTACGCAGG TCATAGTAGA ACCCGAGAAC ATGGATAATG TGGATTATCT TTTTGACAAG GGAACTAAGG TATTTGCCTA CGTCAGCGTC GGAGAAGTCC ATGCGACACG AAGCTGGTAT TCAGAGATCC CGCAAAGCTG GTTTATTGGC AGTAATAAAG AATGGGGAAG CAGTATCCTC GACCTGACTC AGCAAGGATG GCATGACTAT CTTCTTGACA AGTATTTGTC CCGGTTATGG GAACAAGGTT ATCGTGGATT TTTTTTGGAT GGACTGGAGA CATACCAGAG GGTTGCGGTT GAACCTGCTG CCCGGTTAAA CCAGGAAAAA GCATTATCCA ATCTGATTAA AAGTATACAT GAGCGTTTCC CGGGAGTTGA ACTTATACTT AACCGGGGTT TCGATATCTT GCCGCATGTA GGCCAGTATG CGGTTGCATT AGCCGCCGAA TCGCTATTCC AACGCTGGGA TCCGATCGAT TTGGAATATA CGGAAGTTGC TGAGCCGGAT CGTAACTGGC TCTTACAGAA GTTATGGCAG GCACGAGATC GGTATGAACT GCAAATAATC GTTATTGATT ACGTTGATCC AAAGCAGAAG GATTTAGCGC GAGCTACCGC AAAAAAAATA TCTGATCTGG GCTTTACGCC CTGGGTGGCA AACCCCGCAT TGGATATTTT CGGCATTGGC AAGGTGGAGA TATTCCCTAG ACGCATCTTG GCGCTCTATG ACGGGCACGA ATATCCGGAA GGGCTGCAGC AGACGGATAT CCATAAATTG CTAGCCATGC CACTGGAACA TCTGGGTTAT ACACTCGATT ATCTGGACGT GAATGCTGGA TTACCCGGCA ATTTATTGAC AGGGCAATAT GCCGGAATCG TGAGCTGGTT CAACAACGAT GCTTTATTGC ACCCGGCGGT TTATAGAGAC TGGCTGTTGC GGCAGATGGA GGCTGGGGTA AAAGTTGTTA TTCTCGGAAA TTTAGGTTTT AAAGCTGATA ATGCTTTTTT AAAGCATCTA GGGGTGGAAC TAAGTCAGTC GGAATCCCTG GAGGGCTCAC TTATGGTCAG GGGGAGCGAT AAAATTATTG GTTATGAGGC GAAACCTCAA CCAGTGACAC GGGGGTTCAC TGCCTGGCAA GTATTGGAAG ATGGTGTTCA AAAGCATCTA AGTTTTACAG GTCAATCGGA AGAGCCGGTG GTTGCAGTAT TTTCCGGGAA TTGGGGCGGG ACCGCCTTGC ATCCTTATGT GATAGAAGCT GGTTACCAGG GGCTGAAACG ATGGATCATC AATCCATTTG AGTTTTTAAG CACTGCTCTT GATGTAAAAG GAATACCCGT ACCCGATGTC ACGACCGAGA ATGGCCGGCG TTTATTGCTG GTACAGATTG ATGGTGATGG GGCTGGGAAT AAAGCAGAAA TACCTGGAAC GCCGTTGGCG ATAAAAGTCA TCCAGGATCA ATTCTTGCAA AAATATAACC TGCCTTCTAC GGTTTCCATA ATTGAAGGAG AAACGGCGAA CCGCGGCTCC TCAGACAAGG TAACTGAGGC CGAGAAAATA GCACAAGATA TTTTCAAGTT GAACCATGTT GAAATAGCAA GTCATTCCTA TAGTCATCCT 
TCCGCATGGT TTCCAAAGAA CACTATTGAT GCTGATGGAG ATGAGTATCA GCTTTCTGTC GATGGATATG GATTTGACTT GCATCGGGAA ATCGCCGGAT CCGTGGAATA TATTAATAAC ACCTTGGCGC CTGAGAACAA GCGGGCGCGT GTTTTCTTAT GGACCGGGGA CGGGCTGGCA AGCAGAGAGG CTCTAGCTTT AGCGCGTTCA CTTGGGTTGG AGAACATGAA TGGCGGCGGC GCGACGATTT CCAATGATGA AAAAACCGTG ACACGTGTGC CCCCCCTGGG TTATGTAATG GATGACCAGC TCCAGTTATA TGCCCCCATG GCGAGCGAGC ATGTTTATAC AAACAGATGG CAAGGTCCCT TTCACGCATT TAGTCGCGTC GTCGAAACCA TGCAACTCAC TGACCGCCCC CTTCGTTTGA AACCATTGCA TATTCATTAT CACTTTTATT CAGGAAGTAA AACGGATTCC ATCAATGCGC TGAAATCCGT GTATGAGTGG ACTGTTAAAC AGGAACATCG ACCTGTCTGG GTCAGCGAGT ACGGTCAGAA GGTAAATGAA TTCCGCAATC TTACAATGTC TCGCCGTGTC GATGGTGCCT GGGATATTAG AGGGCTGAAT ACACTACGCA CGTTACGTCT GCCAGTGTCG ACAGGGTGGC CCGATTTGGA AAGATCGGAG GGGATTGTAG GGGTCCGTGA TGCGAGTCAA GGGCGATATG TGCATCTTTT GCCTAATCAT GGTCAAGTGC TGCTGTATAT GACACCCGAG CTTCCCTTAT CTCCTTATTT ATCCCACAGC AACGGGGAGA TTGACGAATG GCAGAAAATG TCAGGGGGCG TTAATTTTCG AATGCGTGCG CATACGCCAT TAGAAATGAC GGTCGCATCT GAAGGAGAGT GCCATATCAA TTGGTCCGGT GGCACTCTGG AAGGGCAGCG CGAGGGTCAA GGCTGGAAAT TTGTTTTTCC AGTGGCCGAT TCCGGGGATG CGACCCTCGT GTGTTCTTGA
|
Protein sequence | MPMTVINGIT PMMKYQSTFG NHYAFLRLIS SMLLSSICLP AYASTAIFYG RPAPVDLLSH FTQVIVEPEN MDNVDYLFDK GTKVFAYVSV GEVHATRSWY SEIPQSWFIG SNKEWGSSIL DLTQQGWHDY LLDKYLSRLW EQGYRGFFLD GLETYQRVAV EPAARLNQEK ALSNLIKSIH ERFPGVELIL NRGFDILPHV GQYAVALAAE SLFQRWDPID LEYTEVAEPD RNWLLQKLWQ ARDRYELQII VIDYVDPKQK DLARATAKKI SDLGFTPWVA NPALDIFGIG KVEIFPRRIL ALYDGHEYPE GLQQTDIHKL LAMPLEHLGY TLDYLDVNAG LPGNLLTGQY AGIVSWFNND ALLHPAVYRD WLLRQMEAGV KVVILGNLGF KADNAFLKHL GVELSQSESL EGSLMVRGSD KIIGYEAKPQ PVTRGFTAWQ VLEDGVQKHL SFTGQSEEPV VAVFSGNWGG TALHPYVIEA GYQGLKRWII NPFEFLSTAL DVKGIPVPDV TTENGRRLLL VQIDGDGAGN KAEIPGTPLA IKVIQDQFLQ KYNLPSTVSI IEGETANRGS SDKVTEAEKI AQDIFKLNHV EIASHSYSHP SAWFPKNTID ADGDEYQLSV DGYGFDLHRE IAGSVEYINN TLAPENKRAR VFLWTGDGLA SREALALARS LGLENMNGGG ATISNDEKTV TRVPPLGYVM DDQLQLYAPM ASEHVYTNRW QGPFHAFSRV VETMQLTDRP LRLKPLHIHY HFYSGSKTDS INALKSVYEW TVKQEHRPVW VSEYGQKVNE FRNLTMSRRV DGAWDIRGLN TLRTLRLPVS TGWPDLERSE GIVGVRDASQ GRYVHLLPNH GQVLLYMTPE LPLSPYLSHS NGEIDEWQKM SGGVNFRMRA HTPLEMTVAS EGECHINWSG GTLEGQREGQ GWKFVFPVAD SGDATLVCS
|
| |
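The gene sequence above is listed in coding orientation, so its first codons translate directly to the start of the protein sequence. The following is a minimal sketch of that translation and of a GC-content calculation, using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCCATGA CGGTCATCAA TGGGATAACG"
print(translate(prefix))  # MPMTVINGIT
```

Passing the full gene sequence to `gc_content` and `translate` in the same way should allow the GC content and protein sequence rows to be compared against the nucleotide data.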