Gene Information |
Locus tag | Nmul_A1374 |
Symbol | |
ID | 3784469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1565077 |
End bp | 1569165 |
Gene Length | 4089 bp |
Protein Length | 1362 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811462 |
Product | hypothetical protein |
Protein accession | YP_412069 |
Protein GI | 82702503 |
COG category | |
COG ID | |
TIGRFAM ID | |
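
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1565077
END_BP = 1569165
GENE_LENGTH_BP = 4089
PROTEIN_LENGTH_AA = 1362


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the table: the coordinate span gives the listed gene length, and dividing it into codons minus one stop codon gives the listed protein length.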
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAACCA AGTACCTCAG CAATTACTAC AGCTATAAAT TTGTCCCCCA GGTTCTGAAA TTCGGGGTAA TCGAAGTTGA TCCTAAAGCA GCGCTGGAAA AACAGGTGCA GGAGCATCTC GCAAGCGGAC ATCGCTATTT TTTCCAGGAT GAATTTCAGA ATGCGCTCAC CGAGTATCAG ACTGCCTACG CCCTGCTGCA CAAGTTTCTT CACCCGCAGT TTCCGGTAGA CGTTACTGCA ATCGCAACTG TTGTTCTGAA GCCTCTGCAA CTGACTGAAG CAATGATTGC TGCGACCGCG CAGGTGGCGA AATATCGTTC CACTATCACC GCCGGCCCCA TCGTAGCAGC GGGTAGCCCC CCGCGAGAGA TCTCCTCTGT TGTACAGAAA TTCAGCGGTA CAGGCAGCGC GGTTGTCGCG CCGCCTGCAG CCGTTCTTTA TGAGCAGGCT GCCGTATATC TTCAGGCCGG GGCAACTTCC GAAGCACAGC CCATCATCCA GCAGGCACTC GAACTCAATG GGGGACGGGA TCTGGAACTG GAATCCAACC TGCTGGTGGC ATCCGGCATT GCTGGTGTTC AACGGGGCGA ATTTGACAAT GCCCGGGATA ATTTCGCGAA AGCGTTCGAA TTGGCAAGAC GCGTGAGCGC AGTGCCTGGG ACTGCGCCGG GGGCAGCGGG ACCGGGGGTG GCTGGCATAC CGGGAATCAC GGATGTAGCG GGAGAGGGCG CTGCGGCAGC TGTGCCGCCC GCCCGAACTG CGGAATTGGG TGCAATCAGC AATAACATCG GCGTGGTTTC AGCCTTGACG GGCGACGCGC GTAGCTCTGG CGCGGCTTTC CAGGCGGCGG GAGATTCAGT TCCGCTTTCG TTCGGGAGGA CGCTTACACA ACCGCTCAAC CCCGGCACCG CCACTGCAAT GCAGAGGCCG ATGGGCAGCG AAGGGCTCGG CTTCATCCTG AATTCATCCT CCGGTTCGAA CCAGTATATG AATGTGTCTC CCACTGTTTC TGCCGCAAGC GCCGCACAGC AGTTCGGGGT ATTCAAGGGA GACCAGATCG TGAGTGTCGA TCTTCAGGGC GATGCGGTTG CAAACCTTAC CGCAACCCTC TACCAGCCGC GTATCACCGC TACGGCACTT CCGGAGCTCG CCACTTTCGA AATTCTCGAA ATCAATTTCA TCGCCTACAT CCCTCATACC TACGGCTTCA CCCTTCCATT AAGTCTCGGT GACTGCCATC TCGAACTGGG CGATTATGAG GAAGCCATCA GTTGGTACGA AAAGGCGCGG GACTATCCGT ATCTCAACCA GGGCATCGAG ACGCCAATGG TCTGGCTGAA ACTTGCCAAT GCCTATCTCC GCTGGGGCCA TTTCCTGTTT GAGGGGGGCC AGAAGGCCGA AGCGAGAACG CGGTATGAGC AGATCGTGCG CCTGACCGAC CCGATCCTCG ATCCGGCTTC ACCCCTGTAT AAATCGCCCG TTTTCGACGG CCTTGCGGCA CAGGTGCAGG CGATTCTGGT CGCGCCGCAG CCGCTTGATC CCGAGGTGCA CAACCCTTCG ATTGCGTCAG TCGTACTGCT GGCCAAGCTC AACATACAGA ATATCCAGGA CGGCATCGAC TTCCCGTTAC TCAGCCTGGC GCGCGAGCAG GTCCCGGTAT TCCGCTTCGA TTACCTGCAG AATGCCGCGC GTTATTTCGC GGAACATGCG ATTCAGGCGG AACGGACTTA TATCAATTTC AAGACCAGCG CCGAGCAGGA GGAATTCCAG CGCACCATGT TGCAGAATGC GGTCGATCTG 
GAGGCCGCCA ACGAACAACT CGAGATCAAG AAAGTCGAGA TTGCGCAGGA GCAGAAGGAG GCCATCGAGG CGAACCAGGC CTATGCGAAT ACCCAGCTCG AAAACGCCCG GGATCTGAAA GAGGAATATG CGGATGTAAG CCTGGAGGAG ATGGCACTCG ATGCGGAGAT CACTTACGTA GGCGCGCCTA CCACCGAATA TGACTTCTCG GGCTATGAGG AATATGGGAT CTCGAACGGG ACCCATCGCG TCGATGAGGT GCTGCGGACC TTGACACGCC GGCGCCGCGA GATATCGCGA GATTTCGAAC TCAACAACAT GGACCGGCGC ATCAGCGAGC TGGAAGCGGC AAAGAGGGTC GCTGATGAGC AGGTGGACAT CGCTACCAAA CAGAAAGAAG CTGCCGATAT CCAGAAGAAC ATCGCCACCC TGCGCAAGCA ACAGGCCGAG CAGCAGCTGG CGCTGTTCGA TTCGCAGGAA TTCACGCCTG ATCTGTGGAA TCGGCTGGCA AACGAAATTC GCCAGATTTC CCAGTCATAC CTCAGCCAGT CGATCGTCAT AGCCCGGCTG ATGGAGCAGG CATACGAGTA CGAGGTCGGC AAGGCCGTGA ATATCATCAA GCCATCGTAT AACCGGAATG ATCTGGCAGA TCTTCTGGCT GGCGATTTCC TGCTGCGTGA CATCGACTCA TTCACCTTCA TGCGGATCAT CCTGGGCGAG AAAAAGCAAC CCATGAAGGA AATCATTTCA CTGTCAGACC GCTACCCGGT CCAGTTCCTG CGCGATTTCC AGCGCACGGG TACCATGGGT TTCCGCACGG AGTTGAGTGA TTTCGACCGG AATTATCCTG GCGCTTACCT GCAGCGCATC AAGCGTGTCG AGGTCATCGT CGAAGGACTG ATCGGCCGGG GAGGAATCCA TGCCAGCCTC ACCAATACCG GACTCTGCCT CTCGCGCATG CGCAATGGCG GCATCAAAAT GCGGTTGCTG CAACCGGAAA CGCTGTTGCT GTCGCAATAC CGCATCGCCG CCGACGCGGT GATCTTCACG CCCGATGGGG AAATGCTGGG AATTTTCGAA CATAGCCCGG TCTCGACCTC ATGGGTATTC GAGCTCAATC CCGCGGCCAA CGACATCGTC CTGAACTACA TCACCGACAT CAAACTTGTG ATCTATTACG AGTCGTTCTT CGATCCCAAC CTCAAACCCA GAGTGCTTGA AGAGCTCGCA GTCACACAGG TCAATGCCGG GCGTCGTACC GTCGCGCTTC GGTACGAGCT TTTCGACGAG TTCTTTGCGT TCCAGGATAC GGGTGAGGTG ACGTTCACGC TTCGATCGAC CATGCTGCCT TTCTATCACC TCGATCCGCG CGTGCGCGAA CTCACGTTCC TGATCGAAAC GGAGGAGGGC GTTTCTGCCG AAAACCTCAC CGTTATCGTG TCCACTGCGG ATGGCACAAC CGCCACCCAG TCCACTACAG CGGATGGCGC ACTTTCGACC GGCGGATCGA GTCCGCTCAA TGCATTCATT GGCAAGCCCT GGTTACAGGA GTGGAAGATA ACCATTCCGG TCGCCCAGAA CCAGGCTCGT TTCGATGCCG GTTTCGAATG GTCGCAGGTA CAGAACATTG TAATGACGAC AGAGTACGAA TTCACCCCTC GCCGCATTCC GGGGCAACCA TACCTTCTCC TGCTCGACCG CTTTGATGCC GATACGCTCG CAAATTTCGA TGTAGTCGAT GATCCGCAGG CGACGGTCTC CGCACCCTCG CAATGGGTTT TCAACGCGGC AGCACAGCGG ATCGACCAGA 
TGTCGAGTAT ACATGGGGAT GCATTCGAGC CTGGTGCCAC GGGCCCGGAG AAGCCGGGCA CTTACCTGGT GCGCAAAACC ACGACGGAAC TCCCGGCGAT TCAGGATCAT ATCGTCGCAG TGGATGTCAG CTCCGAGGAT AACGGCGGTA TTGGCGTCGT GTTCCGCTGG CAGGACGTGG ACAATTTCTA TTACTTCCTC ATGGATGGGC AGCGCAACTA CCGCCGCATG GGCAAGAAGG TGGGCGGCGT ATTCCAGGAA CTGGATACGA AGGCCCTTGA TGATACGCAT GGGTATGAAA CGGGTACAAC CCATCGGCTG AGAATCCGCC TCGGTGGATC GGAGATGAGG GCCTATCTCA ATGATGAGCA AATCCTTCTC GGGCAGGATG CCTCGCTGCC GAATTCAGGC CGCGCCGGTC TTTTCTGCTG GGGCAGCGCA GGTGCGCATT TCGACAATTT CCGGATTGTC GCGCTCTGA
Protein sequence | MATKYLSNYY SYKFVPQVLK FGVIEVDPKA ALEKQVQEHL ASGHRYFFQD EFQNALTEYQ TAYALLHKFL HPQFPVDVTA IATVVLKPLQ LTEAMIAATA QVAKYRSTIT AGPIVAAGSP PREISSVVQK FSGTGSAVVA PPAAVLYEQA AVYLQAGATS EAQPIIQQAL ELNGGRDLEL ESNLLVASGI AGVQRGEFDN ARDNFAKAFE LARRVSAVPG TAPGAAGPGV AGIPGITDVA GEGAAAAVPP ARTAELGAIS NNIGVVSALT GDARSSGAAF QAAGDSVPLS FGRTLTQPLN PGTATAMQRP MGSEGLGFIL NSSSGSNQYM NVSPTVSAAS AAQQFGVFKG DQIVSVDLQG DAVANLTATL YQPRITATAL PELATFEILE INFIAYIPHT YGFTLPLSLG DCHLELGDYE EAISWYEKAR DYPYLNQGIE TPMVWLKLAN AYLRWGHFLF EGGQKAEART RYEQIVRLTD PILDPASPLY KSPVFDGLAA QVQAILVAPQ PLDPEVHNPS IASVVLLAKL NIQNIQDGID FPLLSLAREQ VPVFRFDYLQ NAARYFAEHA IQAERTYINF KTSAEQEEFQ RTMLQNAVDL EAANEQLEIK KVEIAQEQKE AIEANQAYAN TQLENARDLK EEYADVSLEE MALDAEITYV GAPTTEYDFS GYEEYGISNG THRVDEVLRT LTRRRREISR DFELNNMDRR ISELEAAKRV ADEQVDIATK QKEAADIQKN IATLRKQQAE QQLALFDSQE FTPDLWNRLA NEIRQISQSY LSQSIVIARL MEQAYEYEVG KAVNIIKPSY NRNDLADLLA GDFLLRDIDS FTFMRIILGE KKQPMKEIIS LSDRYPVQFL RDFQRTGTMG FRTELSDFDR NYPGAYLQRI KRVEVIVEGL IGRGGIHASL TNTGLCLSRM RNGGIKMRLL QPETLLLSQY RIAADAVIFT PDGEMLGIFE HSPVSTSWVF ELNPAANDIV LNYITDIKLV IYYESFFDPN LKPRVLEELA VTQVNAGRRT VALRYELFDE FFAFQDTGEV TFTLRSTMLP FYHLDPRVRE LTFLIETEEG VSAENLTVIV STADGTTATQ STTADGALST GGSSPLNAFI GKPWLQEWKI TIPVAQNQAR FDAGFEWSQV QNIVMTTEYE FTPRRIPGQP YLLLLDRFDA DTLANFDVVD DPQATVSAPS QWVFNAAAQR IDQMSSIHGD AFEPGATGPE KPGTYLVRKT TTELPAIQDH IVAVDVSSED NGGIGVVFRW QDVDNFYYFL MDGQRNYRRM GKKVGGVFQE LDTKALDDTH GYETGTTHRL RIRLGGSEMR AYLNDEQILL GQDASLPNSG RAGLFCWGSA GAHFDNFRIV AL