Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1736 |
Symbol | |
ID | 3786038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1986197 |
End bp | 1989205 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811822 |
Product | hypothetical protein |
Protein accession | YP_412425 |
Protein GI | 82702859 |
COG category | [S] Function unknown |
COG ID | [COG3868] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.525409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGAAGCA CCTTAATACA AAATGCGTTC CGGTGCGCCC CACTGGCTTC CGGAGCAGAA AGGATATCTC GACAAGTTTT GCAAACTGCC ACACCCAATC CCATTATTCC CGCCACTGAC ATCGGAGACA GGGGGGGGCT GCGTATCTGT AAAGGTCGCA AAGATCATGG CGGCATCCGC GACAACCTGC CTCAACCATC AGGATTCCGC TTCCTGTTCC ATTTTTCCGG TGCGATCCGG AGCGAAGCAT CGTCGCTCTG TTCAGGCATC TGGCAACTGC TTCTGTTACT GATGCTGATG CTGATGGCAT CAGCGGCCCA CTCAGACGGC CTGGAGCTGG AGAAACGCGA TATTGTGTTT TACTATGGCA GCAGGCCCCC TGTCGAAGAT CTGCGCCATT TCGACCAGAT CGTCATCCAG CCAAGCCAGA TTCTGCCGCA TGAGAAAACA GCGCTGTTGA ATCTGGACTC TTCACTCGTC TTCGCCTATG TCTCGTTTGG TGAAATCGCC CGCAACAGCG AAGATATGGC GCGTATCGAT ACGGACTGGT CGATAGGCGT CAACCCTGCC TGGAACAGCC TGGTAATGGA CATGACCGAT CCTGCCTGGC GCGAATACCT GCTGAAGCAG CACTTCGAAC GCTTGTGGCG TGACGGTTAC CGCGCCTTTT TTCTCGATAC AGTGGACAGT TACCTGATCG TGACGCCGGA AGGCAAACAA CGCGAAGAGC AGGAAAAAGG ACTGGTTTCA TTGCTCGCGG AAGTCAAGCA GCGTTTTCCA GGATGCAAGC TCATTCTCAA CCGGGGCTTC GAAGTACTGG ACCGGGCCAG CCAGTATGCC GATGGTATGG TGGCGGAATC TCTTTTTCAT GGTTTTGACC CAGTTAGCGG TAAACACGCC CCAACCAAGG AAGAAAACAG AAAGTGGCTG CTCGATCAGC TGGTGCGGGC GCAAAACGAA TTCAAGGTGC CCATTACGGT GCTCGACTAT GTCGATCCGG GGAACTGGGT AGAGGCGGAA AAAACGGCGC GTGACATTCT CAAGCTGGGA TTCATGCCCT GGGTGGCCAA TGGAGACCTG ACGTGGCTGG GGCAGGGACG AATACGCCTG GCTCCCCGCA AACTGCTGGC TATCGTTAAT GGAACGCCCG CGCAGCAGAT GGAGCATGAT CTGTTTCGGC ATGCTGCCAT GCCGCTCGAA TATCTTGGCT TGGCGCTCGA CTATTGGTAC ATCGATCAGC TTCCCTTGCC GATTGAACCG CTCGTCGGGC GGTATGCCGG GATTATCGCG TGGCTGGGGG AGGACAGTGC GAACGGTACC GAACGGTATG AGAGTGTCTG CGCACGGCTG CAGTCAGAAG CGAATGCGAA CCTGCCCATA GTGTTCATGG GTTATCTGCC GGCAGGTGTT GCGTGCCGGA ATCTGCTGGA TTATCAGGGT GAATTGTACC CGACGACCGG CATATTGAAA CTCGATGCCA TGAATGATCG CCTGGGACGC CCTGAGTCCG CTCCGGTTAT CGGCAGCGGC ACGCCCGATG TACGGGTTCG AGACCGCAAT AACGCATGGC TCACCTTGAG CAGTGCTGAC AAAATATTTC ATCCGATTGC GGTGACGAAC TGGGGCGGGT ACGCGTTACA TCCCCATATC CTGAGTGAGA GCGTCTCCGG GCGGCACGAA TGGCTGCTCG ATCCTTTTGT TTTTTTCCAG GCGGCGTTGC GTCTGCCGGT GCAGCCGGTA TTTGACATGA CAACGGAAAA CGGGCGGCGC CTGGAGATTA TCGAGGTGAG GGGAGATCGC CTGTTTGCAA AGGATGAGCA GGGTGTGGAG GCAATCGACC GGCTCAGGGG CTGGATGGAA AAAAATCCGC TGCCGATAAC ATTGGGTGTC ATTGAAGCGG AAGTAGCCAC CGAAGAGCAG CAGAGCAAGC TGCGCCAGGT GGCCAGTTTG CCCCAAGTTC GTCTGGCAAG CCATACCTAC AGTCACCCTT TTTATTGGGG GGTGTTCGAA GGTAAGACAG ACGCGGATCA ACAATCCTAC CGATACAGCG TATTCATGAA GGATTACGCG GCGGAAATGA CGCGCGAAAC AGGTGGCACT CTTCCATTCC TGCGTACGAT GGCACCGGAT TCCCCTCCGT TATTGATCTG GTCTGGGGAT GGCAAGCCAG GGCCCGCAGT GCTGGCCGCC GCGCAGAAAG CCGGGCTGCC GCATTATGGC GGAGGGGGGC TGCATTGGCA GAGTGGACAG ATATCGCTTG CAGACCTTGA TCCCGCACTT CGTCCGACAG AATGGGGTAT TCAGGTGATG ACGCCGCTGA TTGCCGAACC TCTGTTCGCG CAGCTCTGGT ACGGCGAAGC CCTGAATTTT GGTAAGGTTG GCGAATGGAA CCGGGAGCTC GACCTTGCAC GCCGGTTGCG TGTTTCGTCC ATATCGATTC ATGCCGATGC GTTCCTGAAC GAACGGGGGA GGGAGCTTCT GGAGCAGATG GCGGGGGCGC AGCGGAAGGA AAATGTGCTC GGCATCTGGG TCGATGAATA TGTCACTCGC GTGCGTGCGT TCCAGACGGC AAGCATCGCA CGCGATCTCG ACGGCAATTG GTCTCTTTTC GGGGATACCC TCAGAACAGT GCGGTTGCCC CCGACCGAAA TGACGCCGGA AATCTCCAGC GATGTAGTTG GATATAATGA TCGCAACGAC AACCGCTACA TCCACCTCGC TCGGAATCAC GCAATCCTGA AAGCCGCTCA AGGTAAACGG AACGGAAATC CGGGACTGAG ATTGATCGAG GCGAGTGCCC CCCTTAAATC CTGGCATATC AACAATGATG GTTCCGCCAC GCTTTTATTT GAATCGGGCG GCCGTTTGGC GCCCCTGACC GTCACGGCTC CCGCTTCCTG CGCCTTGAGC ATAAATGACA CAAAGTTGAT ACCGCAGGTG AAGGGTGCAC ATTCCGTTTA TACAGTTCCA GGAAACCTGA CAGCGGGAAA ATTCCGGCTT GAGTGTTGA
|
Protein sequence | MRSTLIQNAF RCAPLASGAE RISRQVLQTA TPNPIIPATD IGDRGGLRIC KGRKDHGGIR DNLPQPSGFR FLFHFSGAIR SEASSLCSGI WQLLLLLMLM LMASAAHSDG LELEKRDIVF YYGSRPPVED LRHFDQIVIQ PSQILPHEKT ALLNLDSSLV FAYVSFGEIA RNSEDMARID TDWSIGVNPA WNSLVMDMTD PAWREYLLKQ HFERLWRDGY RAFFLDTVDS YLIVTPEGKQ REEQEKGLVS LLAEVKQRFP GCKLILNRGF EVLDRASQYA DGMVAESLFH GFDPVSGKHA PTKEENRKWL LDQLVRAQNE FKVPITVLDY VDPGNWVEAE KTARDILKLG FMPWVANGDL TWLGQGRIRL APRKLLAIVN GTPAQQMEHD LFRHAAMPLE YLGLALDYWY IDQLPLPIEP LVGRYAGIIA WLGEDSANGT ERYESVCARL QSEANANLPI VFMGYLPAGV ACRNLLDYQG ELYPTTGILK LDAMNDRLGR PESAPVIGSG TPDVRVRDRN NAWLTLSSAD KIFHPIAVTN WGGYALHPHI LSESVSGRHE WLLDPFVFFQ AALRLPVQPV FDMTTENGRR LEIIEVRGDR LFAKDEQGVE AIDRLRGWME KNPLPITLGV IEAEVATEEQ QSKLRQVASL PQVRLASHTY SHPFYWGVFE GKTDADQQSY RYSVFMKDYA AEMTRETGGT LPFLRTMAPD SPPLLIWSGD GKPGPAVLAA AQKAGLPHYG GGGLHWQSGQ ISLADLDPAL RPTEWGIQVM TPLIAEPLFA QLWYGEALNF GKVGEWNREL DLARRLRVSS ISIHADAFLN ERGRELLEQM AGAQRKENVL GIWVDEYVTR VRAFQTASIA RDLDGNWSLF GDTLRTVRLP PTEMTPEISS DVVGYNDRND NRYIHLARNH AILKAAQGKR NGNPGLRLIE ASAPLKSWHI NNDGSATLLF ESGGRLAPLT VTAPASCALS INDTKLIPQV KGAHSVYTVP GNLTAGKFRL EC
|
| |