Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2202 |
Symbol | |
ID | 3786227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2500755 |
End bp | 2502644 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637812289 |
Product | hypothetical protein |
Protein accession | YP_412886 |
Protein GI | 82703320 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGTAC TCATAACCAA TAACACGCTC GACACCCGCG GCGGCTCCGA ACTATATGTC CGCGACCTGG CCTTGGCGCT GCTGCGGCGC GGCCATAATC CAGTGGCCTA TAGTACCCGA CTTGGAGCGG TTGCCGAGGA GCTGCGCTCG GCGACCATTC CCGTCATTGA CGATCTCAAT CTGCTGACGG TTCCACCCGA CATTATTCAT GGCCAGCATC ATCTCGATGC AATGACGGCC ATGTTATATT TTCCCGATAC GCCCGCGGTC TACTTCTGCC ATGGCTGGCT GCCGTGGGAA GAAATGGCGC CACGCTTCCC CACGATCCGG CATTATGTTG CTGTGGACGA TCTCTGCCAG GAGCGACTGC AATGCCTCCA TGGCATCCCC CCCGAGCGCA TTCGTGTAAT ACGCAATTTT GTCGACCTGC AGCGATTCGG CCTGCACGCG GATTTACCTG CCATACCACG CAAGGCCCTG GTTTTCAGTA ATTACATCGG TGAGGACGGC TGCCTGGGAA TTCTGCGCCA AGCCTGTGCC GCGCGCGGCA TCGAACTCGA TGCCATCGGC CTTTCTGTCG GGCACAGTGA AGCCCGGCCT GAACGGATTC TCGGCTGCTA TGACATTGTC TTTGCGAAAG CACGTTGCGC GCTTGAAGCC CTTGCCAGCG GAACGGCTGT AATAGCCTGC GACGCCGCCG GTCTCGGGGG CATGGTCATG CCGGACAACT ATGAAACCTT TCGCGCGCTC AATTTCGGTA TACGGAGTCT GCGTAATCCC ATTACCCTGG ATACCATCAC GCGGGAACTC GACCGGTACG ATGCGCCCGG GGCACGCGAG GTTACCCGGC GCGTACGATC AGAAGCAGGT ATCGATCCGG CGATCGAACG GATTCTCCAA GTTTACCAGG AGGCAATGGA AGCGCACGTC CGCGAATCCA GGGAGAGGAA ACCGAAACGG GAAAGCCCCT CCGACTCCCG CCTTCAGTCG GCAAGCAGAT ATCTCCGCGA GATTGCCGAT TTCACCAAAA AGCGCCATCA GGTGGAGCAG GAGAAACATC TTGCTGTGGC TGAAGCCGCA GCGCAGCGTA CACGCGCTGA GCAGGCCGAG TTCGAGTTGG GGAAAATCCA CAATTCGCGT CTCTGGCCAC TCGTCATGTT GCTCTATCGG CTCAAGTATA GGTTATGGAA TCGTCCCATT GCCGTTCTTC GGGCACGTCG AGAAAATCGA TTCCGGGAGA ATCGAAATGA CCAAGACGGG AGCAAGGCTG GAGAAAACAA CCATTCGGCC GATCGCGTCA GCGTATTCGA GCAAATCTAT GTCCGCAATG CCTGGCAGAG TCCTGAATCC CGGTCAGGGC CGGGCTCCAC GCTGGAACGA ACCGAGATAC TGCGGTGCGA ACTTCCTCCT CTGCTGGCCC GCCTCGGGGT TCGCACACTG GTGGATGCAC CTTGCGGGGA TTGCAACTGG CGGCAGCATA CTGTAATCGA TCTCGATGCA TACATCGGTG TCGATATCGT TCCTGCGCTG ATAGAGGAAA ACCGGCAGCG CTTTCCCCAT TCCAACTGGA GATTCGAGGT TGCCGACCTG GTAGAAGATG ACTTGCCGCG CGGTGATGCA GTGCTCTGCC GCGATGCCCT GATCCATTTG TCGCTGACGG ATATTCTGCG GGCGCTTTCC AATATCCGCC GCTCCGGGGC AAAGTACCTG CTGGCAACCA GCCATGAAAC GACCAGCGCC AACACAGACA TCGCCACGGG CGGGTGGCGT TCCGTGAACT TGACGCTGGC GCCTTTCAAC CTTCCCCCTC CATTGGAGCG TATCGTCGAA AATCCGCAAA CCGGAAAGAT ACTGGGCATA TGGCTGCTGG CAGAGATACC GCTCTCCTGA
|
Protein sequence | MRVLITNNTL DTRGGSELYV RDLALALLRR GHNPVAYSTR LGAVAEELRS ATIPVIDDLN LLTVPPDIIH GQHHLDAMTA MLYFPDTPAV YFCHGWLPWE EMAPRFPTIR HYVAVDDLCQ ERLQCLHGIP PERIRVIRNF VDLQRFGLHA DLPAIPRKAL VFSNYIGEDG CLGILRQACA ARGIELDAIG LSVGHSEARP ERILGCYDIV FAKARCALEA LASGTAVIAC DAAGLGGMVM PDNYETFRAL NFGIRSLRNP ITLDTITREL DRYDAPGARE VTRRVRSEAG IDPAIERILQ VYQEAMEAHV RESRERKPKR ESPSDSRLQS ASRYLREIAD FTKKRHQVEQ EKHLAVAEAA AQRTRAEQAE FELGKIHNSR LWPLVMLLYR LKYRLWNRPI AVLRARRENR FRENRNDQDG SKAGENNHSA DRVSVFEQIY VRNAWQSPES RSGPGSTLER TEILRCELPP LLARLGVRTL VDAPCGDCNW RQHTVIDLDA YIGVDIVPAL IEENRQRFPH SNWRFEVADL VEDDLPRGDA VLCRDALIHL SLTDILRALS NIRRSGAKYL LATSHETTSA NTDIATGGWR SVNLTLAPFN LPPPLERIVE NPQTGKILGI WLLAEIPLS
|
| |