Gene Information |
Locus tag | Nmul_A1733 |
Symbol | |
ID | 3786210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1981098 |
End bp | 1982237 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811819 |
Product | hypothetical protein |
Protein accession | YP_412422 |
Protein GI | 82702856 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.580195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGCA TGTATCGCTT GCTTGCATTC TCGGGAGTGA CTGCTGAAGT AGTAGCCGTT CTCCTGGTCT GGTTGAGTGC GGATGACGCG CTTGCAGATT GGCAGGAAGG CGCGCTGCTG TTTATGGCGG CTGTATTTCA CTCTGCTTCA TCGTATTACC TTGCGCGCAT GTTCTGGCAG GCGCTGCCGC GCCGCTACAA GCTTCCTCCC CGTAGAAGTC TGGGATTGCT GTTTGCCTTT CTATGGATTC TGCCGGTTTT CGGGGCGCTG GGCGTGCTGT GGAGCATAAC ACGTGCATTG AAACAGCCCC GGACCCGTTC CGCGAAGAAT GTAAAGATCA TAATCCTGCC TGAGCTGCCT TTTTCACCCC CTGTCATTTT CCCTGTTCCC CCTTACAGCC AGGGAGCCCT GCGCCAGATC GTCCATTTTG CCCAGCGCTC GCTCAAGCGG TTGAAAGCGG TGATGGCAAC ACGGCATATG TCGCCGAGAG AAGCCATGGA GATCTGGTCG AAGGCTACTC GCGACCCGAT CGATGACGTA AGGCTGCTTG CCTATGCGAT GAAGGATGCC CATGAAAAGA GGCTCACTGA CCGCGTCCTG GCTTTAACCG AAGCGCTGCC ACACCTTCCT CCACGAGCAC AGAATGCCTG CCGCAAGACG ATCGCTGCGC TATGCTGGGA ACTGGTATAT CACAAGCTGG TACAGGGTGC TGTCAGACAG CACTGGCTGA AAAACGCCCG CACACAAATG GAGGTCGTAT TGGGCTCGCC ATCGATTACG CGGCGCGACG TTCCCTCTGC ATCTGTATCG GCTTCTGTGC GTGCTGCATC CGAAGCCTCT CTGGCGTCGC CCAAGCATGA GATGGGTGAA GCGTCCTCTC TATCCGGGAG CGTGAACGCC GACAGTTGGT TGTTGTATGG TCGGATTTTA TTGGAATCAG GTGAAGCCGC CCTGGCGAGA AAGGCTTTTG TCAACGCACA AACCCATGGC GCGGATCAGC AACAACTGTT GCCATGGTTT GCCGAAATCG CTTTCCGGCA GCGGAAGTTT ACCGAAGCAA AGGCCTGTTT GTCCGCGCTT GCACGTGTTG GGGAGAAAGG GCGGGAACTG GCTCTGGTAA GAGCATGGTG GAATAAATGA
|
Protein sequence | MRRMYRLLAF SGVTAEVVAV LLVWLSADDA LADWQEGALL FMAAVFHSAS SYYLARMFWQ ALPRRYKLPP RRSLGLLFAF LWILPVFGAL GVLWSITRAL KQPRTRSAKN VKIIILPELP FSPPVIFPVP PYSQGALRQI VHFAQRSLKR LKAVMATRHM SPREAMEIWS KATRDPIDDV RLLAYAMKDA HEKRLTDRVL ALTEALPHLP PRAQNACRKT IAALCWELVY HKLVQGAVRQ HWLKNARTQM EVVLGSPSIT RRDVPSASVS ASVRAASEAS LASPKHEMGE ASSLSGSVNA DSWLLYGRIL LESGEAALAR KAFVNAQTHG ADQQQLLPWF AEIAFRQRKF TEAKACLSAL ARVGEKGREL ALVRAWWNK
|
| |
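The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in Protein Length. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(1981098, 1982237)
    print(length)                           # 1140
    print(expected_protein_length(length))  # 379
```

Under those assumptions the computed values agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields. Passing the Gene sequence string to `gc_percent` would give a figure to compare against the GC content field.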