Gene Information |
Locus tag | Nmul_A1993 |
Symbol | |
ID | 3785017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2289280 |
End bp | 2290806 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812082 |
Product | hypothetical protein |
Protein accession | YP_412680 |
Protein GI | 82703114 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
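The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2289280, 2290806)
print(length, protein_length_aa(length))  # 1527 508
```

The strand does not affect either value: a gene on the minus strand spans the same coordinates and is simply read as the reverse complement.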
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCAC CGCTGCTGAT CGCGCAATCC CACCATCCCC TTGGTTTATT GCCCGGCTTC GCCAACCGGC ACGGCCTGAT CACGGGTGCG ACCGGCACAG GCAAGACCGT TACGCTTCAG GTGCTGGCAG AACGCTTCTC GTCGATCGGC GTGCCCGTAT TCATGGCGGA CATAAAGGGC GATCTGGCAG GTCTCTCCTT TCCTGGCGCC GATTCAGCCA AAATGAAGGA GCGGCTTGCG CAACTCCACC TTCCGGAACC TGAATGGACT GCATATCCGG TTACGTTTTG GGATATTTAC GGAAAAGCCG GACATCCGGT GCGCACCACC ATATCCGACA TGGGCCCGCT GCTGTTGGGC CGGTTATTCA ATCTGAACGA AACGCAGCAG GGTGTGCTGA CACTGGTATT CAAAATTGCG GATGATAACG GACTTCTGCT GCTTGACGTC AAGGATCTGC GCGCGCTGCT GCAGTTTGTC GGCGACAAGG CGAAGGACTT CACGGTGCAG TATGGTAACA TTTCCGCCGC CTCGATCGGT GCCATTCAGC GTTCGCTTCT GCAGATCGAG CAGCAGGGGG GGGATATCCT GTTCGGCGAA CCGATGCTCG ATATTCATGA CCTGATGCAG GTGGACAGCA GAGGGCGGGG GATGATAAAC ATTCTCGCAG CCGACAAACT GCTGAACGCT CCCAAGTTGT ATTCCACCTT TTTGCTGTGG ATACTGTCAG AGCTGTTCGA ACATCTTCCT GAAATAGGCG ATCCCGAGAA ACCGAAATTG ATATTTTTCT TTGATGAAGC CCATCTTTTA TTCAATGATG CGCCCCGTCC TCTTCTGGAA AAAATAGAGC AGGTCGTGAG GTTGATACGA TCCAAGGGAG TGGGCGTGTA TTTTGTAACA CAAAATCCCC TCGATGTTCC TGAAACTGTT CTGGGACAGC TCGGCAATCG CGTGCAGCAT GCCCTGCGCG CGTTTACGCC GCGCGATCAG AAGGCGGTGC GGTCGGCTGC GCAAACCATG CGGGCGAATC CCGGTCTGGA TACCGAAAAG GTGATCAGTG AGCTTTCGGT GGGTGAAGCG CTCGTTTCGT TGCTGGACGA GAAAGGCCGT CCCGCCATGG TTGAGCGGGC GTTTATTTTG CCGCCTCAAT CGAGGATAGG ACCGGTCACC GATGCGGAAC GCGCAGGTAT CATCAAATCC TCGATGGTTT TCGGCCATTA CGAGCAACAG GTGGACCGTG AGTCCGCATA CGAAATACTC GGAAGTCGGG CTCCGATGAC GCAGAATACC GGGAATGAGG AAGCTCCCGT CCGTGCCAGG GTAAAAAAGA CCGAATCCGA AGAGGGTATG ATGGAGGTAC TGGGTGATAT GCTGCTCGGC AAAACCGGCC CGCGCGGTGG ATATAAACCT GGCATTCTCG ATACCGCTGC ACGCAGCGCG GCGCGCTCCA TCGGTTCGCG TGCCGGCCGT GAAATCTTCC GCGGGATTCT GGGAGGTATG TTCGGCGGCA GTAACCGCAG ACGTTGA
Protein sequence | MAPPLLIAQS HHPLGLLPGF ANRHGLITGA TGTGKTVTLQ VLAERFSSIG VPVFMADIKG DLAGLSFPGA DSAKMKERLA QLHLPEPEWT AYPVTFWDIY GKAGHPVRTT ISDMGPLLLG RLFNLNETQQ GVLTLVFKIA DDNGLLLLDV KDLRALLQFV GDKAKDFTVQ YGNISAASIG AIQRSLLQIE QQGGDILFGE PMLDIHDLMQ VDSRGRGMIN ILAADKLLNA PKLYSTFLLW ILSELFEHLP EIGDPEKPKL IFFFDEAHLL FNDAPRPLLE KIEQVVRLIR SKGVGVYFVT QNPLDVPETV LGQLGNRVQH ALRAFTPRDQ KAVRSAAQTM RANPGLDTEK VISELSVGEA LVSLLDEKGR PAMVERAFIL PPQSRIGPVT DAERAGIIKS SMVFGHYEQQ VDRESAYEIL GSRAPMTQNT GNEEAPVRAR VKKTESEEGM MEVLGDMLLG KTGPRGGYKP GILDTAARSA ARSIGSRAGR EIFRGILGGM FGGSNRRR
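As a spot check that the gene sequence and protein sequence rows agree, the first and last codons of the gene sequence can be translated by hand or with a short script. This is a minimal sketch, not the database's own pipeline: it builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs only in which codons may act as starts), and it assumes the gene sequence is shown on the coding strand, as its leading ATG suggests. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon-to-amino-acid assignments in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCACCAC CGCTGCTGAT CGCGCAATCC"))  # MAPPLLIAQS
print(translate("AACCGCAGACGTTGA"))                   # NRRR
```

Both results match the first ten and last four residues of the protein sequence row.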