Gene Information |
Locus tag | Nmul_A2521 |
Symbol | |
ID | 3786646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2885245 |
End bp | 2886834 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637812612 |
Product | hypothetical protein |
Protein accession | YP_413202 |
Protein GI | 82703636 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase); [TIGR02914] EpsI family protein; [TIGR03109] exosortase 1 |
|
|
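The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2885245
end_bp = 2886834

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1590
print(protein_length)  # 529
```

Both results match the Gene Length (1590 bp) and Protein Length (529 aa) fields.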
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.630268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAG GAGTCCGCCA GGGCATCGCC CCGGACCCCT CACTGGCACT GGAGCATGAG GGGTTGAAAG CAGGCGCTCT GCTTGCGCTC GGGGTCATTG CGGGAATTCT CCTTGCTTAT CATGAGACAA CCTGGTCCAT GATTTCCACC TGGGCCAGAT CGGATACGTT TGCTCATGGC TTTCTGATCT TTCCGTTCAG CGGCTATCTG ATCTGGGGAG AGCGCAAGCA TCTCGCCACT CTTTTTGCGC GTCCCAATCC CAAAGTTCTG TTCGTGCTTG CGATTCTCGG TTTCAGCTGG CTTCTGGGAA CCTTGGCAAG TGTGCAGGTA TTTGCGCAAT TCATGGTGAT AGCGATGATT TCCGCCGCCG TCTGGGCAAT CCTGGGAAAT CAGATAGCAT GGGCACTGGC CTTTCCATTA GCCTACCTCC TGCTCGCCGT ACCTTTTGGC GATGCATTCA TACCGCCGCT GATAAACTTT ACCGCGGACT TTACCGTGAC AGCACTTCAG CTGACCGGAA TTCCTGTCTA TCGCGAAGGC AATTTTTTTA CCATTCCCAG CGGCAATTGG TCGGTGGTTG AAGCCTGCAG CGGGCTGCGC TACCTGATTG CATCCTTTAC GCTCGGAACC CTTTATGCAT ACCTGACCTA TCGGAGCCTC AGCCGTCGGC TGATATTTAT TGCCCTGTCT CTGATCGTGC CGATCCTTGC CAACGGTATC CGCGCCTATC TGATTGTAAT GACCGGCCAC CTGAGCGATA TGCAGCTCGC GGTGGGCATC GACCACCTGA TTTACGGATG GGTATTCTTT GGCTTTGTAA TGCTGGTGCT ATTCTGGATC GGATCCTTCT GGCGCGAAGA TGAGAACAGG TTTGAGGCCG GCTTTACAGC GCAGAGCTCG ACAGGAAATA ACCGCGCTCC TGGAGGTGAT TCCACCTTGA TAAGGACAAT TTTCACTGCT GTCGCGGCTT TAAGCATAGC CGTCATCTGG CCGATATACG CGGAGTGGCT GGAAAGGAAG ATTTCCTACG GTACGGAAGA AGTGGAAATC CATATCTTAG ATACATCCGG CAAGTGGCAA GGTAGCCCGG AACCCCTTTC CGATTGGAAA CCCATATACA CGGGTTCGAC TGCTCAGTTT CTGAGAAATT ACCGGAATAA CGGACAGTCG GTAGATCTTT ACATCAGTTA TTACCGCGAC CAGAAGCAGG GTGCTGAATT GATCAATTCC GAAAATGTGC TTGTAGCGGG AAAGGGATCA AAATGGCATG AGGCTGGCGA AGATATGCGC AGAATCCTCC TGGGTTCGCA GGAAGAAATC GTCAAACAGA ACCGGTTGCA CTCCTCTTCG GCATCGCTGC TTGTGTGGCG ATGGTATTGG ATAGGAGGTG AGGAGACCGC AAATCCGTAT TGGGCAAAGT TCCTGCTGGC CAGGAATAAG CTCCTGGGAA GAGGCGACGA TGCGGCTGAG CTCATTGTGG CAACACGATA CGAGGACAGC CTCGATGAAG CAGCTTCAGT GCTTCAGGAT TTTATTATGG ATAGGGCGCC TGCAATAACG GGCGCACTTC GAAATGCAGC CAACCGTTAG
|
Protein sequence | MNAGVRQGIA PDPSLALEHE GLKAGALLAL GVIAGILLAY HETTWSMIST WARSDTFAHG FLIFPFSGYL IWGERKHLAT LFARPNPKVL FVLAILGFSW LLGTLASVQV FAQFMVIAMI SAAVWAILGN QIAWALAFPL AYLLLAVPFG DAFIPPLINF TADFTVTALQ LTGIPVYREG NFFTIPSGNW SVVEACSGLR YLIASFTLGT LYAYLTYRSL SRRLIFIALS LIVPILANGI RAYLIVMTGH LSDMQLAVGI DHLIYGWVFF GFVMLVLFWI GSFWREDENR FEAGFTAQSS TGNNRAPGGD STLIRTIFTA VAALSIAVIW PIYAEWLERK ISYGTEEVEI HILDTSGKWQ GSPEPLSDWK PIYTGSTAQF LRNYRNNGQS VDLYISYYRD QKQGAELINS ENVLVAGKGS KWHEAGEDMR RILLGSQEEI VKQNRLHSSS ASLLVWRWYW IGGEETANPY WAKFLLARNK LLGRGDDAAE LIVATRYEDS LDEAASVLQD FIMDRAPAIT GALRNAANR
|
| |
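The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own tooling: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
# Codon assignments in TCAG order; translation table 11 uses the same
# codon-to-amino-acid mapping as the standard code ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
first_block = translate_cds("ATGAATGCAG GAGTCCGCCA GGGCATCGCC")
print(first_block)  # MNAGVRQGIA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` should, under the same assumptions, yield the full protein.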