Gene Information |
Locus tag | Nmul_A1702 |
Symbol | |
ID | 3784801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1941849 |
End bp | 1943831 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811789 |
Product | Phage integrase |
Protein accession | YP_412392 |
Protein GI | 82702826 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
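The coordinate and length fields above can be cross-checked with a short sketch. It assumes inclusive, 1-based Start bp and End bp values (an assumption, although the arithmetic agrees with the listed Gene Length) and that the CDS ends in a single stop codon not counted in Protein Length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(1941849, 1943831)
print(length_bp, protein_length(length_bp))  # 1983 660
```

Both results match the Gene Length (1983 bp) and Protein Length (660 aa) fields.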
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA GAGTCAACTT GCATCTCGGG GGCAATTCAC ACCAAATGGG ACGGATTGCC GATAAGAATG AATTCCCTTG GTATCAATCT CGCAGGTTCA ACATTTTTGG ACTTGTATTT CTGACAAGCG CCGCTATCAG TCTCGCTTAC GTTTACAGCC AGCCGGCAAT TTATCGCAGC AGCGCGACAT TGCTTACTTC GGCCATGACG CCCATCGACC AGCAGAGCGA TGATGCCGAT ATTCAACACG TGGCAATCCA GCGCCAAATC CTGCTGGGGC GGGAGCTTGC CGCTGAAACC CTGAGGAAAT TGAAGGCTTC TCCCGCCGGT AAATCCTTAA ACCGGTTGAC TGAATCCGAT ATTCAGACTT TGTTGACCGT CGAGCCTGTT CCAGAAACCA ATCTCGTCGA GGTTGCGGCA GAAGGCTCGG ATCCCCGCTT TCTGCCTTTG CTCATCAATA CATGGATAGA CGTATACCTG GAAGCGCGAA GTGAAGAAGT AAAACGTTTG AAGAGCAATA CCGAGCGTAT TGTCGAGGGT GAACTGGAGG AGTTGGCTGA CAAGGTTGCT GCCGCGAGGA CGGCGCTGGA GAACTTCCGT GAAAACAATA ACATTCTATC GACAGAGCGC GAGGAGAATG AACCATCCGC CCGGCTGAAG GGGCTGAACG AGTCGCTTAA CAAGGCTTCC GAGGCGGAGG TCAAGGCGAA AGCGGAACTG GATGCCATCA GAGCTGCAAT TGCCCGGGGA AAGGCAGTGG TTCCCGACGA TGAGAAAATC AGCTTGGCCG ACCTCGAGAA GCGTCTGCAT GATTCACGTG AAAAACTGGC AGAATTCGAT AAAAAATTTA CCCGCGAGTA CTTAAACCTG CAGCCCGACC TGAAGTCTCT TCCTGAGCAA ATCAAGGAAC TCGAGACTGC TATCAACAAT AAGCGGTTGC GGGGTCAAAA TGTAGCTTTG AATGAAGCCG AAGTGAATTA CGCGGCCGCA CGACAGACCG TCAGGGATAT TCGCGAGCAA TTGGGCGAGC ACAGGCAGCG GGCATCGGCC TTCGCGACAA AATTTACCCA GCACGAGGCG TTGAAGACCG ATCTGGAAGG ACTGGAAAAG CTGTATCGGG ATACGCAGGA ACGGCTGGTA CAGGTGAAAA CCGGCCGCAA GGACAAATAT CCACAGGTCG ATGTGATTAG CAGGGCATAT GAATCAAGGG AGCCAGTAAG ACCGAACTAC AGTCGTGAAG CCCTGATAGC GCTCGGCGGC TCCCTGCTTC TGGGATTGAT CAGCGTGTGG GTTTTTGAGT ATCTGACCCA AAAGAAGGAA CAATCGCCTT CCATCGCAGT ATTCGGTGTG GGGAGATATA CCCCGCCTGC TACCGAGCTT ATTGATCATC CTCACGCCGC GCTCGGGACC AAAGCCGAAC CGTTAGAGCA GAAAACCAAC TTTGCCTTGC CAAAACCGGT GCACCGGGAA CTTTCGAGCC ATCAGTTACG GACCCTGATC AATGCCGCCA ACCTGAAGGG GAAGCAGCTT ATCGGTTTGC TGCTCAGCGG CCTCGACATC AATGAAGCGG CGTCCTTGAA GGCGAATCAG ATCAACGAGC GGGCCAATGT CATCAACCTC GAAGGAAGAA CTCCGAGGGC GGTGCCCCTC AGCAATCCGC TCAAATCCTT GCTCCAGCAT TCCGGTAACC GCCCGGTATG GGATCCGGAT GATTCGCAGA CACGCGTCGA TCTGGCAACC GCCCTGGTTT GCGCCGCAAT CGATTCGGGA TTGCCTGACC CCCAGGAAAT CACCGCCGAG 
ACGATCCGGC ATAGCTACAT TACTTATCTT GTACGCCAGG GCTTGCGGCT TTCCGAGCTG GAGCAGATCG TCGGTCATCT GGACCCTTCG GTGATTTCGA GTTATGGCAC CTATTCGCCG CCCCAGCAAG GCCGTCCTCT GCACGAGATC GAAGTGCTGC ATCCAGCGTT GCTCATAATC TGA
|
Protein sequence | MKNRVNLHLG GNSHQMGRIA DKNEFPWYQS RRFNIFGLVF LTSAAISLAY VYSQPAIYRS SATLLTSAMT PIDQQSDDAD IQHVAIQRQI LLGRELAAET LRKLKASPAG KSLNRLTESD IQTLLTVEPV PETNLVEVAA EGSDPRFLPL LINTWIDVYL EARSEEVKRL KSNTERIVEG ELEELADKVA AARTALENFR ENNNILSTER EENEPSARLK GLNESLNKAS EAEVKAKAEL DAIRAAIARG KAVVPDDEKI SLADLEKRLH DSREKLAEFD KKFTREYLNL QPDLKSLPEQ IKELETAINN KRLRGQNVAL NEAEVNYAAA RQTVRDIREQ LGEHRQRASA FATKFTQHEA LKTDLEGLEK LYRDTQERLV QVKTGRKDKY PQVDVISRAY ESREPVRPNY SREALIALGG SLLLGLISVW VFEYLTQKKE QSPSIAVFGV GRYTPPATEL IDHPHAALGT KAEPLEQKTN FALPKPVHRE LSSHQLRTLI NAANLKGKQL IGLLLSGLDI NEAASLKANQ INERANVINL EGRTPRAVPL SNPLKSLLQH SGNRPVWDPD DSQTRVDLAT ALVCAAIDSG LPDPQEITAE TIRHSYITYL VRQGLRLSEL EQIVGHLDPS VISSYGTYSP PQQGRPLHEI EVLHPALLII
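The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and relies on the fact that translation table 11 assigns amino acids exactly as the standard code does, differing only in permitted start codons. The helper name `translate` and the use of `X` for unrecognized codons are choices made for this example.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA; whitespace is ignored and '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 21 and last 12 bases of the gene sequence listed above.
print(translate("ATGAAAAACA GAGTCAACTT G"))  # MKNRVNL
print(translate("CTCATAATC TGA"))            # LII*
```

The two fragments reproduce the first seven residues (MKNRVNL) and the last three residues (LII) of the listed protein sequence, followed by the stop codon.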