Gene Information |
Locus tag | Nmul_A0107 |
Symbol | |
ID | 3786374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 113563 |
End bp | 115197 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637810177 |
Product | recombinase |
Protein accession | YP_410808 |
Protein GI | 82701242 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
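
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(113563, 115197)
print(length_bp, protein_length(length_bp))  # 1635 544
```

Under those assumptions the computed values match the Gene Length (1635 bp) and Protein Length (544 aa) fields.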
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAAAG CCTATTCATA CATTCGAATG TCCACAGATA AACAGCTCAG TGGAGATAGT CTTCGGAGAC AACGTGAAGC CTCAGAAAAA TACGCCAAGG CTCACGGTCT AGAGTTGATC GAGACGTTAG ATGGGAAAGA TTTGAGAGAT ATCGGCGTGT CCGGATTCAA AGGTGCTAAC TCTAGAAGGG GGGTTCTTTC AGTTTTTCTT GAGCATCTTA ACGATGGGAA GATTGACGCT AATAGCATTC TCTTAATCGA AAGCTTCGAT CGGTTATCCC GTGCGGACGT CCTTGATGCA TTCTCGTTGT TTACCAATAT TTTAAACAAG GGTATCGAGA TCATAACGCT GTCTGACAAT CAGCGGTATA CGAGAGATTC AGTAAGGGTA AACCCTGGGC AGCTTTTTCT AACTATTGGA ATAATGACCA GAGCAAATGA GGAATCTGTA ACTAAGTCTA AACGAGGTCT ATCAGTTTGG AACAATAAAA GAGACAACGC GCAGAAGAAG CCGATAACAT CACGTTGTCC AGCTTGGTTA ACTTACTCTT CAGAGAGTGA AAAATTTGAG GTAATCGAAG AGCGTGCAAG GGTAGTTAGA CTCATTTTTG AGCTCTGCGC AAAGACAAGC GGAACATGGT CGATTTCGAG ATACCTTAAT CGTCACAATA TCCCTGTTTT TGGTGATGCT CAATTTTGGC AGAAGTCGTA CGTGAACAAA ATTTTATCGA ACAGAGCGGT GTTGGGGGAA TTTCAGCCTA ATTCACGGGA GAGCGGTCAA CGTATACCAG TAGGAAAAGT GATCGAAAAT TACTATCCGG CCATAATTTC GGCTACTGAG TTCCACTTAG CTTGGGCCGC GCGTGACAGA AGAAATAAGA CTGGCAGCGG CAGAAAGGGG ATAAATTTTT CAAATCTATT CTCCGGTCTG GTTTATTGCG GTAACTGCGG TTCAAAAATG AGCGTTAGAA ATCGGGGTGT ACCGCCAAAA GGTGGTAAAT GGCTTGTTTG CCAAAATCAT TTGGCAGGGG CTGGGTGTGA AATGCGCGAA TGGAAACTCC CTCTAGTAGA AGATGCGATA TTCAAGCATT TATACGAGGT TGATTTCAGC GAATTACTGG GCAACAAATC TGCAGGTTTT GACATCGAGA AAGAGCTATT TGCATTACAG GTAGAGCAAA AAGAGCTTGA GTCAAAAATG GATCAAGCTA TTGAAATGTC AGCGGCTCAA GAGTTGAATG AACAATCTCG ATTCAGGTAT GTGAAGCTGA TCAATCAGCT TGAGAGTGAT ATTAACGAGA AAAAAAAATG TATTTCAAGC AAAGAGAAGG AATTAGAAGT ATTCAAAAGT CAGCAAAGCC TCCTTCATAC AAAAGAATTA AAGGAGACTC TCTTGCTGTT AGAAAAAAAT AAAGATGATT ACTACTTTCG CTCCTCTGTT AATCAGTTGC TGACCAGAAC TATAGCTCGA ATTGATTTGA TTACAGATCA TGGAGGTCCC TTACCTTGGG AAATTGAACC TGATGACAAG ATCGTAGGAA ATTTCCGTGC GTTGTATCCA AAACTCAGCA GTTTATCTAT CGATGAATTA GTTCTTCAGA GGGTTTTCCT AGACTACATA ACACTGATCT GCTAA
|
Protein sequence | MRKAYSYIRM STDKQLSGDS LRRQREASEK YAKAHGLELI ETLDGKDLRD IGVSGFKGAN SRRGVLSVFL EHLNDGKIDA NSILLIESFD RLSRADVLDA FSLFTNILNK GIEIITLSDN QRYTRDSVRV NPGQLFLTIG IMTRANEESV TKSKRGLSVW NNKRDNAQKK PITSRCPAWL TYSSESEKFE VIEERARVVR LIFELCAKTS GTWSISRYLN RHNIPVFGDA QFWQKSYVNK ILSNRAVLGE FQPNSRESGQ RIPVGKVIEN YYPAIISATE FHLAWAARDR RNKTGSGRKG INFSNLFSGL VYCGNCGSKM SVRNRGVPPK GGKWLVCQNH LAGAGCEMRE WKLPLVEDAI FKHLYEVDFS ELLGNKSAGF DIEKELFALQ VEQKELESKM DQAIEMSAAQ ELNEQSRFRY VKLINQLESD INEKKKCISS KEKELEVFKS QQSLLHTKEL KETLLLLEKN KDDYYFRSSV NQLLTRTIAR IDLITDHGGP LPWEIEPDDK IVGNFRALYP KLSSLSIDEL VLQRVFLDYI TLIC