Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0203 |
Symbol | |
ID | 3785876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 214241 |
End bp | 215470 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637810274 |
Product | nuclease SbcCD, D subunit |
Protein accession | YP_410903 |
Protein GI | 82701337 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00619] exonuclease SbcD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.738096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCT TCCATACCTC CGACTGGCAC ATCGGCCGGA CCCTATATGG CAGAAAGCGG TACGAGGAAT TCGAGGCATT TCTGACGTGG CTGGCAGAAA CGATTCAGAG GAACGAAATC GATGTCCTCC TGGTAGCCGG TGATATCTTT GATACCAGTG CTCCAAGCAA TCGTGCCCAG GAGCTCTACT ACCGCTTTCT CTGGAAGATA GCAGCTTCAC CCTGCCGCCA TGTGGTCATC GTCGCGGGTA ATCACGATTC CCCTTCATTT CTGAATGCAC CCAGGGCGCT GCTCAAAGCC CTTGATGTCC ATGTGATCGG TAGCCCCTCA GCATCCCTTG AAGATGAAGT ACTGGTACTC CGAAATGAGC AAGGGGTGCC CGAGCTAATT GTTTGCGCCG TGCCGTATCT TCGTGACAAA GACATCCGCG TGGCGGAAGC CGGCGAGAGT GTCGAGGATA AGGAACGGAA GCTGCTCGTC GGTATTCGCG ATCACTACGC TGCCATAGCC GCCCTGGCTG AACAGAAGCG TGTGGAACTC GGGGTGGATA TTCCCATCGT TGCCACGGGC CACCTTTTCG CCGCCGGAGG GCAAACTATC GAGGGTGACG GCGTGCGGGA CCTGTATGTC GGTTCGCTGG CTCAGGTGAG TGCGGGGATT TTTCCTGAAT GCTTCAACTA CCTTGCGCTG GGCCACCTCC ATGTCCCGCA GAAGGTAAAC GGTTCCGAGA TCATGCGATA CAGTGGCTCT CCTCTCCCCA TGGGGTTCGG GGAGGCCAGA CAACAGAAGA GCGTTTGCCA GGTTGAATTC CACAGTACCG CCGCGTCTGT TCAATTGATC GATGTGCCGG TGTTTCAGAA GCTCGAACGT ATCAAGGGAG ACTGGGACAG CATCTCCACT CAGATTCGTA AACTGTCAGA GACCGGCTCT CAAGCATGGC TCGAAGTCAT CTACGAAGGT GAAGAAGTAA TCGGCGACCT GCGCGAGCGC CTGGAAATCG CCATTACCGA CACCCAAATG GAAATTCTGC GGGTGAAGAA CAACCCTGTT ATCGATCGCG TCCTTGGAAA AATCCATGAA GAAGAAACAC TGGACGATCT CAGTGTGAAT GACGTATTCG AGCGATGCCT GACCGTACAT GAAGTGCCTG AAGACCAGCG GCCAGAATTA CTTCGTGCTT ATCAGGAAGC GCTTTCGTCT CTCTATGAGG ACGACATGCA AGCGGAATAG
|
Protein sequence | MKLFHTSDWH IGRTLYGRKR YEEFEAFLTW LAETIQRNEI DVLLVAGDIF DTSAPSNRAQ ELYYRFLWKI AASPCRHVVI VAGNHDSPSF LNAPRALLKA LDVHVIGSPS ASLEDEVLVL RNEQGVPELI VCAVPYLRDK DIRVAEAGES VEDKERKLLV GIRDHYAAIA ALAEQKRVEL GVDIPIVATG HLFAAGGQTI EGDGVRDLYV GSLAQVSAGI FPECFNYLAL GHLHVPQKVN GSEIMRYSGS PLPMGFGEAR QQKSVCQVEF HSTAASVQLI DVPVFQKLER IKGDWDSIST QIRKLSETGS QAWLEVIYEG EEVIGDLRER LEIAITDTQM EILRVKNNPV IDRVLGKIHE EETLDDLSVN DVFERCLTVH EVPEDQRPEL LRAYQEALSS LYEDDMQAE
|
| |