Gene Nmul_A0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0203 
Symbol 
ID3785876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp214241 
End bp215470 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content54% 
IMG OID637810274 
Productnuclease SbcCD, D subunit 
Protein accessionYP_410903 
Protein GI82701337 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.738096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCT TCCATACCTC CGACTGGCAC ATCGGCCGGA CCCTATATGG CAGAAAGCGG 
TACGAGGAAT TCGAGGCATT TCTGACGTGG CTGGCAGAAA CGATTCAGAG GAACGAAATC
GATGTCCTCC TGGTAGCCGG TGATATCTTT GATACCAGTG CTCCAAGCAA TCGTGCCCAG
GAGCTCTACT ACCGCTTTCT CTGGAAGATA GCAGCTTCAC CCTGCCGCCA TGTGGTCATC
GTCGCGGGTA ATCACGATTC CCCTTCATTT CTGAATGCAC CCAGGGCGCT GCTCAAAGCC
CTTGATGTCC ATGTGATCGG TAGCCCCTCA GCATCCCTTG AAGATGAAGT ACTGGTACTC
CGAAATGAGC AAGGGGTGCC CGAGCTAATT GTTTGCGCCG TGCCGTATCT TCGTGACAAA
GACATCCGCG TGGCGGAAGC CGGCGAGAGT GTCGAGGATA AGGAACGGAA GCTGCTCGTC
GGTATTCGCG ATCACTACGC TGCCATAGCC GCCCTGGCTG AACAGAAGCG TGTGGAACTC
GGGGTGGATA TTCCCATCGT TGCCACGGGC CACCTTTTCG CCGCCGGAGG GCAAACTATC
GAGGGTGACG GCGTGCGGGA CCTGTATGTC GGTTCGCTGG CTCAGGTGAG TGCGGGGATT
TTTCCTGAAT GCTTCAACTA CCTTGCGCTG GGCCACCTCC ATGTCCCGCA GAAGGTAAAC
GGTTCCGAGA TCATGCGATA CAGTGGCTCT CCTCTCCCCA TGGGGTTCGG GGAGGCCAGA
CAACAGAAGA GCGTTTGCCA GGTTGAATTC CACAGTACCG CCGCGTCTGT TCAATTGATC
GATGTGCCGG TGTTTCAGAA GCTCGAACGT ATCAAGGGAG ACTGGGACAG CATCTCCACT
CAGATTCGTA AACTGTCAGA GACCGGCTCT CAAGCATGGC TCGAAGTCAT CTACGAAGGT
GAAGAAGTAA TCGGCGACCT GCGCGAGCGC CTGGAAATCG CCATTACCGA CACCCAAATG
GAAATTCTGC GGGTGAAGAA CAACCCTGTT ATCGATCGCG TCCTTGGAAA AATCCATGAA
GAAGAAACAC TGGACGATCT CAGTGTGAAT GACGTATTCG AGCGATGCCT GACCGTACAT
GAAGTGCCTG AAGACCAGCG GCCAGAATTA CTTCGTGCTT ATCAGGAAGC GCTTTCGTCT
CTCTATGAGG ACGACATGCA AGCGGAATAG
 
Protein sequence
MKLFHTSDWH IGRTLYGRKR YEEFEAFLTW LAETIQRNEI DVLLVAGDIF DTSAPSNRAQ 
ELYYRFLWKI AASPCRHVVI VAGNHDSPSF LNAPRALLKA LDVHVIGSPS ASLEDEVLVL
RNEQGVPELI VCAVPYLRDK DIRVAEAGES VEDKERKLLV GIRDHYAAIA ALAEQKRVEL
GVDIPIVATG HLFAAGGQTI EGDGVRDLYV GSLAQVSAGI FPECFNYLAL GHLHVPQKVN
GSEIMRYSGS PLPMGFGEAR QQKSVCQVEF HSTAASVQLI DVPVFQKLER IKGDWDSIST
QIRKLSETGS QAWLEVIYEG EEVIGDLRER LEIAITDTQM EILRVKNNPV IDRVLGKIHE
EETLDDLSVN DVFERCLTVH EVPEDQRPEL LRAYQEALSS LYEDDMQAE