Gene Nmul_A2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2002 
Symbol 
ID3784493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2300907 
End bp2302298 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content58% 
IMG OID637812091 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_412689 
Protein GI82703123 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex
[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.891146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTTCCTT TTCCCTACGT CATCCTGGAC CTGGAAACCA CGGGCGGCAC GCCCCTGCAT 
GACCGCATCA TCGAGATTGC GCTCATTCGT TTCGAGGAAG GAATGGAAAG CGAGCGTTGG
GAAACGCTCG TTAATCCAGG CATATCCATT CCGCCTTTCA TCACGCATCT GACCGGTATC
AGCAACGGGA TGGTAAAGGA TGCGCCTTCC TTCGGGGATA TCGCCCACCG GCTCTACGGT
TTTCTTGATG GAGCGGTGCT GGCAGCGCAT AACGTCCGCT TCGACTACGG ATTCCTGAAG
AACGAGTACC GGCGCATGGG CGCGCTGCTC CAGCACAGGG TATTGTGCAC GGCCCGGCTT
TCGCGCAAGC TCTATCCTCA GCACAAGGGT CATGGACTCG ATGCCATCAT GCAGCGTCAT
GGGCTAAAGA CCGAGATGCG CCACCGGGCG ATGGGGGACG TGGAGCTCGT CGCAGCCTAT
CTTGAGATGG CAAGGCGCGA GCTGGGTGCC CGGGAGGTAC AAGAAGCGGC AGCCATCCTG
CTGAAAGACC CAAGCCTGCC CGCAGGCCTG GATGCTTCGA TTCTGGACCA GATTCCGGAC
AGGCCGGGTG TCTATTTCTT CTACGGCAAA AACGGCCTTC CCCTCTATAT CGGCAAGAGC
GTGACGCTGC GCTCCCGGGT CATGTCCCAT TTCAGCGGCG ACCATGCTTC GTTCAGCGAC
ATGCGCATTG CCCAGGAGGT CGAGCGGGTC GAATGGATGG AAACAGCCGG GGAACTGGGC
GCACTGCTGC TGGAGTCGAG GTTAATCAAG GAGCATCATC CCATCCACAA CAAGCGATTG
CGCCGCTCCC GCACGCTCTT TTCCCTGAAG CTGGGCGACG ACTCGTACGA GGCTCCCCTG
GTGAATATCG TGACGGAAGA GGATATCCAC CCTGAGGTGT TCGGCGATCT GTACGGGCTC
TTCCGCTCGA AAACAAAAGC GGTTGATGCA CTGCGCGAGG TTGTCCGGGA GAACAGGTTG
TGTCCCCGGG TGGTAGGCCT TGAGAATGGG AAGGGCGCGT GTTTCGCACA CCAGTTGAAA
CGCTGTAACG GCGTCTGCGC GGGCAAGGAA GTGCCGCAAC TGCATTACCT GCGCCTGAAA
CAGGCGCTGC TCCCCCTTAA GCTCAAATCA TGGCCCTATC CCGGCAGGAT CGGCATACGG
GAATACAATG CGTCGTCCGG CCGATCGGAA GTGCATGTCT TCCACTACTG GTGCCATCTG
GGAACGGTGG ACAACGAAGC CGGCCTGGAC GATGTGCTGG GGACGCGCTC ATCCATGAAG
TTTGATCTCG ATACCTACAA GCTGCTCCTA AAGACCCTGG GAAAGCAAAC AGAGGTGATT
ACGTTTGGAT AG
 
Protein sequence
MLPFPYVILD LETTGGTPLH DRIIEIALIR FEEGMESERW ETLVNPGISI PPFITHLTGI 
SNGMVKDAPS FGDIAHRLYG FLDGAVLAAH NVRFDYGFLK NEYRRMGALL QHRVLCTARL
SRKLYPQHKG HGLDAIMQRH GLKTEMRHRA MGDVELVAAY LEMARRELGA REVQEAAAIL
LKDPSLPAGL DASILDQIPD RPGVYFFYGK NGLPLYIGKS VTLRSRVMSH FSGDHASFSD
MRIAQEVERV EWMETAGELG ALLLESRLIK EHHPIHNKRL RRSRTLFSLK LGDDSYEAPL
VNIVTEEDIH PEVFGDLYGL FRSKTKAVDA LREVVRENRL CPRVVGLENG KGACFAHQLK
RCNGVCAGKE VPQLHYLRLK QALLPLKLKS WPYPGRIGIR EYNASSGRSE VHVFHYWCHL
GTVDNEAGLD DVLGTRSSMK FDLDTYKLLL KTLGKQTEVI TFG