Gene Nmul_A1386 details


Gene Information

Locus tag: Nmul_A1386
Symbol:
ID: 3784481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1578235
End bp: 1579341
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 55%
IMG OID: 637811474
Product: DNA polymerase III, delta prime subunit
Protein accession: YP_412081
Protein GI: 82702515
COG category: [L] Replication, recombination and repair
COG ID: [COG0470] ATPase involved in DNA replication
TIGRFAM ID: [TIGR00678] DNA polymerase III, delta' subunit
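
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1578235, 1579341)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Under these assumptions the computed values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields listed in the record.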


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.128473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATA TTTATGGATG GCAGGAAGAA GTCTGGCGGA AACTTACGGG CGCGTTGGGG 
CACGCCCTGC TGTTACGAGG CAGAAAGGGA CTGGGCAAGC TTGCATTTGC CCGCTATCTG
GCGAAGTCCA GACTCTGTGA AAATCGCTCC GTCGAAGGAA AGGCGTGCGA GGTCTGTGCG
AGCTGCCACT GGTTTGAGCA AGGCAATCAC CCCGATTTCT GCCTTGTCGA GCCCGAAGCG
GCCACGGCAA CGTCTGTCTC GGGGGAAGGA GCCGGGGAGG AGGGTGTGGA AACCGGAGAT
GAGGCAGAAG TTCAATCTCT TCCCCGGGCA GTGAATCAGC TCAGCGGCAG TGGCAAGTCC
ACAAAAAAAC CCAGCAGACA GATAAGTATC TCGCAAATAC GGGAGCTAGG CGACTTCGTC
AATATCACCA GCCATCAGAA CGGCTATAAG ATCATCCTGA TCCATCCGGC GGAAACCATG
AGTACGGCTG CTGCCAGTGC CCTTCTGAAG AATCTGGAAG AACCGCCGCT CCAAACGTTG
TTCATACTGA TAACGCATCA GGCACAGTAT TTATTGCCGA CGATCCGCAG CCGTTGCCGC
CAGATCATCA TGCCCGCCCC CGATGCAGCC TCCGCAGCAC TGTGGTTAAA ACAGCAGGGT
GTCAAAGCCC CCGAAAGATG CCTGGCCTCG GCTGGCTATG CTCCCCTGAC CGCACTGGAA
TTCGCGAATG AAGATTATCT TGTGCGGCAC AGCGCTTTCA TCCAGCAAAT CAGCACTCCA
TCAGGTTTCG ACGTGCTGGC ACGGGCAGAG GAAATGCAGA AATCGGACCT TGTCATGGTG
GTCAGCTGGC TGCAAAAGTG GTGCTACGAT CTGATGAGTT TTCGTATGGC GCAGAAGGTC
CGCTATCATC CGGACATGCT CGCGCAAATA AAACCCCTGG CATCCGGGCT TGATCCGTAT
TCAATGGCAA CTTATTTGCG CGCCCTGGAT AAAACGCAGC AGCTTGCCCG CCATCCGCTC
AATCCAAGAT TATTTCTGGA AGAACTGCTG TTTTCCTATG TGACGATGTT ATCCGAGAAA
TCCAGGAATC GGAGCAAGGC CGGCTGA
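
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the GC definition used here (G plus C over total length) is an assumption; the database may compute its GC content field differently. Only the first and last rows of the listing are used as a small demonstration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence listing.
first_row = "ATGAGTAATA TTTATGGATG GCAGGAAGAA GTCTGGCGGA AACTTACGGG CGCGTTGGGG"
last_row = "TCCAGGAATC GGAGCAAGGC CGGCTGA"

print(len(clean_sequence(first_row)))             # 60
print(clean_sequence(first_row).startswith("ATG"))  # True
print(clean_sequence(last_row).endswith("TGA"))     # True
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content field of the record.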
 
Protein sequence
MSNIYGWQEE VWRKLTGALG HALLLRGRKG LGKLAFARYL AKSRLCENRS VEGKACEVCA 
SCHWFEQGNH PDFCLVEPEA ATATSVSGEG AGEEGVETGD EAEVQSLPRA VNQLSGSGKS
TKKPSRQISI SQIRELGDFV NITSHQNGYK IILIHPAETM STAAASALLK NLEEPPLQTL
FILITHQAQY LLPTIRSRCR QIIMPAPDAA SAALWLKQQG VKAPERCLAS AGYAPLTALE
FANEDYLVRH SAFIQQISTP SGFDVLARAE EMQKSDLVMV VSWLQKWCYD LMSFRMAQKV
RYHPDMLAQI KPLASGLDPY SMATYLRALD KTQQLARHPL NPRLFLEELL FSYVTMLSEK
SRNRSKAG