Gene Nmul_A2421 details


Gene Information

Locus tag: Nmul_A2421
Symbol:
ID: 3785513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2758061
End bp: 2759752
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 57%
IMG OID: 637812510
Product: DNA repair protein RecN
Protein accession: YP_413102
Protein GI: 82703536
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
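
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2758061, 2759752)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the result agrees with the reported 1692 bp and 563 aa.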


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.955613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAAGT TTCTTGGTAT TCGCGATTTT GTCATCGTTG ATAGGATAGA CCTGGAATTT 
GCTCCCGGTT TTACCGTCCT TACCGGGGAA ACCGGCGCCG GCAAATCCAT TCTGATTGAT
GCACTCTCAC TCGTGCTGGG GGAGAGGAGC GATGCGAGTG CGGTTCGTAA CGGCTGCGAG
CGGGCGGAAA TCAGCGCGGG ATTCGAGGTA GCGGACTTGC CCGAGGTGAT TGCCTGGTTG
CATGAAAACG GATTCGAAAA CACGGAAGAA GAGGGTGTCT GCCTGCTGCG CAGGTTGGTG
GACGCAGGTG GGCGCTCGCG CAGTTTCATC AATGGAAACT CCGCTACTTT GCAACAATTG
CGGGCGATCG GGGAAAATCT CGTAGACATT CAGGGGCAGC ACGTTCATCA GTCCATGTTG
CGCAGTGAAG TGCAGCGTGA GTTGCTCGAT TCCTATGCCG GAAGCAAGCC GTTGGCGAGG
CAAGTGGCGG AAGCGTACCG TCGGTGGCAG ACAGCGCGGC AGCAGCGCGA GGCATGGGAG
CAGAATGCAG CTGCTTTCGT GCGTGAGCGT GAAGAACTGG AGTGGCAGGT CAATGAGCTT
TCGACCCTGA ATTTCTCGAA TGATGAATGG GAAAGCCTGC AGGCCGAACA CAGCCGCTTG
TCCAATGCCG CCAGTCTGCT GGCGACGGCG CAGGCAGGGC TCGAAGCGCT GTCCGAGGGT
GAATTTGCAG TCCTCTCGCA GATCAACTCG GTGATTTCCC GGTTGCATCA AATGGTGGAT
TACGACCAGA CCCTGAAAGC GGTTCTCGAT TTGCTTGAAC CGGGTCAGAT CCAGTTGCAG
GAAGCGGTGT ATGAGCTGGG CCGCTACCAG CAGCGGCTGG AACTGGATCC GCAGCGCTTG
TCGGAGGTCG AGGAACGGTT GGCGGAGATT CATACCGCAG CACGGAAATA TCGGGTGACG
CCGGAAGAAC TGCCTGAACT GTTGAAAGGC TTTACCGAAC GGCTGCAAGC ATTGGGGCAT
AAGGGAGATG GGGAATCCCT GGCAAGGGAA GAAGCCGTTG CGAGCAGTGA ATATAGTGAA
CTTGCAAAAA AATTGACTAT CGAGAGGTCA GGCGCTGCCA GTACCTTGAG TCAACAGGTT
ACGGCCGCAA TGCAAACGCT GGCGATGGCC GGGGGTGAAT TTTCCGCGGC ATTGCTGCCT
CTCGAGCAGG GAACGTTACA CGGCCTGGAA CAGATCGAGT TTCAGGTTGC GGCCCACAAG
ACCCTGCCAT TGCGTCCCCT CGCAAAGGTC GCATCAGGGG GGGAATTATC GCGTATCGGT
CTGGCGATCA GCGTGATTAC TGCCAAGCTC GGTACGGTTC CCACCTTGAT CTTTGACGAG
GTGGATGTAG GGATCGGCGG GCGGGTGGCA GAAATCGTCG GCACCCTGCT GAAAAGGCTG
GGCCGGGAGC GGCAGGTGTT GTGCATCACC CACTTGGCGC AAGTGGCATC CGCCGGAGAT
CAGCAATGGC GGGTGAGCAA ATCAGCTGAT TCTGCAAGTG GAGGAAAAGT GACAAGCCGT
ATCGTTGCGC TCGACAGACA GGAGCGGGTC GAGGAAATTG CGCGCATGCT CGGGGGGACG
AAAATTACAG ATACGACTCG TAGCCATGCT GCGGAGATGC TGCTTTCGGC CGGGGAAGCA
GACGGTCACT GA
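
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before it can be measured. The following is a minimal sketch of how the blocks might be cleaned up and the GC fraction computed, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks are used as a demo.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small demo.
demo = clean_sequence("ATGTTGAAGT TTCTTGGTAT")
print(len(demo), round(gc_fraction(demo), 2))
```

Pasting the full blocked sequence into `clean_sequence` in place of the demo string gives the value to compare against the reported percentage.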
 
Protein sequence
MLKFLGIRDF VIVDRIDLEF APGFTVLTGE TGAGKSILID ALSLVLGERS DASAVRNGCE 
RAEISAGFEV ADLPEVIAWL HENGFENTEE EGVCLLRRLV DAGGRSRSFI NGNSATLQQL
RAIGENLVDI QGQHVHQSML RSEVQRELLD SYAGSKPLAR QVAEAYRRWQ TARQQREAWE
QNAAAFVRER EELEWQVNEL STLNFSNDEW ESLQAEHSRL SNAASLLATA QAGLEALSEG
EFAVLSQINS VISRLHQMVD YDQTLKAVLD LLEPGQIQLQ EAVYELGRYQ QRLELDPQRL
SEVEERLAEI HTAARKYRVT PEELPELLKG FTERLQALGH KGDGESLARE EAVASSEYSE
LAKKLTIERS GAASTLSQQV TAAMQTLAMA GGEFSAALLP LEQGTLHGLE QIEFQVAAHK
TLPLRPLAKV ASGGELSRIG LAISVITAKL GTVPTLIFDE VDVGIGGRVA EIVGTLLKRL
GRERQVLCIT HLAQVASAGD QQWRVSKSAD SASGGKVTSR IVALDRQERV EEIARMLGGT
KITDTTRSHA AEMLLSAGEA DGH
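
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual NCBI ordering (bases T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene starts with ATG, no special start handling is included here, which is a simplifying assumption. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw: str) -> str:
    """Translate a (possibly blocked) DNA sequence up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last two blocks of the gene sequence.
print(translate("ATGTTGAAGT TTCTTGGTAT TCGCGATTTT"))
print(translate("GACGGTCACT GA"))
```

The two demo calls correspond to the first ten residues (MLKFLGIRDF) and the last three residues (DGH) of the protein sequence listed above.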