Gene Nmul_A2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2229 
Symbol 
ID3784930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2530367 
End bp2533201 
Gene Length2835 bp 
Protein Length944 aa 
Translation table11 
GC content56% 
IMG OID637812317 
Productexcinuclease ABC, A subunit 
Protein accessionYP_412913 
Protein GI82703347 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.748559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTCA TCAAGATCCG CGGTGCGCGC ACGCACAACC TCAAAAACAT CAACCTCGAC 
TTGCCGCGCA ACAAGCTTGT CGTCATTACG GGCCTGTCCG GCTCGGGCAA GTCTTCGCTG
GCGTTCGACA CGCTCTATGC GGAAGGCCAA CGGCGTTATG TGGAATCGCT CTCAGCGTAC
GCCCGCCAGT TCCTGCAATT GATGGAAAAA CCGGATGTCG ATCTGATTGA AGGATTATCG
CCCGCCATAG CCATAGAGCA GAAGGCGACT TCCCATAACC CGCGTTCGAC TGTCGGCACT
GTCACTGAAA TACATGATTA CCTGCGCCTG CTTTTCGCAC GGGTAGGAGA TCCGCAGTGC
CCCGATCATG GCATCACGCT GACTGTGCAG AGCGTTTCCC AGATGGTCGA TCATGTGCTG
CGACTGCCGC CGGATACCCG GCTGATGATA CTTGCTCCAC TGGTAGTGGG GCGCAAGGGC
GAGCAGGTGG AGTTGATCGA TGAACTGCGC GCGCAGGGCT TCATCCGGCT GCGGATCGAC
GGGAAAGTAT ACGAAATCGA CGCCCTGCCC AAGCTGCAAA AAAACCAGAA GCATACGATC
GAGGTGGTAA TCGACAGGCT CAAGGTTTCA CCGGATTCGA AGCAGCGGTT GGCGGAATCA
TTCGAAACCG CCCTGCGCCA CGCCGAGGGG CGCGCACTGG CGGTAGAGAT GGATTCCGGG
GCAGAGCATC TTTTTTCAGC CAGGTTCAGT TGCCCGATAT GCAGCTATTC GCTGCCCGAA
CTCGAACCCC GTCTGTTTTC GTTCAACAAC CCCATGGGCG CCTGCCCCAA ATGCGATGGC
CTGGGCAGAA TCACCTTCTT CGATCCCAAA CGGATCGTCG CGTTTCCGCA TCTGTCACTG
GCGGCCGGCG CCATCAAGGG ATGGGACAGG CGCAACCAGT TTTATCATCA ATTACTGGCG
AGCCTGGCCA GTCATTACGA TTTTGATCTG GAAATACCTT TCGAGCAATT GAACAGCCAT
ATTCAGGATC TCATCCTGAA TGGATCCGGC AATGAAAAAA TCACGTTTTC GTATCTGAAC
GAAAACGGAT CGAGAAATTA TCGCAAGCAT ACGTTCGAGG GAATCCTTCC CAACCTGGAA
CGGCGCTACA AGGAAACCGA TTCAGTTACC GTACGCGAGG AACTGGCGAA ATACCTGAAT
TCCCAGCTCT GCCCTGAATG TGCCGGAACC CGCTTGCGGC GGGAAGCTCG CCATGTGCAT
GTAGGCGGCA TGGCAATATA TGAAATCAAC GCTTTGCCGC TGAAAGAAGC AAAGGTTTTT
TTCGACCAGG TGACGCTGAC CGGGCATAAG CTTGCCATTG CAGAGAAGAT CATCAAGGAA
ATCTCAAGCC GTATATCATT TCTCAACAAT GTCGGACTGG ACTACCTGTC GCTGGATCGC
TCCGCCGATA CCTTGTCCGG AGGGGAATCC CAGCGCATCC GGCTTGCCAG CCAGATCGGA
TCAGGTCTCA CCGGTGTGAT GTATGTGCTC GATGAGCCTT CCATCGGCCT GCACCAGCGG
GACAACAGCC GCCTGCTGAA AACCCTCAAG AATCTGCGCG ATCTCGGCAA CAGCGTGATC
GTGGTGGAAC ACGATCAGGA TGCAATACTG ACCGCCGATC ACGTGATAGA CATGGGTCCG
GGAGCCGGTG AGCATGGTGG AGCCATTATC GCTCAGGGTA CGCCGGAAGC TATCCAGCGC
GACGCCGGTT CCCTTACCGG AAAATACCTT TCGGGCGAAT TGACCATAAG CACGCCCGAA
AAACGCACCG AACCGAAAAA TGACCGCTGG CTGCGGATTG AAGGCGCATC CGGAAACAAC
CTGAAAAACG TCACCCTGAA CCTGCCCGTG GGATTGTTCG TTTGCGTTAC CGGTGTATCC
GGCTCGGGTA AATCCACTCT CATCAACGAA ACGCTCTATC AAGCCGCTGC CCGGCATCTT
TATGGCAGCG CAACAGAACC CGCTCCCTAT CAGAATCTGG AAGGACTGGC GTTTTTCGAC
AAAGTGATCA GCGTGGACCA AAGCCCCATC GGACGCACGC CACGCTCGAA TCCAGCCACT
TATACCGGCT TGTTCACTCC GATCCGGGAG TTATTTGCGG GTGTGCCGCA GGCGCGGGAA
CGCGGCTATG GACCGGGGCG GTTTTCGTTC AATGTGAAGG GGGGGCGTTG CGAAGCCTGC
CAGGGTGACG GCATGATCAA GGTGGAAATG CACTTTCTCC CCGATATCTA TGTATCGTGC
GATGTCTGCC ACGGCAAGCG CTACAACCGC GAAACGCTGG AGATTCAATA CAAGGGAAAA
AACATTCATG AAATCCTGCA GATGACGGTA GAGCAGGCGC ATGAATTTTT CAGTCCCGTA
CCCGTGGTGG CGCGCAAGCT CCAGACCTTG CTGGATGTCG GCCTGGGTTA CATCAGCCTG
GGCCAATCGG CAACGACGCT TTCAGGCGGC GAAGCGCAAC GGGTGAAACT GTCGCTGGAG
CTCTCCAAAC GGGATACCGG ACGTACGCTC TACATTCTGG ATGAACCCAC CACCGGGCTT
CATTTCCAGG ATATCGACCT CCTGCTCAAA GTGTTGCATC GATTGCGCGA CCATGGCAAT
ACAGTGGTCG TGATCGAACA CAACCTGGAT GTGATCAAGA CTGCGGACTG GATTATCGAT
CTCGGCCCCG AAGGCGGCGA AGGGGGCGGT GAAATCATCG CACAGGGTCC GCCGGAGGAA
ATAGCAGCGA ATGAAAGAAG CTTTACCGGG CATCACCTCA GGGGCATGCT GGAGAATCTT
CACGCGACCG CCTGA
 
Protein sequence
MELIKIRGAR THNLKNINLD LPRNKLVVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLQLMEK PDVDLIEGLS PAIAIEQKAT SHNPRSTVGT VTEIHDYLRL LFARVGDPQC
PDHGITLTVQ SVSQMVDHVL RLPPDTRLMI LAPLVVGRKG EQVELIDELR AQGFIRLRID
GKVYEIDALP KLQKNQKHTI EVVIDRLKVS PDSKQRLAES FETALRHAEG RALAVEMDSG
AEHLFSARFS CPICSYSLPE LEPRLFSFNN PMGACPKCDG LGRITFFDPK RIVAFPHLSL
AAGAIKGWDR RNQFYHQLLA SLASHYDFDL EIPFEQLNSH IQDLILNGSG NEKITFSYLN
ENGSRNYRKH TFEGILPNLE RRYKETDSVT VREELAKYLN SQLCPECAGT RLRREARHVH
VGGMAIYEIN ALPLKEAKVF FDQVTLTGHK LAIAEKIIKE ISSRISFLNN VGLDYLSLDR
SADTLSGGES QRIRLASQIG SGLTGVMYVL DEPSIGLHQR DNSRLLKTLK NLRDLGNSVI
VVEHDQDAIL TADHVIDMGP GAGEHGGAII AQGTPEAIQR DAGSLTGKYL SGELTISTPE
KRTEPKNDRW LRIEGASGNN LKNVTLNLPV GLFVCVTGVS GSGKSTLINE TLYQAAARHL
YGSATEPAPY QNLEGLAFFD KVISVDQSPI GRTPRSNPAT YTGLFTPIRE LFAGVPQARE
RGYGPGRFSF NVKGGRCEAC QGDGMIKVEM HFLPDIYVSC DVCHGKRYNR ETLEIQYKGK
NIHEILQMTV EQAHEFFSPV PVVARKLQTL LDVGLGYISL GQSATTLSGG EAQRVKLSLE
LSKRDTGRTL YILDEPTTGL HFQDIDLLLK VLHRLRDHGN TVVVIEHNLD VIKTADWIID
LGPEGGEGGG EIIAQGPPEE IAANERSFTG HHLRGMLENL HATA