Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2229 |
Symbol | |
ID | 3784930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2530367 |
End bp | 2533201 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812317 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_412913 |
Protein GI | 82703347 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.748559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTCA TCAAGATCCG CGGTGCGCGC ACGCACAACC TCAAAAACAT CAACCTCGAC TTGCCGCGCA ACAAGCTTGT CGTCATTACG GGCCTGTCCG GCTCGGGCAA GTCTTCGCTG GCGTTCGACA CGCTCTATGC GGAAGGCCAA CGGCGTTATG TGGAATCGCT CTCAGCGTAC GCCCGCCAGT TCCTGCAATT GATGGAAAAA CCGGATGTCG ATCTGATTGA AGGATTATCG CCCGCCATAG CCATAGAGCA GAAGGCGACT TCCCATAACC CGCGTTCGAC TGTCGGCACT GTCACTGAAA TACATGATTA CCTGCGCCTG CTTTTCGCAC GGGTAGGAGA TCCGCAGTGC CCCGATCATG GCATCACGCT GACTGTGCAG AGCGTTTCCC AGATGGTCGA TCATGTGCTG CGACTGCCGC CGGATACCCG GCTGATGATA CTTGCTCCAC TGGTAGTGGG GCGCAAGGGC GAGCAGGTGG AGTTGATCGA TGAACTGCGC GCGCAGGGCT TCATCCGGCT GCGGATCGAC GGGAAAGTAT ACGAAATCGA CGCCCTGCCC AAGCTGCAAA AAAACCAGAA GCATACGATC GAGGTGGTAA TCGACAGGCT CAAGGTTTCA CCGGATTCGA AGCAGCGGTT GGCGGAATCA TTCGAAACCG CCCTGCGCCA CGCCGAGGGG CGCGCACTGG CGGTAGAGAT GGATTCCGGG GCAGAGCATC TTTTTTCAGC CAGGTTCAGT TGCCCGATAT GCAGCTATTC GCTGCCCGAA CTCGAACCCC GTCTGTTTTC GTTCAACAAC CCCATGGGCG CCTGCCCCAA ATGCGATGGC CTGGGCAGAA TCACCTTCTT CGATCCCAAA CGGATCGTCG CGTTTCCGCA TCTGTCACTG GCGGCCGGCG CCATCAAGGG ATGGGACAGG CGCAACCAGT TTTATCATCA ATTACTGGCG AGCCTGGCCA GTCATTACGA TTTTGATCTG GAAATACCTT TCGAGCAATT GAACAGCCAT ATTCAGGATC TCATCCTGAA TGGATCCGGC AATGAAAAAA TCACGTTTTC GTATCTGAAC GAAAACGGAT CGAGAAATTA TCGCAAGCAT ACGTTCGAGG GAATCCTTCC CAACCTGGAA CGGCGCTACA AGGAAACCGA TTCAGTTACC GTACGCGAGG AACTGGCGAA ATACCTGAAT TCCCAGCTCT GCCCTGAATG TGCCGGAACC CGCTTGCGGC GGGAAGCTCG CCATGTGCAT GTAGGCGGCA TGGCAATATA TGAAATCAAC GCTTTGCCGC TGAAAGAAGC AAAGGTTTTT TTCGACCAGG TGACGCTGAC CGGGCATAAG CTTGCCATTG CAGAGAAGAT CATCAAGGAA ATCTCAAGCC GTATATCATT TCTCAACAAT GTCGGACTGG ACTACCTGTC GCTGGATCGC TCCGCCGATA CCTTGTCCGG AGGGGAATCC CAGCGCATCC GGCTTGCCAG CCAGATCGGA TCAGGTCTCA CCGGTGTGAT GTATGTGCTC GATGAGCCTT CCATCGGCCT GCACCAGCGG GACAACAGCC GCCTGCTGAA AACCCTCAAG AATCTGCGCG ATCTCGGCAA CAGCGTGATC GTGGTGGAAC ACGATCAGGA TGCAATACTG ACCGCCGATC ACGTGATAGA CATGGGTCCG GGAGCCGGTG AGCATGGTGG AGCCATTATC GCTCAGGGTA CGCCGGAAGC TATCCAGCGC GACGCCGGTT CCCTTACCGG AAAATACCTT TCGGGCGAAT TGACCATAAG CACGCCCGAA AAACGCACCG AACCGAAAAA TGACCGCTGG CTGCGGATTG AAGGCGCATC CGGAAACAAC CTGAAAAACG TCACCCTGAA CCTGCCCGTG GGATTGTTCG TTTGCGTTAC CGGTGTATCC GGCTCGGGTA AATCCACTCT CATCAACGAA ACGCTCTATC AAGCCGCTGC CCGGCATCTT TATGGCAGCG CAACAGAACC CGCTCCCTAT CAGAATCTGG AAGGACTGGC GTTTTTCGAC AAAGTGATCA GCGTGGACCA AAGCCCCATC GGACGCACGC CACGCTCGAA TCCAGCCACT TATACCGGCT TGTTCACTCC GATCCGGGAG TTATTTGCGG GTGTGCCGCA GGCGCGGGAA CGCGGCTATG GACCGGGGCG GTTTTCGTTC AATGTGAAGG GGGGGCGTTG CGAAGCCTGC CAGGGTGACG GCATGATCAA GGTGGAAATG CACTTTCTCC CCGATATCTA TGTATCGTGC GATGTCTGCC ACGGCAAGCG CTACAACCGC GAAACGCTGG AGATTCAATA CAAGGGAAAA AACATTCATG AAATCCTGCA GATGACGGTA GAGCAGGCGC ATGAATTTTT CAGTCCCGTA CCCGTGGTGG CGCGCAAGCT CCAGACCTTG CTGGATGTCG GCCTGGGTTA CATCAGCCTG GGCCAATCGG CAACGACGCT TTCAGGCGGC GAAGCGCAAC GGGTGAAACT GTCGCTGGAG CTCTCCAAAC GGGATACCGG ACGTACGCTC TACATTCTGG ATGAACCCAC CACCGGGCTT CATTTCCAGG ATATCGACCT CCTGCTCAAA GTGTTGCATC GATTGCGCGA CCATGGCAAT ACAGTGGTCG TGATCGAACA CAACCTGGAT GTGATCAAGA CTGCGGACTG GATTATCGAT CTCGGCCCCG AAGGCGGCGA AGGGGGCGGT GAAATCATCG CACAGGGTCC GCCGGAGGAA ATAGCAGCGA ATGAAAGAAG CTTTACCGGG CATCACCTCA GGGGCATGCT GGAGAATCTT CACGCGACCG CCTGA
|
Protein sequence | MELIKIRGAR THNLKNINLD LPRNKLVVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLQLMEK PDVDLIEGLS PAIAIEQKAT SHNPRSTVGT VTEIHDYLRL LFARVGDPQC PDHGITLTVQ SVSQMVDHVL RLPPDTRLMI LAPLVVGRKG EQVELIDELR AQGFIRLRID GKVYEIDALP KLQKNQKHTI EVVIDRLKVS PDSKQRLAES FETALRHAEG RALAVEMDSG AEHLFSARFS CPICSYSLPE LEPRLFSFNN PMGACPKCDG LGRITFFDPK RIVAFPHLSL AAGAIKGWDR RNQFYHQLLA SLASHYDFDL EIPFEQLNSH IQDLILNGSG NEKITFSYLN ENGSRNYRKH TFEGILPNLE RRYKETDSVT VREELAKYLN SQLCPECAGT RLRREARHVH VGGMAIYEIN ALPLKEAKVF FDQVTLTGHK LAIAEKIIKE ISSRISFLNN VGLDYLSLDR SADTLSGGES QRIRLASQIG SGLTGVMYVL DEPSIGLHQR DNSRLLKTLK NLRDLGNSVI VVEHDQDAIL TADHVIDMGP GAGEHGGAII AQGTPEAIQR DAGSLTGKYL SGELTISTPE KRTEPKNDRW LRIEGASGNN LKNVTLNLPV GLFVCVTGVS GSGKSTLINE TLYQAAARHL YGSATEPAPY QNLEGLAFFD KVISVDQSPI GRTPRSNPAT YTGLFTPIRE LFAGVPQARE RGYGPGRFSF NVKGGRCEAC QGDGMIKVEM HFLPDIYVSC DVCHGKRYNR ETLEIQYKGK NIHEILQMTV EQAHEFFSPV PVVARKLQTL LDVGLGYISL GQSATTLSGG EAQRVKLSLE LSKRDTGRTL YILDEPTTGL HFQDIDLLLK VLHRLRDHGN TVVVIEHNLD VIKTADWIID LGPEGGEGGG EIIAQGPPEE IAANERSFTG HHLRGMLENL HATA
|
| |