Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_3021 |
Symbol | |
ID | 3705768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3416327 |
End bp | 3419188 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637739495 |
Product | excinuclease ABC subunit A |
Protein accession | YP_344993 |
Protein GI | 77166468 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGGA TTTGCATCCG TGGGGCGCGG ACCCATAATT TGAAGAACAT TGATCTGGAT CTGCCCAGGG AACGGTTAAT CGTTATTACC GGGCTGTCGG GTTCAGGAAA GTCATCTCTG GCTTTTGATA CGCTTTATGC TGAAGGGCAG CGCCGTTATG TGGAGTCTCT CTCGGCCTAT GCTCGCCAGT TTCTCTCCCT TATGGAAAAG CCCGATGTAG ATCATATGGA AGGGCTTTCC CCGGCCATTG CCATCGAGCA AAAATCCACT TCCCATAATC CCCGCTCCAC CGTCGGCACC ATCACCGAAA TCTACGATTA TTTGCGCTTG CTCTATGCTC GCGCCGGCGA GCCCCGTTGC CCGGAGCATG GGATTATCCT GGCGGCCCAG ACGGTGAGTC AAATGGTGGA TCAGGTTTTG GGCCTTCCGG AAGGAGGGCG ATACATGTTG CTGGCGCCGA TAGTGGAAGG GCGCAAGGGA GAGCATCTGC AAGTGCTGGA GAATCTGCTG AGGCGGGGCT TTATCCGGGC CAGGATTGAT GGTGAAGTGG TAGAGCTGGA GCAAGCGCCT CAACTGGATG GTAATAAAAA ACATACCATC GAGGCAGTGG TGGATCGTTT CAAAGTCCGC CCTGATCTCC AATTGCGCCT GGCGGAGTCC TTCGAGACGG CCATACAGTT ATCCGATGGC TTGGCTCGGG TAGCTTCTTT GGAGGAATCG GCCCTAGCGG AGCAAGTGTT TTCCGCCCAC TTGGCTTGCC CCGTTTGTCG CTATTCTTTA AGCGAATTGG AACCCCGGTT ATTTTCCTTT AATAACCCCA AGGGCGCTTG TCCTAGTTGC GAGGGGCTGG GTGTAAAGCC GTTTTTTGAT TCCTCCCGGG TGGTGGCCCA TCCTGAGTTG AGTTTGGCTG CGGGCGCGGT TCGGGGTTGG GATCGGCGCA ATGCTTATTA CTACCAGATG ATTCTTTCCC TGGCCCGGCA CTATGGCTTT GAGGTAGACC TCCCTTTCCA AGATTTGCCC GAAGCAGTAC GCAGGGTAGT ACTTTACGGC AGTGGCCAGG AGAAAATTAC CTTTCGTTAT CTCGATAGCC AAGACAAGCA GACTATCCGC CGCCATGCTT TCGAAGGGGT GATTCCCAAT ATGGAGCGCC ATTACCGGGA GACCGAGACC TCGGCGGTGC GAGAAGAATT GGCTCGCTAT TTAGCCGTGC AGGTTTGCCC GGCATGCCAG GGTACCCGGC TGCGCCAGGA GGCGCGCCAT GTATTCGTGG CGGATTATAG TCTGCCTGAA ATCACCGCCT TGCCTGTAGG GCAGGCGCAG AGCTTGTTTT CCCAGCTATG TTTACCGGGT CGCCGAGGGG CCATTGCAAA CCCAATCCTC AAGGAGATCC AAGACCGGCT AGGCTTCTTG ATCAATGTCG GGTTGGGTTA TTTAACTTTG AATCGGCGCG CGGAGACCCT CTCCGGTGGC GAGGCCCAAC GAATCCGACT GGCCAGCCAG ATTGGAGCCG GTTTGGTGGG AGTGATGTAT ATTCTGGATG AGCCTTCTGT TGGTTTGCAT CAGCGGGATC ATCAGCGGTT GCTGGAGACC TTGATCCGCC TCCGCGATCG GGGCAACACA GTCATTGTGG TGGAGCACGA TGAGGAGGCG ATCCGGGCGG CTGATCAGGT GATTGATATG GGTCCTGGCG CGGGCAGGCA TGGGGGGGAG ATCGTGGCCC AAGGCACGCC GCTAGAGATT ATGGCTAATC CGGCTTCCTT GACTGGGCAG TATCTCAAGG GACAGAAAGA AATCCCCATG CCCTGCCAGC GGGTGCCCTT CAATACTTCC CGCCTGCTTT CTTTACGGGG GGCTCATGGC AATAATCTGG ATCAGGTGGA TTTGGATATT CCCCTAGGAG CGATGACCTG TATCACAGGA GTTTCCGGCT CGGGTAAATC CACCTTAATC AATGACACCC TGTTGCGGGC GGCGACTCGA ATTCTTCATC GAGCCTCGGT TGAGTCTGCC CCTTATGAAA GCATTGAGGG CTTGGAACAT CTGGATAAGG TGATCGCTAT CGATCAAAAT CCCATTGGCC GCACGCCCCG TTCCAACCCG GCCACTTATA CTGGATTTTT TGCTTCTATT CGCAGCCTAT TCGCGGGCAC ACATGAGGCT CGCTCTCGGG GCTATGGGCC GGGTCGTTTC AGTTTTAATG TCAAGGGAGG ACGCTGCGAA TCTTGCCAAG GTGATGGGCT AATCAAGGTA GAGATGCACT TTCTTCCCGA TCTCTATGTG GCCTGCGATG TTTGCCAGGG CAAACGCTAT AATCGGGAAA CCCTGGAGAT TCGCTATAAG GGCAAAAGTA TTGATGAAAT ATTAGCAATG ACGGTGGAAG AAGCGCAGGA TTTCTTTGCC AATGTTCCGG CAGTAGCCCG AAAGCTGCTG ACTTTGCTAG AGGTGGGACT CTCCTATATC ACGCTAGGGC AAAATGCAGT TACCCTCTCG GGCGGCGAGG CCCAGCGGAT CAAACTGGCG AAAGAGCTAG CCAGGCGGGA TACGGGGCGG ACTCTGTACA TTCTAGATGA GCCGACCACG GGGTTGCATT TTCATGATAT TGCCCAGCTT CTCCAAGTAT TATTGCGGTT GCGGGATGGG GGCAATACCA TTGTGATCAT TGAACACCAT TTGGATGTCA TCAAGACGGC TGACTGGATT GTCGATTTGG GTCCCGAGGG AGGGGAAGGG GGAGGAAGGA TTATCGCTAC CGGGACGCCT GAGACGGTGG CTGCCTGCCA AGCTTCCTAT ACTGGACGTT ATCTGGCTCG GATTTTACCC AAGGCCAAGC GAGGCCAATC CCCGGTAGCA GCAAAGCCAT GA
|
Protein sequence | MNRICIRGAR THNLKNIDLD LPRERLIVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHMEGLS PAIAIEQKST SHNPRSTVGT ITEIYDYLRL LYARAGEPRC PEHGIILAAQ TVSQMVDQVL GLPEGGRYML LAPIVEGRKG EHLQVLENLL RRGFIRARID GEVVELEQAP QLDGNKKHTI EAVVDRFKVR PDLQLRLAES FETAIQLSDG LARVASLEES ALAEQVFSAH LACPVCRYSL SELEPRLFSF NNPKGACPSC EGLGVKPFFD SSRVVAHPEL SLAAGAVRGW DRRNAYYYQM ILSLARHYGF EVDLPFQDLP EAVRRVVLYG SGQEKITFRY LDSQDKQTIR RHAFEGVIPN MERHYRETET SAVREELARY LAVQVCPACQ GTRLRQEARH VFVADYSLPE ITALPVGQAQ SLFSQLCLPG RRGAIANPIL KEIQDRLGFL INVGLGYLTL NRRAETLSGG EAQRIRLASQ IGAGLVGVMY ILDEPSVGLH QRDHQRLLET LIRLRDRGNT VIVVEHDEEA IRAADQVIDM GPGAGRHGGE IVAQGTPLEI MANPASLTGQ YLKGQKEIPM PCQRVPFNTS RLLSLRGAHG NNLDQVDLDI PLGAMTCITG VSGSGKSTLI NDTLLRAATR ILHRASVESA PYESIEGLEH LDKVIAIDQN PIGRTPRSNP ATYTGFFASI RSLFAGTHEA RSRGYGPGRF SFNVKGGRCE SCQGDGLIKV EMHFLPDLYV ACDVCQGKRY NRETLEIRYK GKSIDEILAM TVEEAQDFFA NVPAVARKLL TLLEVGLSYI TLGQNAVTLS GGEAQRIKLA KELARRDTGR TLYILDEPTT GLHFHDIAQL LQVLLRLRDG GNTIVIIEHH LDVIKTADWI VDLGPEGGEG GGRIIATGTP ETVAACQASY TGRYLARILP KAKRGQSPVA AKP
|
| |