Gene Noc_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3021 
Symbol 
ID3705768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3416327 
End bp3419188 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content55% 
IMG OID637739495 
Productexcinuclease ABC subunit A 
Protein accessionYP_344993 
Protein GI77166468 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGGA TTTGCATCCG TGGGGCGCGG ACCCATAATT TGAAGAACAT TGATCTGGAT 
CTGCCCAGGG AACGGTTAAT CGTTATTACC GGGCTGTCGG GTTCAGGAAA GTCATCTCTG
GCTTTTGATA CGCTTTATGC TGAAGGGCAG CGCCGTTATG TGGAGTCTCT CTCGGCCTAT
GCTCGCCAGT TTCTCTCCCT TATGGAAAAG CCCGATGTAG ATCATATGGA AGGGCTTTCC
CCGGCCATTG CCATCGAGCA AAAATCCACT TCCCATAATC CCCGCTCCAC CGTCGGCACC
ATCACCGAAA TCTACGATTA TTTGCGCTTG CTCTATGCTC GCGCCGGCGA GCCCCGTTGC
CCGGAGCATG GGATTATCCT GGCGGCCCAG ACGGTGAGTC AAATGGTGGA TCAGGTTTTG
GGCCTTCCGG AAGGAGGGCG ATACATGTTG CTGGCGCCGA TAGTGGAAGG GCGCAAGGGA
GAGCATCTGC AAGTGCTGGA GAATCTGCTG AGGCGGGGCT TTATCCGGGC CAGGATTGAT
GGTGAAGTGG TAGAGCTGGA GCAAGCGCCT CAACTGGATG GTAATAAAAA ACATACCATC
GAGGCAGTGG TGGATCGTTT CAAAGTCCGC CCTGATCTCC AATTGCGCCT GGCGGAGTCC
TTCGAGACGG CCATACAGTT ATCCGATGGC TTGGCTCGGG TAGCTTCTTT GGAGGAATCG
GCCCTAGCGG AGCAAGTGTT TTCCGCCCAC TTGGCTTGCC CCGTTTGTCG CTATTCTTTA
AGCGAATTGG AACCCCGGTT ATTTTCCTTT AATAACCCCA AGGGCGCTTG TCCTAGTTGC
GAGGGGCTGG GTGTAAAGCC GTTTTTTGAT TCCTCCCGGG TGGTGGCCCA TCCTGAGTTG
AGTTTGGCTG CGGGCGCGGT TCGGGGTTGG GATCGGCGCA ATGCTTATTA CTACCAGATG
ATTCTTTCCC TGGCCCGGCA CTATGGCTTT GAGGTAGACC TCCCTTTCCA AGATTTGCCC
GAAGCAGTAC GCAGGGTAGT ACTTTACGGC AGTGGCCAGG AGAAAATTAC CTTTCGTTAT
CTCGATAGCC AAGACAAGCA GACTATCCGC CGCCATGCTT TCGAAGGGGT GATTCCCAAT
ATGGAGCGCC ATTACCGGGA GACCGAGACC TCGGCGGTGC GAGAAGAATT GGCTCGCTAT
TTAGCCGTGC AGGTTTGCCC GGCATGCCAG GGTACCCGGC TGCGCCAGGA GGCGCGCCAT
GTATTCGTGG CGGATTATAG TCTGCCTGAA ATCACCGCCT TGCCTGTAGG GCAGGCGCAG
AGCTTGTTTT CCCAGCTATG TTTACCGGGT CGCCGAGGGG CCATTGCAAA CCCAATCCTC
AAGGAGATCC AAGACCGGCT AGGCTTCTTG ATCAATGTCG GGTTGGGTTA TTTAACTTTG
AATCGGCGCG CGGAGACCCT CTCCGGTGGC GAGGCCCAAC GAATCCGACT GGCCAGCCAG
ATTGGAGCCG GTTTGGTGGG AGTGATGTAT ATTCTGGATG AGCCTTCTGT TGGTTTGCAT
CAGCGGGATC ATCAGCGGTT GCTGGAGACC TTGATCCGCC TCCGCGATCG GGGCAACACA
GTCATTGTGG TGGAGCACGA TGAGGAGGCG ATCCGGGCGG CTGATCAGGT GATTGATATG
GGTCCTGGCG CGGGCAGGCA TGGGGGGGAG ATCGTGGCCC AAGGCACGCC GCTAGAGATT
ATGGCTAATC CGGCTTCCTT GACTGGGCAG TATCTCAAGG GACAGAAAGA AATCCCCATG
CCCTGCCAGC GGGTGCCCTT CAATACTTCC CGCCTGCTTT CTTTACGGGG GGCTCATGGC
AATAATCTGG ATCAGGTGGA TTTGGATATT CCCCTAGGAG CGATGACCTG TATCACAGGA
GTTTCCGGCT CGGGTAAATC CACCTTAATC AATGACACCC TGTTGCGGGC GGCGACTCGA
ATTCTTCATC GAGCCTCGGT TGAGTCTGCC CCTTATGAAA GCATTGAGGG CTTGGAACAT
CTGGATAAGG TGATCGCTAT CGATCAAAAT CCCATTGGCC GCACGCCCCG TTCCAACCCG
GCCACTTATA CTGGATTTTT TGCTTCTATT CGCAGCCTAT TCGCGGGCAC ACATGAGGCT
CGCTCTCGGG GCTATGGGCC GGGTCGTTTC AGTTTTAATG TCAAGGGAGG ACGCTGCGAA
TCTTGCCAAG GTGATGGGCT AATCAAGGTA GAGATGCACT TTCTTCCCGA TCTCTATGTG
GCCTGCGATG TTTGCCAGGG CAAACGCTAT AATCGGGAAA CCCTGGAGAT TCGCTATAAG
GGCAAAAGTA TTGATGAAAT ATTAGCAATG ACGGTGGAAG AAGCGCAGGA TTTCTTTGCC
AATGTTCCGG CAGTAGCCCG AAAGCTGCTG ACTTTGCTAG AGGTGGGACT CTCCTATATC
ACGCTAGGGC AAAATGCAGT TACCCTCTCG GGCGGCGAGG CCCAGCGGAT CAAACTGGCG
AAAGAGCTAG CCAGGCGGGA TACGGGGCGG ACTCTGTACA TTCTAGATGA GCCGACCACG
GGGTTGCATT TTCATGATAT TGCCCAGCTT CTCCAAGTAT TATTGCGGTT GCGGGATGGG
GGCAATACCA TTGTGATCAT TGAACACCAT TTGGATGTCA TCAAGACGGC TGACTGGATT
GTCGATTTGG GTCCCGAGGG AGGGGAAGGG GGAGGAAGGA TTATCGCTAC CGGGACGCCT
GAGACGGTGG CTGCCTGCCA AGCTTCCTAT ACTGGACGTT ATCTGGCTCG GATTTTACCC
AAGGCCAAGC GAGGCCAATC CCCGGTAGCA GCAAAGCCAT GA
 
Protein sequence
MNRICIRGAR THNLKNIDLD LPRERLIVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHMEGLS PAIAIEQKST SHNPRSTVGT ITEIYDYLRL LYARAGEPRC
PEHGIILAAQ TVSQMVDQVL GLPEGGRYML LAPIVEGRKG EHLQVLENLL RRGFIRARID
GEVVELEQAP QLDGNKKHTI EAVVDRFKVR PDLQLRLAES FETAIQLSDG LARVASLEES
ALAEQVFSAH LACPVCRYSL SELEPRLFSF NNPKGACPSC EGLGVKPFFD SSRVVAHPEL
SLAAGAVRGW DRRNAYYYQM ILSLARHYGF EVDLPFQDLP EAVRRVVLYG SGQEKITFRY
LDSQDKQTIR RHAFEGVIPN MERHYRETET SAVREELARY LAVQVCPACQ GTRLRQEARH
VFVADYSLPE ITALPVGQAQ SLFSQLCLPG RRGAIANPIL KEIQDRLGFL INVGLGYLTL
NRRAETLSGG EAQRIRLASQ IGAGLVGVMY ILDEPSVGLH QRDHQRLLET LIRLRDRGNT
VIVVEHDEEA IRAADQVIDM GPGAGRHGGE IVAQGTPLEI MANPASLTGQ YLKGQKEIPM
PCQRVPFNTS RLLSLRGAHG NNLDQVDLDI PLGAMTCITG VSGSGKSTLI NDTLLRAATR
ILHRASVESA PYESIEGLEH LDKVIAIDQN PIGRTPRSNP ATYTGFFASI RSLFAGTHEA
RSRGYGPGRF SFNVKGGRCE SCQGDGLIKV EMHFLPDLYV ACDVCQGKRY NRETLEIRYK
GKSIDEILAM TVEEAQDFFA NVPAVARKLL TLLEVGLSYI TLGQNAVTLS GGEAQRIKLA
KELARRDTGR TLYILDEPTT GLHFHDIAQL LQVLLRLRDG GNTIVIIEHH LDVIKTADWI
VDLGPEGGEG GGRIIATGTP ETVAACQASY TGRYLARILP KAKRGQSPVA AKP