Gene Noc_1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1644 
Symbol 
ID3705674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1838148 
End bp1839947 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content41% 
IMG OID637738119 
Productexcinuclease ABC subunit C 
Protein accessionYP_343646 
Protein GI77165121 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATT TCGATATTGA TGATTTTCTG AGGAATTTAA CTCCTTGCCC AGGTGTTTAT 
CGTATGCTAG ACGCCAAAGG AAAAGTGCTC TATGTTGGTA AGGCAAAAAA CCTAAAAAGG
CGGATAAAGA GTTATTTTAG AAATTCCAAA TTAGCACCTA AAATTCATGT TCTAGTAAAG
CAAATTTGCG ATATCAAAAT TACTGTGACT CATACGGAAA ATGAAGCACT GATTCTAGAA
AGTAATCTTA TTAAAGCTCT GCAACCTCGC TATAATGTAT TATTAAGGGA TGATAAGAGT
TACCCTTACA TTTTTCTCTC TGCTGACGAT TTTCCCCGTT TAGGATTTCA TCGGGGGGTG
AAACAAGTTT CTGGCCAATA TTTCGGCCCT TATCCAAATA TTAGATCGGT ATGGCAAACG
CTGAAACTAC TTCAGAGGGT TTTTCCCGTA CGCCAGTGTG AAGATAACTT TTACCGTAAT
CGTTCTCGTC CCTGTCTACA ATATCAGATC AAACGCTGCA CTGCTCCTTG CGTGGGCTTA
ATCAGTAAAA AAGATTATAG TCAAGATATT CAACACGTAG TGATGTTTTT AAAAGGGCGG
GATCAACAGG TTATCAATGA GTTGGTAATA CGTATGGAAG AAGCTTCCGG GCAACTTGCT
TTTGAACAAG CAGCCTATTA CCGGGATCGA ATTGCTAGCT TACGCCAAAT ACAGGCACGT
CAGTATATAA GCGGTGAAAA AAAAGACATT GATGTGCTTG GGGTCGCTCT TACGGAGAAA
ATGGCTTGTG TGGAGGTTTT TTTTATTCGT GGTGGGCATA ATTTAGGAAA TAAGACTTTC
TTACCTAAGC TTGAAGGAAA TCTAACACCA GAAGAGTTAC TTTCTACCTT TATAGCACAA
TATTATTTGA ATCGAGAGAC TCCGCCCATC CTTATATTAA GCCACCAGCC TAAAGATATG
GGTTTGTTAA CTGAAGTACT AAGCAAGCAG GCGGGAAGAA AGATTGCTTT GATAAAGCCA
GTTCGTGGTC CCAAGGTGCA GTGGATAAAA ATGGCTTTAG CCAATGCTAA GATTAATTTA
AATCAACATC TCGCAGAGAA ATCGAACATT ACCGCACGAT TTAAAAGCTT ACAGCAATTA
CTAAGCTTAG CAAACTTCCC ACAACGGATT GAGTGCTTTG ATGTTAGTCA TATTCAAGGA
ACGGCTACAG TGGCTTCCTG CGTAGTATTT GATAGAGAGG GTCCCCGTAA GGCTGATTAC
CGTCGTTTTA ATATCACAGG GATCATTCCA GGGGATGATT ATGGAGCCTT ACGTCAAGCG
TTAATGCGGC GTTTTAAAAA AAAAGAGGGA GTATTTCCAG ATTTGCTCGT GATTGATGGT
GGTAAAGGTC AGATAAATCA GTCGCTTAGA GTTCTTAAAG AAATAGGCAT TACAGAAATC
ACCGTGTTGG GTATAGCGAA AGGGCCTGAA CGTAAGGCAG GGAATGAAAC CTTATTTTTG
GCGGGATATG AGAATCCGGT AATGGTAACA TCTGATTCTC CAGCTTTACA TATATTGCAA
CATATCCGTG ACGAAGCTCA CCGTTTTGCT ATTGTTAGCC ATCGTAAACG CCGTGCTAAG
GGGGGTAAGC TTTCCCTTTT AGAGGGAATA TCTGGATTAG GTCCCAAACG TCGTCGGAAG
CTATTGATTC AACTAGGTGG ATTGCAGGAG ATAACACGGG CAGGAGTTGA AGATCTAGCT
CAAATAGAGG GAATTAGCTT GGAATTAGCT CAGCGTATTT ATGACGTTTT TCATAGATGA
 
Protein sequence
MANFDIDDFL RNLTPCPGVY RMLDAKGKVL YVGKAKNLKR RIKSYFRNSK LAPKIHVLVK 
QICDIKITVT HTENEALILE SNLIKALQPR YNVLLRDDKS YPYIFLSADD FPRLGFHRGV
KQVSGQYFGP YPNIRSVWQT LKLLQRVFPV RQCEDNFYRN RSRPCLQYQI KRCTAPCVGL
ISKKDYSQDI QHVVMFLKGR DQQVINELVI RMEEASGQLA FEQAAYYRDR IASLRQIQAR
QYISGEKKDI DVLGVALTEK MACVEVFFIR GGHNLGNKTF LPKLEGNLTP EELLSTFIAQ
YYLNRETPPI LILSHQPKDM GLLTEVLSKQ AGRKIALIKP VRGPKVQWIK MALANAKINL
NQHLAEKSNI TARFKSLQQL LSLANFPQRI ECFDVSHIQG TATVASCVVF DREGPRKADY
RRFNITGIIP GDDYGALRQA LMRRFKKKEG VFPDLLVIDG GKGQINQSLR VLKEIGITEI
TVLGIAKGPE RKAGNETLFL AGYENPVMVT SDSPALHILQ HIRDEAHRFA IVSHRKRRAK
GGKLSLLEGI SGLGPKRRRK LLIQLGGLQE ITRAGVEDLA QIEGISLELA QRIYDVFHR