Gene Noc_0063 details


Gene Information

Locus tag: Noc_0063
Symbol:
ID: 3705901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 64658
End bp: 67546
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 61%
IMG OID: 637736588
Product: Type III restriction enzyme, res subunit
Protein accession: YP_342135
Protein GI: 77163610
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
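
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 64658
END_BP = 67546


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2889 962
```

Under these assumptions the computed values agree with the Gene Length (2889 bp) and Protein Length (962 aa) fields of the record.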


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTGGC AATACAGCAC CGTTCATAAC AGCGCCTGTA AGGTCATCGA AGAACAGACC 
TTATGGGGGC AGGCCGTGTG CCGTATCTGG TTGCCGAACC AGGACGCGGT GGTGCGCGTG
CCCCGCTCCG CCTTACGGCC GCTGAGTGCC GACCTGCAAC CGGAGATCGA GGCTGGACGC
ATTGCCTATG TGGCCGCCGC AGCCAAGGTG GCCGAGGTGC TCGAAGGCTC CACCAGCGCC
ACTGACGGTC ATGTATTGCT GGCTCCCATG GAGTCCAATG TTATCCCGCT GCCGCACCAG
ATCCACGCCT TGTCCCGGGC TATCTCCGGC GACCGTGTGC GCTACCTGTT GGCCGACGAG
GTGGGCCTCG GCAAGACCAT CGAGGCCGGT CTAGTCATGC GCGAGCTCAA GCTGCGCGGA
CTGGTTCGGC GAATCCTGGT CGTCTCTCCC AAAGGCATCG CTACCCAGTG GGTGGCGGAA
ATGCAGACCC ACTTCAATGA GCAGTTCCAG CTCGTGCTGG GTGATGACAT CAGCACCTTG
CAGCGCCTGG CCCCAGGGGC GGACCACCGG AACTCAGCCT GGTCGATGTT CGATCAGGTC
ATCGTTCCCC TGGATTCGGT CAAGCCCATG GACAAGCGGC GGGGTTGGAC CGCCGGGCGC
GTTGCCGAAT ACAATCGCAG CCGGTTCGAG GATCTGATCA CTGCTGGTTG GGATCTGGTA
GTGGTGGACG AAGCGCACCG CCTGGGTGGC AGTACCGATC AGGTCGCCCG CTATAAGCTC
GGCAAGGGCC TGGCGGAGGC CGCGCCCTAT GTACTGCTCC TTTCGGCTAC GCCCCACCAG
GGGAAGACCG ATGCTTTCCA TCGCTTGATG AACCTGCTGG ATGAGGACGC CTTCCCGGAT
ATGGACAGCG TTTCCCGCGA CCGGGTGGCT CCTTACGTCA TCCGCACGGA AAAACGGAAG
GCCATCGATG CCGATGGCAA GCCCCTCTTC AAAGCCCGGC GCACGCAGAT GGCCCCGGTA
GTCTGGGAGA GCCGTCATCA CCTGCAGCAG CTCCTCTATG AGGCGGTGAC CGACTATGTG
CGCGAGGGCT ACAACCAGGC TCTGCGCGAG AAGAAGCGCC ACATCGGCTT TCTGATGATC
CTGATGCAGC GCCTGGTGGT CTCTAGCACC CGAGCAATCC GCACCACTTT GGAGCGTCGG
CTCGCGGCAC TTAAGGAAGG CGAGCAGCAA GCCAGCCTGC GCCTAGCGGA GCTGGAAAAC
AGTGCGGGGG GATCGGAAAA CACAGACGAT GAAATAACCG AGCTCTACGA CATGGACGGC
CAGGAGTTGC TCGATGAACT GCTGAAATCC CATGTGTTGG CTCTACAGAG CGAAGGCAGT
CATGTGGAGA CTTTGCTAGA TGCGGCGGTT CGCTGCGAGC AGGCGGGGCC GGACGCCAAG
GCCGAGGCGT TGATCGAGTG GATCTACGAG TTGCAAGCCG AGGAAAACGA GCCGGATCTG
AAAGTGTTGA TCTTCACTGA GTTCGTACCG ACCCAGGAGA TGCTGAAGGA GTTTCTGGAA
GCCCGGGGAA TCTCGGTGGT CACCCTGAAC GGCTCCATGG ATATGGAGGT ACGTGGGGCA
GCCCAGGATA CCTTCCGTAA ATCGCACCGC GTGCTGCTTT CCACCGATGC GGGCGGTGAG
GGTCTAAACT TGCAGTTCGC CCATGTCATC ATCAACTACG ACATCCCCTG GAACCCGATG
CGGTTGGAGC AGCGAATCGG CCGCGTGGAC CGTATCGGCC AGCCCAAGAT GGTGCGAGCG
ATCAACTTCG TGTTTGAAGA TTCGGTCGAG TTTCGCGTTC GCGAAGTGTT GGAACAGAAG
CTCTCGGTGA TCTTTGACGA GTTCGGCATC GATAAGACTG GTGACGTGCT TGACTCAGCT
CAGGCCGGCG AGTTGTTCGA GGATGTGTTC GCGCAGGCGT TCGCCAACCC TGATGGTATT
GAAACTTCCG TCGATCAGAC GGTGACTCGG ATTCGCGATG AGATTCAGCA GGTGCGGGAG
TCCTCCGCCA TCTATGGCAT TTCCGAAGAA CTGAATGTGC AGGCGGCTGA GCAGCTGCGC
TCCCATCCGC TGCCCCACTG GGTGGAGCGG ATGACGGTGG GCTATCTCAA TTCCCACGGC
GGCACAGCCA GCCGTAAACG CTCCTGGTGG GATCTAAATT GGCCGGACGG TCAGGAGCAT
CGCAAGGCCG TGTTCAATGC CCGGGAAGCG GACCGGCTGA CCGATGCAAC CCTGCTCAAT
CTTGAAAACA GCCGTGTCCG TGGGCTGGCC TTGAACCTGC CGCAGATCGC GGCGGGCCAG
CCATTGCCTT GCGTAAGCGT GAGCGGTCTG CCAACCAGCA TCTCCGGTCT CTGGGGACTC
TTTGAGATCC GCCTTCAGGC CGGAATGCAC CAGAAGACAC AACTCCTGCG CATCCCCATG
GTGCGGCGCG GTTATGTCAG CGTGTTCCTG AGCGAGGAAG GCAAACTGTT TCTGCCCACG
GCCCGGCATA TCTGGGATGC GCTGCAGACA GCGGAAGCCC AGGTGCAAGC CACCCTCGGG
CGAGATGAAT CCATCACCGC CCATGAGCGT TTGCGGATTG CTGCCGAACA GGCCGGACAG
GAGCTGTTTG ACGCCTTGCA GCAGGTACAT CTTGCCGCTG TGGCTTACGA GGAGGAACGC
GGAATTGTCT CCTTTGCCTC GCGCCGCAAG GCCATCGAAC GGGTTGGATT GCCGGAGGTG
CGGCAATTCA GGCTGGCCCG TTGCGACGCA GAAGAATCCG AATGGCGACA TGAACTGCAA
TCGGCGCGGC AGATCGTGCC GGAAATCCGG TCGCTGCTGA TGCTGCGGAT TATCAAAAGA
GGCGCTTAA
 
Protein sequence
MPWQYSTVHN SACKVIEEQT LWGQAVCRIW LPNQDAVVRV PRSALRPLSA DLQPEIEAGR 
IAYVAAAAKV AEVLEGSTSA TDGHVLLAPM ESNVIPLPHQ IHALSRAISG DRVRYLLADE
VGLGKTIEAG LVMRELKLRG LVRRILVVSP KGIATQWVAE MQTHFNEQFQ LVLGDDISTL
QRLAPGADHR NSAWSMFDQV IVPLDSVKPM DKRRGWTAGR VAEYNRSRFE DLITAGWDLV
VVDEAHRLGG STDQVARYKL GKGLAEAAPY VLLLSATPHQ GKTDAFHRLM NLLDEDAFPD
MDSVSRDRVA PYVIRTEKRK AIDADGKPLF KARRTQMAPV VWESRHHLQQ LLYEAVTDYV
REGYNQALRE KKRHIGFLMI LMQRLVVSST RAIRTTLERR LAALKEGEQQ ASLRLAELEN
SAGGSENTDD EITELYDMDG QELLDELLKS HVLALQSEGS HVETLLDAAV RCEQAGPDAK
AEALIEWIYE LQAEENEPDL KVLIFTEFVP TQEMLKEFLE ARGISVVTLN GSMDMEVRGA
AQDTFRKSHR VLLSTDAGGE GLNLQFAHVI INYDIPWNPM RLEQRIGRVD RIGQPKMVRA
INFVFEDSVE FRVREVLEQK LSVIFDEFGI DKTGDVLDSA QAGELFEDVF AQAFANPDGI
ETSVDQTVTR IRDEIQQVRE SSAIYGISEE LNVQAAEQLR SHPLPHWVER MTVGYLNSHG
GTASRKRSWW DLNWPDGQEH RKAVFNAREA DRLTDATLLN LENSRVRGLA LNLPQIAAGQ
PLPCVSVSGL PTSISGLWGL FEIRLQAGMH QKTQLLRIPM VRRGYVSVFL SEEGKLFLPT
ARHIWDALQT AEAQVQATLG RDESITAHER LRIAAEQAGQ ELFDALQQVH LAAVAYEEER
GIVSFASRRK AIERVGLPEV RQFRLARCDA EESEWRHELQ SARQIVPEIR SLLMLRIIKR
GA
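
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction helper. The names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here, so the result is not expected to match the whole-gene GC content field unless the full sequence is pasted in.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied verbatim from the record above.
first_line = "ATGCCTTGGC AATACAGCAC CGTTCATAAC AGCGCCTGTA AGGTCATCGA AGAACAGACC"

dna = clean_sequence(first_line)
print(len(dna))                 # 60
print(dna.startswith("ATG"))    # True
print(round(gc_fraction(dna) * 100))
```

Applying `clean_sequence` to the complete gene sequence should yield a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost in copying.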