Gene Noc_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0442 
Symbol 
ID3706613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp478400 
End bp479554 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content55% 
IMG OID637736952 
Producttransposase 
Protein accessionYP_342496 
Protein GI77163971 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000469255 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATAC AATGCGCCTA CAAGTTCAGG TTTTACCCAA CGCCCACGCA AAAGCGGCAA 
TTGGCCCTTG AGTTTGGCCA TGCCCGTTAT GTGTGGAATT GGGCCTTGGA AACCCGAACG
AAGGCGTATC AAGCGCAGGG GGAGTCGTCG AATACTATAA GTCTTAGTCG CCAATTGACG
GCACTGAAAA AGACGCAATG CCCCTGGTTG AGCGAAGCCA CCGCTAGTTG CCATACCCAA
AAGCTCAGGG ACCAAGATAC GGCCTTCAGG AACTTTTTCG CAGGTCGAGC GAAGTATCCC
CGCTTTAAGA AGCGCCACCA CACGCAATCG GTACGCTATC AGTTGGACCA ACGCCATGTG
GCGAAGAACT TCAACGCTGA AAGCAAGCTG TTGAAGCTGC CCAAGCTTGG CAGAGTTAAG
TTGAGATGGT CCCGTGGTAT CGAGGGCATC CCCAAGATGG TCACGGTCAG CCAAGCCCCG
GCGGGCCGTT ACTTCGTTAG CCTGACCTGC GAGGTGGAGA TTCTCCCCTT GCCTGTGCGA
AGGAACGCTA TCGGCGTGGA TGTGGGGGTT AAGGATGTGG TCATTACTTC CGAAGGCTGG
AAGTCGGGTG CGCCCAAATA CACCTATCAC TATGCCCGGC AATTGAAAAT GGCCCAGCGT
CGCCTGAGCA AAAAAAAGAA AGGCTCTCAG CGTCGCCGCC AGCAGCAACA GCGGGTAGCG
CGCATCCATG CCCGGATAAC CGATAGCCGC CGGGATTTTT TGCACCAACA ATCCTCCAAG
ATAGTCAACG AGAACCAAGT GATCTGCCTG GAGGATTTGA ATATCCAAGG GATGTTGAGA
AACCGACGCC TGAGTAAAGC CATAGCTGAT TGCGGGCTGT ATGAGCTCAG ACGGCAAATG
GAGTACAAGG CCGCCTGGTA TGGCCGGGAG GTGTTGATCG TGGACCGCTG GGCACCCACC
AGCAAGACGT GCTCGGCGTG TGGGGCTGTG CAAGAGTCCA TGCCGCTCAA AGTGCGCGCA
TGGGCTTGTG AATGTGGGGC CACCCACGAT AGGGACATCA ACGCAGCCAA AAATATTTTG
TTTTTCGGTA CGGCGGGGAG CGCCGGAACC TCTAAAGCGC GTGGAGCGGT AAAACCCCCA
AGGGCCGTGG CCTAG
 
Protein sequence
MIIQCAYKFR FYPTPTQKRQ LALEFGHARY VWNWALETRT KAYQAQGESS NTISLSRQLT 
ALKKTQCPWL SEATASCHTQ KLRDQDTAFR NFFAGRAKYP RFKKRHHTQS VRYQLDQRHV
AKNFNAESKL LKLPKLGRVK LRWSRGIEGI PKMVTVSQAP AGRYFVSLTC EVEILPLPVR
RNAIGVDVGV KDVVITSEGW KSGAPKYTYH YARQLKMAQR RLSKKKKGSQ RRRQQQQRVA
RIHARITDSR RDFLHQQSSK IVNENQVICL EDLNIQGMLR NRRLSKAIAD CGLYELRRQM
EYKAAWYGRE VLIVDRWAPT SKTCSACGAV QESMPLKVRA WACECGATHD RDINAAKNIL
FFGTAGSAGT SKARGAVKPP RAVA