Gene Noc_2686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2686 
Symbol 
ID3704443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3041767 
End bp3042885 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content50% 
IMG OID637739168 
Producttransposase 
Protein accessionYP_344669 
Protein GI77166144 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01765] transposase, putative, N-terminal domain
[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.442816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACAA CGCTCCAAAT CAAATTGCTT CCTGATGGAA CTCAGCACTC GGCCCTGAAA 
GAGACCATGC GGGTTTTTAA TGACGCTTGT AACGCCATTG CAGAAGTGGC TTTTAGAGAA
CAGTGCGCCT CTAAATTTGA GTTGCAAAAA CTTGTGTATG CGGATGTTAG GAAGCAGTTT
GGTTTGTCGG CCCAATTGAC TATTCGCGCC ATCGCCAAAG TCGTTGAAGC TTACAAGCGA
GATAAATCCA AGCAGTGCTT CTTTAAGCCG ACGGGTGCCG TGGTGTATGA CCAGCGCATA
CTGAGCTTTA AGGGTTTGGA TAGAGCCAGC CTTGTAACGA TGCAAGGGCG CGTGTCTATT
CCTATACAGA TGGGCCAATA CCAGCGCGTA CAATGGCATC GTGCCAAAGG ACAGGCCGAC
CTGGTGCTTG TGAAGGGTGC TTTCTTTTTG TTGGTCGTCA TCGACACACC CGAAGCACCC
CCCATAGACC CGTCTGGTTT TATTGGTATT GATCTTGGAA TTACCAAAGT GGCCACTGAT
TCCGATGGCG GGTCGTTCTG TGGTTCTACC GTGGAGCGTG TGCGCCAGCG CTACCACCGT
TTGCGTAGGC GACTGCAGTC TAAGGGCACG CGCTCGGCTA AGCGGCATTT GAAGAAAATT
CGACGCAAGG AAGCGCAGTT TCGAAGAAGT CAAAATCATA TTATTTCTAA GCGTCTTGTC
GAGAAAGCTA AAGACACCGG ACGCGGAATT GCTTTGGAAG AGTTGAAGCA TATCCGCAGC
CGGACAACGG TTCGGAAATC CGACAGGGCC AAGCACAGCG GTTGGTCGTT CTTTCAACTT
CAATCCTTTA TCGAATATAA GGCGAAGCTT GCGGGTGTCT TTGTTCAATA TATTGACCCC
TGGTATACCT CGCGCACCTG TAGCGCCTGC GGGCATGCCG ATAAAGCTAA CCGCAAAACC
CAATCCCACT TTCAATGTGT CTCTTGTGGA TACACTGATA ATGCGGATAT CAATGCGGCG
ATCAATATTG CTGCAAGGGC TGACGTCATG CAGCCTATGG TGATGCGTGC GACGACGGCA
AAGGATAGCC CGAGCACAGC TACAAGCCTC CCCCTTTAG
 
Protein sequence
MKTTLQIKLL PDGTQHSALK ETMRVFNDAC NAIAEVAFRE QCASKFELQK LVYADVRKQF 
GLSAQLTIRA IAKVVEAYKR DKSKQCFFKP TGAVVYDQRI LSFKGLDRAS LVTMQGRVSI
PIQMGQYQRV QWHRAKGQAD LVLVKGAFFL LVVIDTPEAP PIDPSGFIGI DLGITKVATD
SDGGSFCGST VERVRQRYHR LRRRLQSKGT RSAKRHLKKI RRKEAQFRRS QNHIISKRLV
EKAKDTGRGI ALEELKHIRS RTTVRKSDRA KHSGWSFFQL QSFIEYKAKL AGVFVQYIDP
WYTSRTCSAC GHADKANRKT QSHFQCVSCG YTDNADINAA INIAARADVM QPMVMRATTA
KDSPSTATSL PL