Gene Noc_2902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2902 
Symbol 
ID3707419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3281342 
End bp3282565 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content54% 
IMG OID637739379 
Producttransposase 
Protein accessionYP_344878 
Protein GI77166353 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01765] transposase, putative, N-terminal domain
[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00617488 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAAA TTGTTAGAAC AACGCTGATT AAGGTTGATT TGCCCTTTGA GGCGGCAAAG 
AACACGGTCA TCGCCTGGAC AGAAGCTTGC AATGCGGTGA GCTGTAGGGC CTTCGAGAAC
GGCAAACTCA GCAATGCGAT CAAACTGCAA CAGCTTTGTT ACGAAACGGC CAAATCCTTT
GGCCTGTCGT CGCAAGTGGC CGTCTCTTGT ATTCGGCAGG TTGCCAGCAA ATACGCGGCA
GCGCGCACGG CCAAAAAGAC CCTTAGCAAG CCTGTTTATT TTCGCCCATG CGCCGTTGTT
CTGCAGGGCG GAAAGCGCGG GCGAGACTTC AGTTTTACGC AACAAGGACT GAGCGTGTGG
ACGGTGAACG GAAGGATAAA AAGCCTTGTC TATCATGGTG CGCCGAAACT GCAAGCGTAT
GTTGCGAATT GGCGTCTGGG TGATGGCCGT TTGTTTGTTC GCAAGGGCAA GGTTTTTCTG
TCGGTCTCTT TCAAACATGA GGCCGAGACT ATTTCCAAAC CTAACGATGC TGTGGTGGGC
GTTGACCGGG GCATCAAGGT GTTGGCGACC GTCACCGACG GCCAACGCCA GCTTTTCTTC
GGTGGCGGAC ACACCCACCA TGTGCGCCAC CGCTACGCCA AAACCCGCGC TTCTCTCCAA
AAGAAAAAGG CACGAACGGG CAGCCGCTCC ACGCGGCGAG TCCTGAAACG GTTGTCTGGC
CGAGAGCGAC GCTTTATGAG AAATATCAAT CACGTGATGA GTCGCCATCT CGTCGATTTT
GCCAGGGACA CGGGCAACCC AACGATTGCC GTTGAGGACC TTGGTGGTAT CCGCAATGGA
CGCAGACTTA GAAAGCAACA GCGCACGGAT CTCAACCGCT GGGCCTTTTA TGAGTTGGAG
CAATTTATCC GCTATAAAGC GGACACCTTC GGTATGGAGG TGATCGGGGT TGACCCGAAG
CACACCAGCC AAGGCTGTTC CCGCTGTGGC CATACCGAAA AAGCTAATCG ACATCAACGT
CGTTTCCTCT GCAAAGCGTG TGGTTATGAA CTACACGCTG ATTTAAATGC CTCTCGCAAT
ATTCGCCTGA GAGGTGTCCT CGCCAGGCAA GTTCTTGACG AGGATGGGGT GCTGTCAATC
ACCCCTGAAG CACGCCCCGT TGATCCAGGC TCCAAACCAG GGGAGGGGAC GGGCAAGCCG
CTTGCTTTAG CTAGCGGTCA TTGA
 
Protein sequence
MKEIVRTTLI KVDLPFEAAK NTVIAWTEAC NAVSCRAFEN GKLSNAIKLQ QLCYETAKSF 
GLSSQVAVSC IRQVASKYAA ARTAKKTLSK PVYFRPCAVV LQGGKRGRDF SFTQQGLSVW
TVNGRIKSLV YHGAPKLQAY VANWRLGDGR LFVRKGKVFL SVSFKHEAET ISKPNDAVVG
VDRGIKVLAT VTDGQRQLFF GGGHTHHVRH RYAKTRASLQ KKKARTGSRS TRRVLKRLSG
RERRFMRNIN HVMSRHLVDF ARDTGNPTIA VEDLGGIRNG RRLRKQQRTD LNRWAFYELE
QFIRYKADTF GMEVIGVDPK HTSQGCSRCG HTEKANRHQR RFLCKACGYE LHADLNASRN
IRLRGVLARQ VLDEDGVLSI TPEARPVDPG SKPGEGTGKP LALASGH