Gene Noc_1594 details

Gene Information

Locus tag: Noc_1594
Symbol:
ID: 3705756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1774080
End bp: 1775261
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 56%
IMG OID: 637738073
Product: transposase
Protein accession: YP_343602
Protein GI: 77165077
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
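
The coordinates and lengths in this record are consistent with each other, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the gene is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1774080
END_BP = 1775261
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Number of amino acids, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```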


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATACCT CGTTGTATCA TGTATACATG ATAACGCAAC GCGCTTACAA ATTCAGGTTC 
TACCCAACAC CTACGCAGAA GCGGCAATTG GCCCTTGAGT TTGGCCATGC CCGCTTTGTG
TGGAATTGGG CCTTGGAAAC CCGAACGAAG GCGTATCAAG AGCGCGGCGA GCGGCTGAAT
AATATCGGGC TTAGCCGCCA ACTGACGGCG CTGAAAAAGG CCGAGTATCC CTGGCTGAGC
GAAGCGACCG CCGGTTGCCA TACCCAAAAG CTGCGGGACC AGGACACGGC CTTTAAAAAC
TTCTTTGCGG GCCGGGCGAA ATATCCCCGC TTTAAGCGCC GCCACCACAC CCAATCGGTA
CGCTATCAAC TCGATCAACG CCATGTGGCC AGGAATTTCA ACGCCGAAAG CCAACGGCTG
AGACTGCCTA AGCTCGAAGC GTTGAAACTC AAGTGGTCCC GTGATATCGA GGGTATTCCC
AAAATGGTGA CGGTGAGTAA AGACCCCGCT GGGCACTATT TTGTCAGCAT GGCCTGTGAG
GTGACTATCG TTCCCTTGCC TGCCAGAAGG AATGCCCTTG GCGTGGATGT GGGCGTGAAG
GATATCGCCA TCACCTCCGA GGGTTGGAAG TCCGGCGCGC CTAAATACAC TGACCGCTAT
GCTCGGCAAT TGAAAAGGGC GCAGCGCCGT CTGAGCAAGC GGCAAAAGGG ATCGGGCAGG
CGCTATCAAC AACGCCAGCG CGTGGCCCGT ATACATGCTC GGATCAAGGA TAGCCGCCGG
GATTTTCTGC ACCAAATCTC CTCCAAGCTC ATCAACGAGA ACCAAGTGAT TTGCCTGGAG
GATTTGCATA TCAAAGGAAT GTTGAGAAAT CGCCGCTTGA GCAAAGCCAT CGCCGATTGC
GGCCTGTATG AACTGCGGCG GCAAATTGAA TATAAAGCGG CGTGGACTGG CCGTGATGTG
TTGATCGTGG ATAGATGGGC GCCGACGAGT AAAACCTGCT CGGCCTGTGG CACGGTGCAA
GAGTCCATGG CGCTGAAAGT GCGCGCATGG ACTTGTGGCT GTGGCGCTAG CCACGATAGG
GACATCAACG CGGCCAAAAA TGTGTTGTTT TTCGGTACGG CGGGGAGCGC CGGAACCTCG
AAAGCGCGTG GAGCGGTAAA ACCCCCAAGG GCCGTGGCCT AG
 
Protein sequence
MYTSLYHVYM ITQRAYKFRF YPTPTQKRQL ALEFGHARFV WNWALETRTK AYQERGERLN 
NIGLSRQLTA LKKAEYPWLS EATAGCHTQK LRDQDTAFKN FFAGRAKYPR FKRRHHTQSV
RYQLDQRHVA RNFNAESQRL RLPKLEALKL KWSRDIEGIP KMVTVSKDPA GHYFVSMACE
VTIVPLPARR NALGVDVGVK DIAITSEGWK SGAPKYTDRY ARQLKRAQRR LSKRQKGSGR
RYQQRQRVAR IHARIKDSRR DFLHQISSKL INENQVICLE DLHIKGMLRN RRLSKAIADC
GLYELRRQIE YKAAWTGRDV LIVDRWAPTS KTCSACGTVQ ESMALKVRAW TCGCGASHDR
DINAAKNVLF FGTAGSAGTS KARGAVKPPR AVA
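
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch in plain Python. It assumes the blocks of ten letters are separated only by whitespace, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not handle). The helper names are illustrative.

```python
# Codon table built from the standard layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGTATACCT CGTTGTATCA TGTATACATG ATAACGCAAC GCGCTTACAA ATTCAGGTTC"
print(translate(first_line))  # MYTSLYHVYMITQRAYKFRF
```

The printed string matches the first two blocks of the protein sequence. Passing the full gene sequence to translate and gc_content would allow the protein sequence and the GC content listed in the record to be compared in the same way.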