Gene Noc_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1199 
Symbol 
ID3706697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1305832 
End bp1307064 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content55% 
IMG OID637737702 
Producttransposase IS605 
Protein accessionYP_343231 
Protein GI77164706 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01765] transposase, putative, N-terminal domain
[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0968514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATTG TACGCACGAC TAAAACCCGG CTGGATTGGG ACGTAGCAGC GGCGAAGCGC 
ACTGTTGAGG CGTGGTCGGC AGCATGCAAT GACATCAGCC AGCAGGCTTT CGCTCAGGGC
TGTCTTTCCA ATACCGTGCG TTTGCATCGG CTGGTTTATA GGGATATTCG CACCCGTTTT
GGGCTTTCCG CCCAGGTGGC GCAGAATGCG ATTCGTCACG TGGCGAGCAA GTATGCCGGT
GCGCGAATAA AAAAGATTCA ACTCAAGCGT CCTGTCACCT TTTCCAAACA GTGTGCGGTC
GCCCTTCAAG GCGGTGAGCG TGGCCGTGAC TTTGGTTTTA GGCACAAGGG CGTCAGCCTC
TGGACCGTGG ACGGGCGTAT CAAAGGCCTT CCCTTCCACG GCGAACCTCG CCTTTGTGAA
TACCTTTCTG AGTGGAAGAT GGGTGATGGG CGTTTGTTCA TCGGCAAGGG CAAAGTGTAC
TTGTCGATTT CTTTTAAGCG CGAAGTCGAG ACTGTCTTCA AGCCCAACGA TGCCGTGGTC
GGCGTAGACC GAGGTATCCG CGTATTGGCG ACCGTCACCG ACGGCCAACG CCAGCTTTTC
TTCGGTGGCG GACACACCCA CCATGTGCGC AACCGTTACG CCAAAACCCG CGCTTCTCTC
CAAAAGAAAA AGGCACGAAC GGGTAGCCGC TCCACGCGGC GCACCCTGAA ACGGTTGTCT
GGCCGAGAGC GACGCTTTAT GAGAAACGAT AACCACGTGA TGAGTCGTCG TATCGTTGAT
TTTTCCAGGG ACACGGGCAA TCCAACGATT GCCGTAGAAG ACCTAGGCGG TATCCGCAAC
GGGCGCAAAC TTAGAAAGCA ACAGCGCACG GATCTCAACC GCTGGGCCTT TTATGAGTTG
GAGCAATTTA TCCGTTACAA GGCGGACACC TTTGGTATGG AGGTGATCGG GGTTGACCCG
AAGTACACCA GTCAGGGCTG TTCCCGCTGT GGTCATACTG AAAAAGATAA TCGACATCAA
CATCGGTTCC TCTGCAAAGC GTGTGGTTAT GAACTACACG CTGATTTAAA TGCCTCTCGC
AATATTCGCC TGAGAGGTAT CCTGGCAAGG CAAGTTCTTT GCGAGGATGG GTCGCTGTCA
TGCGGCCCTG AAGCACGGCT CGTTGATCCA GGCTCGAAAC CTGGGGAGGG CGCGGGCAAG
CCGTCTGCTT TAGCTGTGAC GGTACATGAC TAA
 
Protein sequence
MEIVRTTKTR LDWDVAAAKR TVEAWSAACN DISQQAFAQG CLSNTVRLHR LVYRDIRTRF 
GLSAQVAQNA IRHVASKYAG ARIKKIQLKR PVTFSKQCAV ALQGGERGRD FGFRHKGVSL
WTVDGRIKGL PFHGEPRLCE YLSEWKMGDG RLFIGKGKVY LSISFKREVE TVFKPNDAVV
GVDRGIRVLA TVTDGQRQLF FGGGHTHHVR NRYAKTRASL QKKKARTGSR STRRTLKRLS
GRERRFMRND NHVMSRRIVD FSRDTGNPTI AVEDLGGIRN GRKLRKQQRT DLNRWAFYEL
EQFIRYKADT FGMEVIGVDP KYTSQGCSRC GHTEKDNRHQ HRFLCKACGY ELHADLNASR
NIRLRGILAR QVLCEDGSLS CGPEARLVDP GSKPGEGAGK PSALAVTVHD