Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0401 |
Symbol | |
ID | 3706572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 444098 |
End bp | 445306 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637736913 |
Product | transposase |
Protein accession | YP_342457 |
Protein GI | 77163932 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGT TAAGTAAAAT CCAAGATAAT CTACGAAAAT TGGAATGTAA GAGCATGTTA CTGGCACATA AGATTGAACT AAGGCCCAAA GCAAGCCAAG CCGAATATCT GAACAAGTCT TGCGGTTCAA GGCGGCACTG CTATAACCAA CTACTGGAGC ACTTCTCCAA GCCGGATAAT AAGTGGAGCA AGGCGGCGGC CTACCAGTAC TATATAAAAG TCATCCGTCC TGCGTATCCC TGGTATAACG AAGTTTCCAG TCGCGTGACC CGCAACGCCA TTGATGACCT GGATGATGCT TTCAAGCACT TTTTTCGGCG GGTGAAAAAG GGAGGGAAGC CCGGTTTTCC AAAGTTTAAG AAAAAAGACA TCAACGATAG CTTCGCCCTA CGTGAGAAAA CCAAGTTTGA GGTCAAGGGC CGCAAGCTGA GGATTGAAAA ACTCAAAACA CTTATCCCCA TGCGCCAGCG GCTGCGCTTC GAGGGAACGC CCAAGCAAGT GGCGATGAGT AAGCAAGCCG GTAAGTATTT CGCCTCCGTT CTGGTAGATA CGACAGACTA TAAGGACTAT AGCCAAAACC GATCCCCCTC CGTAGGCGTG GATTTTGGCG TCAAGTCGCT GGCCGTGACT TCTGACAATG AAGTGATTCC TTCCAACAAC AAGTTAAAAA AGAGCCTAAA GAAACTCAAG CATTTAAGTA GAAGTCTATC CAGAAAGCGC AAAGGCTCCA ACCGTCGAGC GATAGCCAAG CAGCGGTTAG CCAAATTGCA CTATCGGATA GCTCAACAAA GGAAAGCCGT GCTCCATGAA CTGAGCCATA GTTTAACGGC AAACTATGAT CGAATCGCCA TAGAAGATCT CAATGTTAAA GGGATGGTTC GGAACCGTAC ACTAGCCCGG TCCATCGCCG ATGCGGGCTT CGGGATGCTG CGCCAGTTGA TTGAATACAA AGCCTTTCTT CGTGGCTGCA CGGTTGAGCT GGTAGATAGG TTCTTCCCCT CTAGCCGGAT GTGTTCAGGC TGTGGACAGC TTCACGACAT CACGCTCGCG GATAGAGCAT TGGCCTGTGA TTGTGGATTA ACCATAGACC GCGATCTCAA TGCCGCGATT AATTTAAACC GGTATCGTCG GGACACGCTC AAGCCAGACG TAAAACGCAC GCAAGAGCCA AGTAAGACCG CGCTAGCGGC ATCGGTGTGG ACGGTGTGA
|
Protein sequence | MKLLSKIQDN LRKLECKSML LAHKIELRPK ASQAEYLNKS CGSRRHCYNQ LLEHFSKPDN KWSKAAAYQY YIKVIRPAYP WYNEVSSRVT RNAIDDLDDA FKHFFRRVKK GGKPGFPKFK KKDINDSFAL REKTKFEVKG RKLRIEKLKT LIPMRQRLRF EGTPKQVAMS KQAGKYFASV LVDTTDYKDY SQNRSPSVGV DFGVKSLAVT SDNEVIPSNN KLKKSLKKLK HLSRSLSRKR KGSNRRAIAK QRLAKLHYRI AQQRKAVLHE LSHSLTANYD RIAIEDLNVK GMVRNRTLAR SIADAGFGML RQLIEYKAFL RGCTVELVDR FFPSSRMCSG CGQLHDITLA DRALACDCGL TIDRDLNAAI NLNRYRRDTL KPDVKRTQEP SKTALAASVW TV
|
| |