Gene Information |
Locus tag | Noc_1991 |
Symbol | |
ID | 3704875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2289608 |
End bp | 2290759 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637738467 |
Product | transposase |
Protein accession | YP_343983 |
Protein GI | 77165458 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0166547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGT ACCAATTAAG ACTCTACCCC ACACTCCGAC AGCGGCGGCA GCTAGAGGAA GCATTTAGCG CCTGCCGGTA TGTGTGGAAT TGGGCGTTAG ATAGACGCAC CAGGGCCTAC AAGGAAAAAG GAGAGTCCCT GAATGCCATT GCGCTTTCGC GGGCGTTGAC GGCGCTCAAA AAAGAGAAGG TTTTTCTTAA GGCCGCCAGT GCGACGGCCC TTACGTATGT CCTAAAAAGC CAGGATGAGG CTTTCCAGAA GTTTTTCAAC AAGCAGGCCC GCTACCCGAA GTTTAAACGG CGTGGGCGGG TGCACTCCTG TACCTTCCAG CTCGACAAAC GCCGGGGCGA GAAGGTGTTT ATGCCAGGCC AATTATTGCG CCTGCCCAAG CTCGGCCCGG TGCGCGTGGT CTGGTCCTAC CAGGATATCC CCGTATTCCC CAACAGCGCC ACGGTCAGCT GCAATGCCTG TGGGCAATGG TTTGTCTCGC TCCAGTGTGA CTGTATCGAC GTGATACACC CGCCCGCCAC GGATAAAACC ATTGGGCTCG ATTTAGGGCT ATCGACCCTG ATAGCCATGA GCGATGGCAG AAAAGAGAAA CCCAGAAGAT TTTTAAAGAA CGCCTTACGC CGGTTGAGGT TTGCCCAGCG CCGTTTATCG AAGACGGCAA AAGGTGGCAG TAACCGGCGT AAGCAAAGGA GCCGCGTAGC TCGACTCCAC CAAAGAATAG CCAGCAAGAG GGCGAACTTT CTGCACGGAC TGAGTACTTC GATCGTACGC GAAAACCAAG CCATAGCGAT TGAGGACCTG AACGTGCGTG GCGTGATGGC CAACGGAAAG CTAGCCCGAT CGGTTGGGGA CTGCGGTTGG TACGAGTTAC GACGGCAGCT TACTTACAAA GCGAAGTGGT ACGGACGGCA ACTTAATGTG GTGCCGCGAT TCCAGCGTAC CACGGGGGTT TGTCCTGATT GCGGGACGGT AGGGGAAAAG CTGCCGCTGA GGGTGCGGTC CTGGACGTGC GGGCACTGTG GAAGCGCGCA CGATCGGGAT ATTGCCGCCG CTCGGGTGAT TGATTTAATG GGTAATACCG CGAGGAGCGC GGGAATTGAT GCCTGTGGAC TGGCGCACAA ACCGGAGGAG GCTGTTAGTT AG
|
Protein sequence | MKAYQLRLYP TLRQRRQLEE AFSACRYVWN WALDRRTRAY KEKGESLNAI ALSRALTALK KEKVFLKAAS ATALTYVLKS QDEAFQKFFN KQARYPKFKR RGRVHSCTFQ LDKRRGEKVF MPGQLLRLPK LGPVRVVWSY QDIPVFPNSA TVSCNACGQW FVSLQCDCID VIHPPATDKT IGLDLGLSTL IAMSDGRKEK PRRFLKNALR RLRFAQRRLS KTAKGGSNRR KQRSRVARLH QRIASKRANF LHGLSTSIVR ENQAIAIEDL NVRGVMANGK LARSVGDCGW YELRRQLTYK AKWYGRQLNV VPRFQRTTGV CPDCGTVGEK LPLRVRSWTC GHCGSAHDRD IAAARVIDLM GNTARSAGID ACGLAHKPEE AVS
|
| |
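The numeric fields in this record are mutually consistent, and the relationships can be checked with a short sketch. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; both assumptions agree with the values listed above. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence, ignoring the spaces that group bases in tens."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the Gene Information table for Noc_1991.
length = gene_length(2289608, 2290759)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Passing the Gene sequence field to `gc_percent` would give a value to compare against the reported GC content; the result is not asserted here because it has not been recomputed.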