Gene Noc_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1991 
Symbol 
ID3704875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2289608 
End bp2290759 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content56% 
IMG OID637738467 
Producttransposase 
Protein accessionYP_343983 
Protein GI77165458 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0166547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCGT ACCAATTAAG ACTCTACCCC ACACTCCGAC AGCGGCGGCA GCTAGAGGAA 
GCATTTAGCG CCTGCCGGTA TGTGTGGAAT TGGGCGTTAG ATAGACGCAC CAGGGCCTAC
AAGGAAAAAG GAGAGTCCCT GAATGCCATT GCGCTTTCGC GGGCGTTGAC GGCGCTCAAA
AAAGAGAAGG TTTTTCTTAA GGCCGCCAGT GCGACGGCCC TTACGTATGT CCTAAAAAGC
CAGGATGAGG CTTTCCAGAA GTTTTTCAAC AAGCAGGCCC GCTACCCGAA GTTTAAACGG
CGTGGGCGGG TGCACTCCTG TACCTTCCAG CTCGACAAAC GCCGGGGCGA GAAGGTGTTT
ATGCCAGGCC AATTATTGCG CCTGCCCAAG CTCGGCCCGG TGCGCGTGGT CTGGTCCTAC
CAGGATATCC CCGTATTCCC CAACAGCGCC ACGGTCAGCT GCAATGCCTG TGGGCAATGG
TTTGTCTCGC TCCAGTGTGA CTGTATCGAC GTGATACACC CGCCCGCCAC GGATAAAACC
ATTGGGCTCG ATTTAGGGCT ATCGACCCTG ATAGCCATGA GCGATGGCAG AAAAGAGAAA
CCCAGAAGAT TTTTAAAGAA CGCCTTACGC CGGTTGAGGT TTGCCCAGCG CCGTTTATCG
AAGACGGCAA AAGGTGGCAG TAACCGGCGT AAGCAAAGGA GCCGCGTAGC TCGACTCCAC
CAAAGAATAG CCAGCAAGAG GGCGAACTTT CTGCACGGAC TGAGTACTTC GATCGTACGC
GAAAACCAAG CCATAGCGAT TGAGGACCTG AACGTGCGTG GCGTGATGGC CAACGGAAAG
CTAGCCCGAT CGGTTGGGGA CTGCGGTTGG TACGAGTTAC GACGGCAGCT TACTTACAAA
GCGAAGTGGT ACGGACGGCA ACTTAATGTG GTGCCGCGAT TCCAGCGTAC CACGGGGGTT
TGTCCTGATT GCGGGACGGT AGGGGAAAAG CTGCCGCTGA GGGTGCGGTC CTGGACGTGC
GGGCACTGTG GAAGCGCGCA CGATCGGGAT ATTGCCGCCG CTCGGGTGAT TGATTTAATG
GGTAATACCG CGAGGAGCGC GGGAATTGAT GCCTGTGGAC TGGCGCACAA ACCGGAGGAG
GCTGTTAGTT AG
 
Protein sequence
MKAYQLRLYP TLRQRRQLEE AFSACRYVWN WALDRRTRAY KEKGESLNAI ALSRALTALK 
KEKVFLKAAS ATALTYVLKS QDEAFQKFFN KQARYPKFKR RGRVHSCTFQ LDKRRGEKVF
MPGQLLRLPK LGPVRVVWSY QDIPVFPNSA TVSCNACGQW FVSLQCDCID VIHPPATDKT
IGLDLGLSTL IAMSDGRKEK PRRFLKNALR RLRFAQRRLS KTAKGGSNRR KQRSRVARLH
QRIASKRANF LHGLSTSIVR ENQAIAIEDL NVRGVMANGK LARSVGDCGW YELRRQLTYK
AKWYGRQLNV VPRFQRTTGV CPDCGTVGEK LPLRVRSWTC GHCGSAHDRD IAAARVIDLM
GNTARSAGID ACGLAHKPEE AVS