Gene Noc_2768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2768 
Symbol 
ID3705306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3140259 
End bp3141437 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content57% 
IMG OID637739244 
Producttransposase 
Protein accessionYP_344745 
Protein GI77166220 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000132039 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGC ACGATTCTAC CACTGTTCAG CGGTCCTATA CATTCCGCTT CTACCCCACG 
AGTGTCCAGC GCCAGCAACT GGCTATGGAA TTCGGCCATG CCCGGTGGGT ATGGAACACC
TGTCTGACCT GGCGGGGCCG TCAATATAGG GTACATGACA AGCGCGTGAC TGGCGTTGAT
TTTAGCCGCC AGCTCACGTT CCTCAAAGGG TTAGGCCCGT ACGCTTGGCT CAAGGAGGCC
AGCGCCACCT GCCTGATCCA AAAGCTCAGG GACCAGGACA CGGCCTTCAG GCATTTTTTT
GCGGGGCGAG CGAAGTATCC CCGTTTCAAA AAAAGAACGC ATACCCAAAG CATCCGCTAT
CAACTCGATC AACGTCAGGT AGCAGGCATG TATCGAGCGG GCGAGTTTCT TAAATTGCCC
AAACTCGGGG CACTCAAGCT CAAGTGGTCC CGCAAGCCCC AGGGCATCCC CAAGATGGTG
ACGGTCACCC AGGATTGCGT CGGCCGTTAT TGTGTTTCGT TCATGTGCGA GGAAACCCTC
CAACCCCTAC CGCGAAAGCC AAACGGTATC GGTGTTGATT TAGGGGTGTG CGATGTGGTG
GTGACCTCCG AGGGCTGGAA GTCCGGTAAT CCTCGGCACC TACGCACGCA TACCCGGCAA
CTCAGAAAAA CCCAGCGCAG GCTATCCCGC AAGCGTAAGA GCAGTGTTCG TTGGCATCGT
CAGCGGATAC GGGTTGCTAA AGCTCATGCC AGGGTGAGCA ATACCCGCCA GGATTGGCTG
CACAAGCTCA CCACGGCGCT GATTCGCCAG GCGGGGTTCA TTGCCATGGA AACGCTCAAC
GTCAGGGGCA TGATGGCCAA TAGACGGCTA TCCAGGGCGC TGGGCGATGT GGGCATGCAC
GAGCTCAAGC GGCAACTGGC CTACAAAGCC CAGTGGTATG GCCGGGCGTT TAGGCAAGTG
GATAGGTGGG CGCCGACCAG CAAAGCCTGT AGCGAATGCG CCGCGGTACA AGAAACGATG
CCGCTCAATA TTCGCGAGTG GACTTGCCCG GACTGTAAGT CAGTCCATGA CCGCGATATT
AACGCGGCCA GAAATATTTT AAGGTTAGCT ACGGTTGGGA GGACCGGAAG TGATGCGCGT
GGAGGGGTAC ACAAACCGGA GGTGGCTTAT GGCTGCTGA
 
Protein sequence
MTEHDSTTVQ RSYTFRFYPT SVQRQQLAME FGHARWVWNT CLTWRGRQYR VHDKRVTGVD 
FSRQLTFLKG LGPYAWLKEA SATCLIQKLR DQDTAFRHFF AGRAKYPRFK KRTHTQSIRY
QLDQRQVAGM YRAGEFLKLP KLGALKLKWS RKPQGIPKMV TVTQDCVGRY CVSFMCEETL
QPLPRKPNGI GVDLGVCDVV VTSEGWKSGN PRHLRTHTRQ LRKTQRRLSR KRKSSVRWHR
QRIRVAKAHA RVSNTRQDWL HKLTTALIRQ AGFIAMETLN VRGMMANRRL SRALGDVGMH
ELKRQLAYKA QWYGRAFRQV DRWAPTSKAC SECAAVQETM PLNIREWTCP DCKSVHDRDI
NAARNILRLA TVGRTGSDAR GGVHKPEVAY GC