Gene Noc_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2198 
Symbol 
ID3705136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2541892 
End bp2542998 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content55% 
IMG OID637738674 
Producttransposase IS605 
Protein accessionYP_344188 
Protein GI77165663 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01765] transposase, putative, N-terminal domain
[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTGG TAGCTAACCT CAAACTGACG CCAACTCCAG CGCAAGAACG CGAGTTGCGT 
TTGACGCTGG CGCGCTGTAA TGAAGCGTGC AACTGGCTCT CTGAGCGCGC TTGGGAAACG
AAGACATTCC GGCAATACGA TTTGCATAAG CTCTGCTATC AAGCCGTGCG CGCCAAATTT
GCATTGTCCG CTCAGGTGGC GGTGCGCTGC ATTGCTAAAG TGGCGCACGC CTACAAGCTT
GACCAGAAAA CCCAGCGCGC CTTTCGTAAG CATGCGGCGC ATCCCTATGA TGACCGTATC
CTGCGCTTTG TCTGTGATGA GAAAGTCTCC CTGTGGCTTT TGTCGGGTCG AGAAAAGATT
GGCTATGTTG GTAGCGATCA CCAGCGCCAA TTGCTTGAAC ACCGCAAGGG CGAAGTCGAT
TTGATGTTCG TGCGCGGCCA ATGGTATTTG GCCGCTGTAT GTGACTTTGA CGACCCCAAA
TTGCTGACCC CTGAAGGCAT GTTGGGTGTG GATTTCGGTA TTGTCAATAT CGCCACTGAC
AGCCTGGGTG AGAGGTACTG CGGGGCTAAA GTCCAAGCCT ACCGTGAGCG TTACGCCAAA
CGACGCGCCA CCTTGCAGCG CCTCGGCACA CGGGCCGCTA AACGCTGCCT TCGCCACATA
AGCGGCAGGC AGAAACGGTT TCAAAAATAC GAGAACCATT GTATCTCCAA ACGCATCGTC
TCGACTGCGG AACGCTCCCG TCTCGGCATT GGACTTGAAA ATCTCAAGCA TATCCGGGCA
CGGGTTAAGG CCAACAAAGC GCAGAGGAAA CGCTTGCATA ACTGGGGCTT CGCTCAGCTT
CGTGCCTTTA TCGAGTATAA GGCTAAACGT GCTGGCGTGC CGGTGGTGAT AGTCGACCCA
CGCAACACTA GCCGCGAGTG CCCGGCCTGT GGCCGTATCG ACAAAGCTAA CCGGCCAACC
CAGTCTGAGT TTCGGTGTGT GGAATGCGGG CACAGTAATC ACGCAGACCA TAACGCCGCT
GGCAATATCG CCAGAAGGGC TGCTGTAACT CAGCCTATGT TCGCGCATAA GTGTGCTCCT
TGTGCAGTGG AAAGCCGCCA GCTTTAG
 
Protein sequence
MKLVANLKLT PTPAQERELR LTLARCNEAC NWLSERAWET KTFRQYDLHK LCYQAVRAKF 
ALSAQVAVRC IAKVAHAYKL DQKTQRAFRK HAAHPYDDRI LRFVCDEKVS LWLLSGREKI
GYVGSDHQRQ LLEHRKGEVD LMFVRGQWYL AAVCDFDDPK LLTPEGMLGV DFGIVNIATD
SLGERYCGAK VQAYRERYAK RRATLQRLGT RAAKRCLRHI SGRQKRFQKY ENHCISKRIV
STAERSRLGI GLENLKHIRA RVKANKAQRK RLHNWGFAQL RAFIEYKAKR AGVPVVIVDP
RNTSRECPAC GRIDKANRPT QSEFRCVECG HSNHADHNAA GNIARRAAVT QPMFAHKCAP
CAVESRQL