Gene Noc_2240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2240 
Symbol 
ID3704913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2586716 
End bp2587936 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID637738715 
ProductPhage integrase 
Protein accessionYP_344228 
Protein GI77165703 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCG CACCACAGAG AACCAAGCTT ACCAAGACGG TCGTTGATCG CCTGCCCGCG 
CCTACGAGGG GCCAAGCTTT CTATTGGGAC AGCGCGCTAC CCTGCTTTGG CGTGCGGGTA
TCGGCGGGAG GCGTGAAGTC ATTTGTTATT CAAAAGCGCA TTCAAGGCCG GGAGAAACGA
ATCACGCTAG GCAAGTACGG CCATTTAACA CTCATGCAGG CCCGCAAGGA GGCCGCGCGG
CTGTTAGGGG AAATTGCCGT AGGACGCAAC CCATTAGCTG AGAAGGCGCA AGCAAAACTA
CGGGCCGTGA CGCTCGGCGA GGCGCTTGAG CACTATCTAA CCTCAAGGCC ACTGAAAGCG
CGCACCATTC AGGGAACGCG GCACACCATG GGAAAGTGCT TTAGTGACTG GATGAAGCGC
CCCCTGACAA GCATTACCAG GGATAAAGTC GCCGCCAGGC ATAAGCAGCT AGGCACCGCT
AGCAAGTCCC ACGCTAACTT AGCTATGCGG TATTTAAGGG CTGTCTTCAA CTTCGCCATG
GCGGACTATA CCGATAACGA AGGCCGCCCT GTGATTGCGG ATAACCCCGT CAACCGCTTG
TCCGAGGCTA GAACCTGGTT CCGGGTAGAG CGCAGGCGCA CGGTGATAAA GTCCCACGAG
TTAAAGCCCT GGATGCAGGC CGTACAGAGG CTAGAGAATG GGGCAGCCCG TGACTACTTT
ATGTTGGTAT TGCTAACGGG CCTTCGACGC ACCGAGGCGC TTAATTTACG CTGGCAGAAC
GTGGACTTAG TCGCTAACAC CCTTACAGTC CAGGACACCA AGAACCACCA GGCCCACACC
CTGCCCCTAT CCGACTACCT GACGGAGATG CTAGCGGCAC GGCTAGAGGA TACCTATAGC
GAGTATGTGT TCAGCACCTC CAGGGGACGG CTTTCCAACC TGAGAGGCCC GCTTGCTGAG
GTAAGGAGCT ATGCGGGTAT ATCGTTTTCT ATCCATGACT TAAGGCGCAC CTTCGCCACT
GTGGCGGACT CCCTGGATGT GCCAGGCTAC GCCGTTAAAG CACTCCTTAA CCATAAGGCG
GCTAATGATG TGACGGCGGG CTATATCGTG GTGGATACGG AAAGGCTACG CGCCCCCATG
CAGAAGATTA CCGACTTTAT GTTAAGGGCA GGCGGCTTAT GGGAAGGGGG CGAAGTGGTG
GAGCTTAGGC AGTACGGATG A
 
Protein sequence
MKPAPQRTKL TKTVVDRLPA PTRGQAFYWD SALPCFGVRV SAGGVKSFVI QKRIQGREKR 
ITLGKYGHLT LMQARKEAAR LLGEIAVGRN PLAEKAQAKL RAVTLGEALE HYLTSRPLKA
RTIQGTRHTM GKCFSDWMKR PLTSITRDKV AARHKQLGTA SKSHANLAMR YLRAVFNFAM
ADYTDNEGRP VIADNPVNRL SEARTWFRVE RRRTVIKSHE LKPWMQAVQR LENGAARDYF
MLVLLTGLRR TEALNLRWQN VDLVANTLTV QDTKNHQAHT LPLSDYLTEM LAARLEDTYS
EYVFSTSRGR LSNLRGPLAE VRSYAGISFS IHDLRRTFAT VADSLDVPGY AVKALLNHKA
ANDVTAGYIV VDTERLRAPM QKITDFMLRA GGLWEGGEVV ELRQYG