Gene Noc_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2221 
Symbol 
ID3705101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2564885 
End bp2566918 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content57% 
IMG OID637738697 
Productpeptidase S15 
Protein accessionYP_344211 
Protein GI77165686 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTCA TTACTTCATT CCCCCGGCGG GTGCGCGAGA TCGAAAATTG TTGGATTTCC 
ATGTCTGATG GCTGCCGTCT AGCGGCCCGA ATCTGGCTAC CCGAGGATGC TACGCAATCC
CCCGTACCGG CCATTTTTGA GTATATCCCC TACCGCAAGC GGGATTTCAC CCGTCCCCGC
GACGAACCCA TGCATCACTA CTTCGCTGGT CACGGTTATG CCGCCGTACG GGTAGATGTT
CGCGGTTCCG GGGACTCCGA TGGTCTGCTC CTGGACGAAT ACCTCCAGCA AGAACAAGAT
GACGCCATAG AGGTTATCCG CTGGATCGCC TCCCAGCCTT GGTGTTCCGG CGCTATCGGG
ATGATGGGCA TTTCCTGGGG GGGATTCAAC TCCCTCCAGG TAGCGGCCCT GCAGCCCCCG
GCCCTTAAGG CAATCATCAC CCTCTGCTCC ACGGATGATC GCTATGCCGA TGATGCCCAT
TACATGGGCG GCTGCTTGCT CAACGAAAAC CTGACCTGGG GCTCGGTCTT ACTAACCTTT
AATGCTTATC CCCCCGATCC GGAACTGGTG GGCGAGCGCT GGCGGGAAAT GTGGATGGAG
CGGTTGCAGC ATGCCGTTTT ATTTCCCGAA GTATGGCTGC GCCACCCCCG GCGCGATAGC
TACTGGCGGC ATGGCTCGGT GTGCGAGGAC TATAGCCGTA TCCGCTGCCC CGTATACGCC
ATTGGCGGCT GGGCCGACGC CTACTCCAAT GCCATTCCCC GGCTCCTAGA AGGGCTGTCC
GTGCCTCGCA AGGGATTAAT CGGTCCCTGG ACCCATAGTT TTCCCCATGA GAGCGCGCCT
GGACCCGCCA TTGGCTTTTT ACAGGAAGCG CTACGCTGGT GGGATCACTG GCTCAAAGGA
ATCGATCGGG GAATTATGGA AGAACCCATG TATCGGGTGT GGATGCAGGA AAGCCTGCCG
CCACAACCCT TTTACGAAGA ACGCCCCGGC CGTTGGGTGG CGGAACGCTG TTGGCCTTCT
CCACGAATTA GGCCCTTGCG GCTGATATTA AACCCTAACC GCCTGGAGCA GGAGGCCACC
ACCGAAACCA AACTGACGTT CCAGTCCCCG CAGACAACGG GTCTGGCGGC CGGCGACTGG
TGCGGCTTTG GCGCGGATGG GGAAATGCCT ACTGACCAGC GGGAAGATGA TGGCAAATCC
CTAACCTTTG ATTCCGTCCC ATTAGACCAG CACCTGGAAA TTCTGGGGGC ACCCGTAGCC
ACCCTGGAAC TTGCCTTTGA TCGTCCTTGT GCTCTCATCG CCGTACGTCT GAATGACGTT
GCGCCCAATG GGGCCTCAAG CCGGGTGAGC TACGGTCTAC TCAACCTCAC CCACCATAAT
AGCCATGAAT TCCCTGAACC TTTAAAACCA GGTCGGCGCT ATACCGTGCG GGTTCAGCTC
AATGACATCG CCCATGCCTT CCCTCCGGGC CATACCCTCC GACTGGCAAT CTCCACCAGC
TACTGGCCGG TGGCATGGCC TTCTCCAGAA CCCGTTCATT TAACTCTGTT CACGGGCAAA
AGCTATCTGG ACTTACCTGT GCGCTCCCCC GATCCCCAAG ACCAATCGCT CCGCCCTTTT
GAACAACCAG AAAGAGCACC CGCCCCCGCG CATATGACCT TGCGGCCAGC AAGGTTCCAG
CGCACTATTG AACGTAACCT TTCCACTAAT GAAACCTTGT ATACCATTTT CAGCGATGGC
GGCGATTTCG ATGGAGCGGC AGTGGCTCAT CTCCATGCCA TCGACTTAGA CCTTGGCCAC
ACGATTTTAA AACGCTTTCG TATCGGCGAA ACTGATCCAC TCTCGGCTCA GGCCGAAAAC
GAGCAGAATG CCCTGCTCCG CCGCGGCGAC TGGGAAATTC GGATTAAGGC CCGAACCCGT
CTGTCCTCAA ACTGGAATAG CTTTCACCTC CACGCCGATC TGGAGGCTTA TGAAGGCGAG
ACTTTGGTTT TCTCCCGCAG CTGGGAGGAG ACTATCCCCC GTGATTTAGT CTAA
 
Protein sequence
MKVITSFPRR VREIENCWIS MSDGCRLAAR IWLPEDATQS PVPAIFEYIP YRKRDFTRPR 
DEPMHHYFAG HGYAAVRVDV RGSGDSDGLL LDEYLQQEQD DAIEVIRWIA SQPWCSGAIG
MMGISWGGFN SLQVAALQPP ALKAIITLCS TDDRYADDAH YMGGCLLNEN LTWGSVLLTF
NAYPPDPELV GERWREMWME RLQHAVLFPE VWLRHPRRDS YWRHGSVCED YSRIRCPVYA
IGGWADAYSN AIPRLLEGLS VPRKGLIGPW THSFPHESAP GPAIGFLQEA LRWWDHWLKG
IDRGIMEEPM YRVWMQESLP PQPFYEERPG RWVAERCWPS PRIRPLRLIL NPNRLEQEAT
TETKLTFQSP QTTGLAAGDW CGFGADGEMP TDQREDDGKS LTFDSVPLDQ HLEILGAPVA
TLELAFDRPC ALIAVRLNDV APNGASSRVS YGLLNLTHHN SHEFPEPLKP GRRYTVRVQL
NDIAHAFPPG HTLRLAISTS YWPVAWPSPE PVHLTLFTGK SYLDLPVRSP DPQDQSLRPF
EQPERAPAPA HMTLRPARFQ RTIERNLSTN ETLYTIFSDG GDFDGAAVAH LHAIDLDLGH
TILKRFRIGE TDPLSAQAEN EQNALLRRGD WEIRIKARTR LSSNWNSFHL HADLEAYEGE
TLVFSRSWEE TIPRDLV