Gene Noc_1342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1342 
Symbol 
ID3706145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1490807 
End bp1491976 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content54% 
IMG OID637737838 
Productpeptidoglycan-binding LysM 
Protein accessionYP_343367 
Protein GI77164842 
COG category[P] Inorganic ion transport and metabolism
[S] Function unknown 
COG ID[COG0428] Predicted divalent heavy-metal cations transporter
[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.779699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGGG CGGCAACTCT CCTATCTTTG TTAAAAATGC AAGACTTTCT AATAGTTTTC 
GGCCTTGCCT TGTTGCCCGC ATTGGGGAAT TTTGTCGGCG GGCTGTGGGC GGAATTTCTT
CGAACCTCAG AACGCGCCCT CAACCGGGCG CTCCACGCCG CGGCCGGTAT CGTCCTTGCT
ATCGTCGCCA TTGAACTGAT GCCCGAGGCG CTGAAAAGTA TCTCCCCCTG GATGATTGCC
TTGGCCTTTG CCCTGGGCGG CTTCGCCTAT ATGGCCCTGG AAGCGGCGAT TGAGTATTTG
CAGAAGAAAA AAGGAAAGAA TAGCTCTGGA AGCACGGCCA TGTGGATGCT CTATGGGGCG
GTGGCTACGG ATTTGTTCAG TGATGGCCTC ATGATTGGTG CCGGTTCGGC TGTTTCACCC
AGTATGGCGC TTATTTTGGC GCTGGGACAG GTGCTGGCCG ATGTTCCGGA AGGGTATGCC
GCGATCGCCA ATTTCAAGGA TAAAAACATC CCCCGTAGAC GGCGGTTTTG GCTTTCCGCT
TCTTTCGCTT TGCCAGCGCT AACCGCTGCC ACTCTGGCTT ATTTCCTGCT TCGTGACCAG
AATGAAACCC TAAAAATGGC GGGATTAGTT TTTACGGCGG GACTGCTTAC GGTAGCCGCT
GTGGAGGACA TGGTTTCCGA GGCTCATGAA ATCGCGCAAG ATACGCGCTG GTCGGACTTT
TCCTTCATTG GTGGCTTTGT CTTATTTATC CTTGTTTCCG CCGGTTTCAA AAGCTACCTG
ATAGAAGAGC CTGAATCTGC TGTAGCGGCC AAAGCAGGGG CGGAGGCGCT ACCGGCGCTT
TCGGTTTCCG AAACAGCCAA GTCTGGGGAA AAAGAGTCTC TCGTAACGCG CCTCCAGGAA
CGTTCCCGCG AAAAAGACGC CGACATGATT ACTAGGCTTC CTCCTACGTC GGCAACCAAA
AAATCACAGG AAAAGCGTGC GTTAACGTCT AGGCTGCCGA AAAGGTTGGT TGTCCAGCAC
GGTGATACCT TGTCGCAGAT CGCGGCGCGT CTCTATGGCG ATCCTGCTCA ATGGCGACTC
CTGTATGCGG CCAATCGGGA CAGACTTGAT AATCCTGATT TACTCAGAGC AGGAATGGAG
CTTGTTGTTC CCCTTGATTC GGAAAAATAG
 
Protein sequence
MMGAATLLSL LKMQDFLIVF GLALLPALGN FVGGLWAEFL RTSERALNRA LHAAAGIVLA 
IVAIELMPEA LKSISPWMIA LAFALGGFAY MALEAAIEYL QKKKGKNSSG STAMWMLYGA
VATDLFSDGL MIGAGSAVSP SMALILALGQ VLADVPEGYA AIANFKDKNI PRRRRFWLSA
SFALPALTAA TLAYFLLRDQ NETLKMAGLV FTAGLLTVAA VEDMVSEAHE IAQDTRWSDF
SFIGGFVLFI LVSAGFKSYL IEEPESAVAA KAGAEALPAL SVSETAKSGE KESLVTRLQE
RSREKDADMI TRLPPTSATK KSQEKRALTS RLPKRLVVQH GDTLSQIAAR LYGDPAQWRL
LYAANRDRLD NPDLLRAGME LVVPLDSEK