Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2143 |
Symbol | |
ID | 3705335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2473217 |
End bp | 2474962 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637738619 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_344133 |
Protein GI | 77165608 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.38256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAT CAAGCATAAA GTTATATAAG TCTTTTTTAC TGGGCGCTGC CGCCCTTCTC CTGGGAACGG CATCGGCCCA AGGGGCCCAA TGGCGTCCCC TGTGCCCGGA AGAGGTCTCC TGCGTCAATC CTCAAATTTT TATTAATTCT AATTCCTTAG GCCGGTCGGA GTTGCTGGCC CCCGCCTTGG CCCCCCTGGT GAGAACGGAT AGCGCGTGGT TGCAGTATTC CGATCAATTC GAGCCTGATC CCCTGCAACT GCGTCCACGA GGGGTCTTTC GACTTCATCC CCCCCAAACG GCCCTGGAGA TCCTGCTGCC CCTTAAGGAT TCTCCCCCGG CAACCCCCGA CCCCGATCTG GGCTCCTCGG ACCTGGAGGA GGTCTTTGAT CCACCTAATA TTCAGCCCAC TTACCTGAAT GTGTCCGATT TTAACGCCAT GGAGGCCGAG GATGGCAGTA CCCTGGTTTT TGGCCTGTAT GGGATGCTGC GGTTATCCCC CGAGGGTGGG CTCATCGACG CCCTTAATTG GTCCGGCCGC AGCGTGCTGG GTCTGAACAA TGCCGCCTTC ACCGGCCCGC CGATTAGAGT GCAGGATTCC CTTTTTGTAG GGATTCGGGG TCCCGCCCAG CAGTTTATTT ATCACAGTGA GGATGATGGC CTGACCTGGC AGGAGGAGGT GGCGAGTAAT CGTCTTGGGG ACGATCGCTA TAACCTGCTG GCCAATCCGG AAGGCACCGG CTTGTGGGCG ATTATCTCCG AGTTTTTTGA CCGCCCGGCG GAACTGCGGG AATCCCTGGA TTTAGGCGCC ACCTGGAACC GAGTAGACAA TGGCAGTTTC CCAGCCCATA CAGTGCGGGT GGTCCATGAT CCGGGCGATC CCCAGGTGGC CTACGCCCTC TCGGCGCAAG GCTTATACCG GAGCCAGGAT CGGGGGGTAT CCTGGCACTT GACCGCCTTA CAGGAGCCGG TCCATGGGCT GGTTTTTGTA CCCCAAAAGG CGCCTTTGCC ACCGCTGCTG GTGGCGGGCA CCGACACAGG CGTTAAAATC AGTCCAGAGC CGTTCGGTAC TTGGGAAGCC TTGAGCAATG GTCTGCTGGC TATTCCCCAT ACGGTGGTCT ATACGGACGG CCTGCTGATA GGCGTCAGCG CTGCCGGCTA TTTTGTTTGC CCCCAGGCCG ATTGCTTTGG GGAGAGTCAA GCCGTGCCAG CCGAAGAGGC GCGGGGCGAA GTGACCGTGA CGGAATTTTT CAATGTCGAT TTAGGCCATT ATTTCATGAC GGCCTCCCCA GAGGACGTTG CTATCATTGA GGCCGGCGGA GCAGGGCCCG GCTGGGAACG TACCGGTCAT ACCTTTAAAG CCTGGAGTAA TTTAGGCAGT GACGTGGGGG TGTATCTCTG CCGTTTCTAT GGCTCCGTCT CCCCCGGCCC CAACAGCCAT TTCTTTACCG CCTCTCCCCA GGAGTGTGGT TTTTTGCTGG ACCTGCAAGC CCAAACGCCC CCCACCGTGC CCCGCTGGAA CTTCGAGGGC GATGCCTTTA TGGCCATCCC CGCCCAGGGC AAGGGGGACG CGCAGCATTG TCCGGAGGCA TTCGTTCCCG TCTACCGGGC CTATAACAAT GGCTTTGCCC GAGGAGAGGA GAGTAACCAC CGCTTTGTGA CCGACCGGAC CTTGCTCACG CCCTTATTAG ACCAAGGCTG GGTAGATGAA GGCATTGCGT TCTGCGTGCC ACCCCAATCC CAATAA
|
Protein sequence | MAKSSIKLYK SFLLGAAALL LGTASAQGAQ WRPLCPEEVS CVNPQIFINS NSLGRSELLA PALAPLVRTD SAWLQYSDQF EPDPLQLRPR GVFRLHPPQT ALEILLPLKD SPPATPDPDL GSSDLEEVFD PPNIQPTYLN VSDFNAMEAE DGSTLVFGLY GMLRLSPEGG LIDALNWSGR SVLGLNNAAF TGPPIRVQDS LFVGIRGPAQ QFIYHSEDDG LTWQEEVASN RLGDDRYNLL ANPEGTGLWA IISEFFDRPA ELRESLDLGA TWNRVDNGSF PAHTVRVVHD PGDPQVAYAL SAQGLYRSQD RGVSWHLTAL QEPVHGLVFV PQKAPLPPLL VAGTDTGVKI SPEPFGTWEA LSNGLLAIPH TVVYTDGLLI GVSAAGYFVC PQADCFGESQ AVPAEEARGE VTVTEFFNVD LGHYFMTASP EDVAIIEAGG AGPGWERTGH TFKAWSNLGS DVGVYLCRFY GSVSPGPNSH FFTASPQECG FLLDLQAQTP PTVPRWNFEG DAFMAIPAQG KGDAQHCPEA FVPVYRAYNN GFARGEESNH RFVTDRTLLT PLLDQGWVDE GIAFCVPPQS Q
|
| |