Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1898 |
Symbol | |
ID | 3705491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2166124 |
End bp | 2167851 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637738376 |
Product | hypothetical protein |
Protein accession | YP_343893 |
Protein GI | 77165368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.487361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA AATGTATATT TTGGAAAAAC GTAACATTCG CGCTAACTAT ATTAACACCG CTGCCTTTAC TGGCAGAAGC AACTTCAGCG CCGGTAGCTA TAGATCAGGG TGCTGAGTGG ACAGCTTCTG CTCAAAAAGA TTTTTATAGC CGTGATCAAG GCTCGCGGAT CATGCCGCTT CGCTGGATGG CTGCGTTGAA ACAGCCAAAT GGCGAACCTT TCATGGCGGC GAGTCTTAGT CGGTATGGTT ATCTGCCGAA CGAGGACAGC AACCCCCCCG GCCTGCCGGT GGGCTTTACC GTCGCCAGCG GTAGCGATGG CCAATATATC GGCATGACCT GTGCTGCATG TCACACACGG CAGATCGAGG TAGCAGGGAC TTTTTATCGG ATCGATGGTG GCCCGGCTAT TGCTGATTTC CAGAGTTTCC TGGCCGATCT CGATACTGCG GTAAATACTA TCCTTACCGA TCAACAAGCC TTTAAGAATT TCGCCCATGC GGTACTTGGC CCGTCGCCAA CGACAAGCGA AGAGAATAAG CTGCATGAGG CAGTGCAAAC TTGGTATTTG CCTTATCACA CGCTGATGGA AGGTGCGTTG CCACCCTCGC CCTGGGGACC AGCGCGTCTC GATGCCGTAT CCATGATTTT TAACCGACTT ACCGGCCTGG ATATTGGCCC CCCTCCTACT TACATGATTC CAGAAAATAT TAAGCCTGCC ACGGCACCGG TACGATATCC TTTTCTTTGG AACGCGGCAA TCCAGGATAA GACACAGTGG CCTGGTTTCG CTGACAATGG CAACAATATT CTGGGGCTCG CGCGTAATCT TGGAGAAGTC TACGGAGTCT TCGGCGTTTT TCATCCTAAA AAGGATAAGT GGCGACTGCT TGGGATTAAT TACCTAGCCA ATAATTCAGC TAATTTCCAA GGCTTAAATG CATTGGAGAA TCTGGTGCGA AAAATTGGCC CGCCGAAATG GCCATGGGAA GTAGATCAAG CTCTTGCTAG CAAGGGCAAG GAAGTTTTTG AGCGTAAGGC TGAACAGGGC GGTTGTATTG GCTGTCATGG GATCAAACCT GGGGAAACCC GCTTCTTGAA CCAAAAAACC TGGGCCACGC CGATTCAAGA TGTCGGTACG GACTCCAAAG AATATGAAAT CCTTGGCTGG ACTGTTAAGA CCGGCGTGCT TGAAGGCGCG AAAATTCCTT TCCTTGCCGA ACCGCTTAAA CCTGTTGACA CAGCCTTCAA TGTACTGGGA ACATCCGTCA TCGGCTCTAT CCTCCAGCAT TACGTTCCAG TTTTGATGAA GTCAGAAGAA CATGCTAAGA CCGAGGGTAA GCGCCCACTG TTCACACCGG AGACCGAAGA TCTCAAAGGC GCGTTTAGAA TGCCAACGTT GGCTACCGCT ACGCCTACCT ATGCTTACGA ATCGCGGGTA CTTCAAGGAA TATGGGCAGC CGCTCCATAC CTCCACAATG GATCGGTGCC AACACTAGCT GAGTTACTAA AACCAGCAGC CGAACGGGTT CGTTCATTCA AAGTAGGCCC AGCTTATGAT CTGGTTGATA TCGGACTTGC TGTCGAGCAA ACCCAGTTTG ACTATACTTT AGAGACTACC GATTGCAGTG ATCGCAACTC AGGAAATAGT CGCTGTGGCC ATGAATTTGG TACCCAACTT TCAGCGGACG AGAAAAAGGC GCTGCTTGAA TACCTTAAAA TTCTTTAA
|
Protein sequence | MKIKCIFWKN VTFALTILTP LPLLAEATSA PVAIDQGAEW TASAQKDFYS RDQGSRIMPL RWMAALKQPN GEPFMAASLS RYGYLPNEDS NPPGLPVGFT VASGSDGQYI GMTCAACHTR QIEVAGTFYR IDGGPAIADF QSFLADLDTA VNTILTDQQA FKNFAHAVLG PSPTTSEENK LHEAVQTWYL PYHTLMEGAL PPSPWGPARL DAVSMIFNRL TGLDIGPPPT YMIPENIKPA TAPVRYPFLW NAAIQDKTQW PGFADNGNNI LGLARNLGEV YGVFGVFHPK KDKWRLLGIN YLANNSANFQ GLNALENLVR KIGPPKWPWE VDQALASKGK EVFERKAEQG GCIGCHGIKP GETRFLNQKT WATPIQDVGT DSKEYEILGW TVKTGVLEGA KIPFLAEPLK PVDTAFNVLG TSVIGSILQH YVPVLMKSEE HAKTEGKRPL FTPETEDLKG AFRMPTLATA TPTYAYESRV LQGIWAAAPY LHNGSVPTLA ELLKPAAERV RSFKVGPAYD LVDIGLAVEQ TQFDYTLETT DCSDRNSGNS RCGHEFGTQL SADEKKALLE YLKIL
|
| |