Gene Information |
Locus tag | Noc_1344 |
Symbol | |
ID | 3706147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1492417 |
End bp | 1493937 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637737840 |
Product | extracellular solute-binding protein |
Protein accession | YP_343369 |
Protein GI | 77164844 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
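The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Noc_1344.
length = gene_length(1492417, 1493937)
print(length)                  # 1521
print(protein_length(length))  # 506
```

Both results agree with the Gene Length and Protein Length fields of the record.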
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.088373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGAA CAAGAAACAT AGTCCGGTGG TGGCGAACGT TGGGTTTGTT GATTGGCATA GTTTTGCTCA TAGGCTGTAG TTCTGGTTAT GAGGACAGGA ATTCTTTACG TTTTGGCCTG GCGACCTCAC CCCTTACCCT TGATCCCCGC TATGCCACGG ATGCCGCCTC AGCCCGTATT ACCCGCCTCT TGTATCAACC CCTCGTTGAT TTGGATGCCA GCGGCAAGCC CATCCCCAGT TTAGCCCATT GGCAACAACT GTCTCCGAGC CGCTACCGGT TTCGATTACA GCTTGATCGT CCTCGCTTTC ACGATGGCAG CAAGCTTAGC GCTCAGGATG TGAAAGCGAC CTATACCTCG GTACTTGATC CAGATAATGG CTCGCCTTTT TTAGCTTCCT TGGATAAAGT GCAGTCTATT GCGGTGCCCA ATCCAGAAAC CATTGACTTC GTTTTGAAAG AACCTGATTC CCTGTTTCCA GGGCGGCTTA CCCTGGGGAT AATCCCTGCT TCCCTAATAG CTGTGGGTCA CCCTTTGAAT CGCATTCCGG TAGGCAATGG TCCCTTTCGC TTTAAGGCGT GGCCGGAAGA GGGCCGATTG ATACTCCAGC GGCGACGAGA TAGACAGGGG TTTGTATTTG TTCATGTACC CGATCCCACG GTGCGGGTAT TAAAATTACT CCGTGGGGAG ATCCACATGC TGCAAAACGA TCTTCCCCAG GAGCTGGTGG CTTACCTGGA AGATAAGGAG GAAATTTCCC TGCAACGGGG TCGGGGGAAT ACGTTTACGT ATCTAGGATT TAATCTCGAT GATCCGGTAA CAGGCCAACC TCAAGTGCGT TTAGCGATCG CCCATGCTTT GGATCGTCAA AAGATCATTC GCTATGTTTT TAAAGGAGCA GCCCGGCCAG CTCAGGCCCT GCTGCCGCCC GAGCATTGGG CAGGCCACCC GGCTTTACCC CCGCACGCCT ATGATCCCCA GCGGGCCAGG AAACTGCTTA AGGAAGCTGG CTTCAATGCC CATTCTCCGG CCCGCGTTAG CTATAAGACC TCTAGCGATC CTTTCCGGGT TCGCCTGGCC ACCATTATTC AGCATCAACT GCAACAAGTG GGCATTCAGG TAGAGGTGCA AAGTTATGAT TGGGGAACCT TTTACGGCGA TATTAAGGCC GGGCGTTTCC AAATGTATAG CCTATCCTGG GTAGGCGTTA AATTGCCGGA TGCCTTTCAT TACATCTTTC ATAGCGAGTC GGCTCCACCT CAAGGGGCGA ACCGAGGCCA TTTTAACGAT CCTCAAAGTG ATTTTTTAAT TGAACGGGCG GGAAATACAG ACAATCAAAA CCAGCAAGCC AATCTCTACC GCCGGTTACA AGCACGCCTT CTGGAACAAC TGCCTTATGT GCCTCTTTGG TACGAAGATA ACGTGTTCGC AGCCCATCAA GGTATCAGTC ATTATCAACT AGCACCCGAT GGGAATTATG ATGGCTTGGT GGAAGTGCGG TTCCATCGGT ACGTGAAGTA A
|
Protein sequence | MRGTRNIVRW WRTLGLLIGI VLLIGCSSGY EDRNSLRFGL ATSPLTLDPR YATDAASARI TRLLYQPLVD LDASGKPIPS LAHWQQLSPS RYRFRLQLDR PRFHDGSKLS AQDVKATYTS VLDPDNGSPF LASLDKVQSI AVPNPETIDF VLKEPDSLFP GRLTLGIIPA SLIAVGHPLN RIPVGNGPFR FKAWPEEGRL ILQRRRDRQG FVFVHVPDPT VRVLKLLRGE IHMLQNDLPQ ELVAYLEDKE EISLQRGRGN TFTYLGFNLD DPVTGQPQVR LAIAHALDRQ KIIRYVFKGA ARPAQALLPP EHWAGHPALP PHAYDPQRAR KLLKEAGFNA HSPARVSYKT SSDPFRVRLA TIIQHQLQQV GIQVEVQSYD WGTFYGDIKA GRFQMYSLSW VGVKLPDAFH YIFHSESAPP QGANRGHFND PQSDFLIERA GNTDNQNQQA NLYRRLQARL LEQLPYVPLW YEDNVFAAHQ GISHYQLAPD GNYDGLVEVR FHRYVK
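The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA although the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-residue mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in the record.
print(translate("ATGAGAGGAA CAAGAAACAT AGTCCGGTGG"))  # MRGTRNIVRW
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input gives the value that the record reports as a rounded percentage in the GC content field.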