Gene Information |
Locus tag | Noc_0282 |
Symbol | |
ID | 3706453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 310853 |
End bp | 312124 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637736798 |
Product | extracellular solute-binding protein |
Protein accession | YP_342342 |
Protein GI | 77163817 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
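The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, although it matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(310853, 312124)
print(length_bp)                  # 1272
print(protein_length(length_bp))  # 423
```

The strand field does not enter this calculation: a gene on the minus strand spans the same number of bases as one on the plus strand.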
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGT GGTTGAAGGT AGGTTTGCTT CACCTTGTCT CCGGGCTTAA AACCGGGATG TTGTTACTCT TTTTATTGCT TGCCGCTGCC TGCTCGGATT CTCCTGATAA CCCTGTCCCC ACCCTAAAAT GGTACGTGTT CGATGAGCCC TCCGGGGCCT TTGAGACGGC GGCAAAACGA TGCTCGGCGG ACGCCAAGGG TGTCTATCAG GTGGAAATCG CCGCCCTGCC CGCTGATGCC AGCCAACAGC GGGTACAATT GGTGCGGCGG CTAGCGGCGA AAGACACGGC TATCGACCTC ATTGGCATGG ATGTTATCTG GACCGCTGAG TTCGCCGAGG CGGGCTGGAT TTTGCCCTGG CTAGGAGAGG CGGCGGACCA AGCCAGACAA GGGCGCCTAC CCTCCACCAT TGAAAGCGCT ACCTATGACC ACCAACTTTG GGGCATCCCC TTCACCAGTA ACATTCAATT GCTTTGGTAC CGCACCGATC AGGTAGCGAA ACCGCCCCAG AGCTGGGATG AATTGATTCA AAGCGCCGAG GCCCTGGGAA TCGGCACCTT GCAAGTACAA GGGGCACGCT ATGAAGGTTT GACCGTCTTG TTTAATTCCC TGCTAGCCTC CGCTGGTGGC TCCGTGCTAG ACAAAACCGG CAAGGCGGTT TCCTTGGAAG CAACGCCCAC TGAAAAAGCT CTCCGTATTA TGAAGCGCAT CGCCACTTCC CCGGCTACTA ACTCCTCCCT TTCCATTGCC CGGGAAGATG AAACCCGGCT CGCCTTTGAA GGGGGCAGCG CCTTTATGAT CAACTATACT TATGTTTGGC CCAGCGCCCA GCAGAATGCC CCCCGGGTCG CCGCCCACAT GGGCTGGGTC CGCTGGCCAG CAGTTATTGA AGGTCAACCT AGCCGGGTCA CCCTAGGAGG CATTAACTTA AGTGTCGGCG CCTATTCGCG ATATCCCCAG CTGGCTTTTC GAGCGGCTAC CTGCATCGCT TCCCAGGAAC AGCAACGACT TGCCGCAATT AAGGGGGGGC TGCTGCCTAC CTTTGAGAAG CTCTATGCGG ATCCCTCGAT TAGAGAAGCC CTTCCCTTCG CGGACACCCT GCGTGCTACC CTTAAGAATG CGGCCCAACG GCCCTCAAGC CCCCTCTATA ATGATATTTC CTTGGCCATC AGCCGGACGT TGCACCCCAT GAAAGCCATT GATCCCCAGA AAGATATTGC TAGGTTGCGA AAAAACATCA ATCGGGCCCT TCACTCCCAG GGGTTACTAT GA
|
Protein sequence | MKQWLKVGLL HLVSGLKTGM LLLFLLLAAA CSDSPDNPVP TLKWYVFDEP SGAFETAAKR CSADAKGVYQ VEIAALPADA SQQRVQLVRR LAAKDTAIDL IGMDVIWTAE FAEAGWILPW LGEAADQARQ GRLPSTIESA TYDHQLWGIP FTSNIQLLWY RTDQVAKPPQ SWDELIQSAE ALGIGTLQVQ GARYEGLTVL FNSLLASAGG SVLDKTGKAV SLEATPTEKA LRIMKRIATS PATNSSLSIA REDETRLAFE GGSAFMINYT YVWPSAQQNA PRVAAHMGWV RWPAVIEGQP SRVTLGGINL SVGAYSRYPQ LAFRAATCIA SQEQQRLAAI KGGLLPTFEK LYADPSIREA LPFADTLRAT LKNAAQRPSS PLYNDISLAI SRTLHPMKAI DPQKDIARLR KNINRALHSQ GLL
|
| |
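The sequence fields above are printed in space-separated blocks of 10 letters. As a rough sketch of how such a field might be cleaned up before checking it against the Gene Length, GC content, and Translation table entries, the snippet below joins the blocks, computes a GC percentage, and splits the result into codons. All function names are hypothetical, and the stop-codon set is the standard TAA/TAG/TGA set used by translation table 11. The demonstration runs only on the first two and last two blocks of the gene sequence, not on the full field.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(field: str) -> str:
    """Join the space-separated 10-letter blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def codons(dna: str) -> list[str]:
    """Split a DNA string into complete codons, dropping any trailing remainder."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence: the CDS starts with ATG.
head = clean_sequence("ATGAAACAGT GGTTGAAGGT")
print(len(head))         # 20
print(codons(head)[0])   # ATG
print(gc_percent(head))  # 40.0

# Last two blocks of the gene sequence: the final codon is a stop codon.
tail = clean_sequence("GGGTTACTAT GA")
print(codons(tail)[-1] in STOP_CODONS)  # True
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `len` and `gc_percent` would allow a direct comparison with the reported length and GC content.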