Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1987 |
Symbol | |
ID | 3704871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2282999 |
End bp | 2285803 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637738463 |
Product | TPR repeat-containing protein |
Protein accession | YP_343979 |
Protein GI | 77165454 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0465352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAGCC CACTAATAAG CCCTATGATC CTAATCTGCT CACTATTATT AGCAATTGCC CTGTTAACAG GCTGCAGCGG GGATTCCAAC CTCACCCCCG AAGAGCACAT CTCACGCGCC AAGGAATATC AGGATCAGGG AAAAATAAGG GCCACCATTA TCGAATTGAA AAATGCCCTC CAGAAAACAC CAGACAATCA AGAGGCGCGC TGGCTTCTAG GTCAAACCTA CGTGAAAGCG GGTGATGGGC CATCCGCCGA AAAAGAACTA AAGCGAGCTC TCAGCCTCGG GTTAGCCTCC GAAGCTGCGG CTATTTATCT AACCCGCGCT GCCCTCTTGC AAAGAGAATT TCAAACTGCC ATTGAAACCT CGACCGATTA CCCTGCCCTT CCCGAAGATG AACAGGCAGA GCTGCTAGCC CTACGGGGAC ATGCCTATCT TGGACTACGA GAACTTGAAA AAGCAGAAAA ATCATACGAA TCCGCCCTTT CCATTAACCC TGCTGCCCCG GAGGCCGGTT TTGGAAAAGC ACGAATCGCC GCAGTGCAAA ACCGCCTGGA AGAGACCCGG CAATGGCTGG AAAAAGTCCT TCAGACTACG CCAAGTTTCG CTCCCGCCTG GAGTCTGCTA GGTGACCTAG AGCGTTATCA AGGAAACGGG GAAGCCGCTG AACAAGCCTA TGGGAAAGCC ATTGCCCACC GCTTCAATAA TGCCAGCGAT CTGCTCAACC GGGCATTGGT GCGCATTTAC TTGAAAGACT ATGAGGGCGC GGCCAGCGAT TTGGAAACGC TGAGTAAACG CGCGCGCAAT CATCCAGGAG TAACCTACGC CCAGGGATTA TTACATTTCC AGCAGCAGCA GTACGCTGAC GCTTTAACAA GCTTCCAGAA AACACTAAGC AAAAATCCAG AATACATGCC TGCAGTATTC TACGCCGGTA TCGCCTATTA CCAACAAGGA CAGTTAACGC AAGCTGGACA GCTCTTAAAT CAGTTCCTAA AGCGTTTTCC CCATTCCGAT ACTGCGGCTA AGACTCTGGC CATGATACGT CTTCGCGAGG GCAACTATAC GAGCGCCCAA GCTATCCTAG AGCCTATTAT AGCCCAAAAT CCCAATGATA CTGCCGCCCT GGATTTACTA GGAAGCGCAA TCCTAGGGCA AGGCAAGCCT GAAAAGAGCG CTGCCTACTT TCAAAAAGTC ACCGCGCAAA CGCCAGAGTC GGCGGCAGCC TACATGAAAC TAGGGCTTGG CTTCATGATG TCCGGGGAAC ATGAGCAAGG CATTGGCGCT CTGGAAAAAG CCATCGAATT AGACTCTCAA CTCCCCCAGG CCGACCGGTT AATAATCCTT GGCCACTTGC GGGCCCAGGA ATTCGATAAA GCTCTTGCAG CGGCTAAACG ACTGAGAGAG AAACAACCAG ACAGCCCCCT GCCCATAAAT CTAATTGGCG CCGCCTATCT TGGCAAAGGA GAAGAAAGCA AAGCCCAGGA GGCATTTCGC CAAGCTTTAG AAATCGCTCC AGGTGATCCA TCAGCGACTC ATAATCTTGC CATGCTGGCC ATCAAAAAGG GAAATATTGA GAAGGCCCAC GCTCTCTACC AAGAAGCGCT CAGATACCAT CCGGGTCATC TCAGAACATT GCTCAAGCTC AGTGCGCTAG AAGCACAGCA GGGCCATCCG GAGAAAGCAA AGAACTGGGT AGAGCAGGCT ATGGAGAAAA ATTCTAAAGC CCTAGAACCT CGGGTATTGC TAGCCCGTTA CTATTTGGAG CAAGGTAGGC CCGCCCGGAG TCTTGCTATC ACGCGGGAAA TCCAGGATCT TTATCCTGCT CATCCGGCAT TATTGCTCGT GGTGGGAACG GCACAACTGG AAAACAGCCA GCTACGGGAT GGGGTAAAAA CCTTTCAGAA ACTGGTGGAG GTCCAGCCCC AGTCGGCCCA AGCCCACTAC CTTCTGGCCA AAGCTTATGC CACGGTAAAT AATACGGATA AGCTCCGTAA AGAGCTCGAA CAAGCGCTTA AACTGAACCC CAACCATACT CTTTCTAAAA TCGCCATGAC TCGCCTGCTG ATGCAGGAGA ATCAACCTGA AGCGGCTAAT AAGCTATTCC AGGAATTAAA ACAGGCCTAC CCAGAACATC CGGAAGTGCT GGCCCAGGAA GGCTGGTTGG CCATGCGTCA GAACCGGCCT CAGGACGCAA TAATAGCCTT CAGGGAAGCA CTAAAACGCT CCCCCACCAG TCAAATCATC GTCAATCTTG CGCACGCCCA ACTCCAGGCG GGGAATCAAA ATGAGAGCTT GGCGACCCTA GAGGATTGGT TGAAAAAGCA TCCGGAGGAT ATGGTAGTGC AGTATAATTT GGCTAATCTC TATCTAGCAC TTAAACAAGA ACAGAAGGCA GCGTCAGCTT TTACTACAGT AGTCAAGCGA GCGCCAGATA ATGTGGTGGC CCTTAATAAC CTGGCCTGGC TGCTGCGCAA AAACGATCCG GCCAAAGCTT TAGAATATGC TGAGCGGGCT TTGGAGCTGG CTCCTAATGC ACCACCCGTC ATGGACACCT TGGGGATGTT GCTTTTAGAA AAGGGTGAAG CAAAACGCAG CCTACGCTTG CTCAGGAAAG CCTCGGACAG GGCGCCTGAG AACCTAACTA TTCGGTACCA TTTTGCCTTG GCTCTGGCGC AGAATGGAGA AAGTGCCCAG GCCCGGCAAG TGCTCGATGG GATCTTAGAT GCAAAGCAAC CCTTTGCCAA GAAAAAGGAA GCCCATGCGC TGCGTCAAAC CTTAAGCAAA TCACTAAATG ATTGA
|
Protein sequence | MPSPLISPMI LICSLLLAIA LLTGCSGDSN LTPEEHISRA KEYQDQGKIR ATIIELKNAL QKTPDNQEAR WLLGQTYVKA GDGPSAEKEL KRALSLGLAS EAAAIYLTRA ALLQREFQTA IETSTDYPAL PEDEQAELLA LRGHAYLGLR ELEKAEKSYE SALSINPAAP EAGFGKARIA AVQNRLEETR QWLEKVLQTT PSFAPAWSLL GDLERYQGNG EAAEQAYGKA IAHRFNNASD LLNRALVRIY LKDYEGAASD LETLSKRARN HPGVTYAQGL LHFQQQQYAD ALTSFQKTLS KNPEYMPAVF YAGIAYYQQG QLTQAGQLLN QFLKRFPHSD TAAKTLAMIR LREGNYTSAQ AILEPIIAQN PNDTAALDLL GSAILGQGKP EKSAAYFQKV TAQTPESAAA YMKLGLGFMM SGEHEQGIGA LEKAIELDSQ LPQADRLIIL GHLRAQEFDK ALAAAKRLRE KQPDSPLPIN LIGAAYLGKG EESKAQEAFR QALEIAPGDP SATHNLAMLA IKKGNIEKAH ALYQEALRYH PGHLRTLLKL SALEAQQGHP EKAKNWVEQA MEKNSKALEP RVLLARYYLE QGRPARSLAI TREIQDLYPA HPALLLVVGT AQLENSQLRD GVKTFQKLVE VQPQSAQAHY LLAKAYATVN NTDKLRKELE QALKLNPNHT LSKIAMTRLL MQENQPEAAN KLFQELKQAY PEHPEVLAQE GWLAMRQNRP QDAIIAFREA LKRSPTSQII VNLAHAQLQA GNQNESLATL EDWLKKHPED MVVQYNLANL YLALKQEQKA ASAFTTVVKR APDNVVALNN LAWLLRKNDP AKALEYAERA LELAPNAPPV MDTLGMLLLE KGEAKRSLRL LRKASDRAPE NLTIRYHFAL ALAQNGESAQ ARQVLDGILD AKQPFAKKKE AHALRQTLSK SLND
|
| |