Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0755 |
Symbol | |
ID | 3707021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 821586 |
End bp | 823778 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637737257 |
Product | TPR repeat-containing protein |
Protein accession | YP_342798 |
Protein GI | 77164273 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.548835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATAG AAAAGCTGCT AGCGAAAGCC CGTCAATTAC AGGCCGCAAA CCGAGTTCAG GAAGGCACCC AAATCTACCG CCAAATCCTG GCCAAACACC CTAATCACTC AATTGCCCTC CTGGGCTTGG GCAATGCTGC CCTTCAAAAC AAAGATTTTA CCGCAGCTAT CCAATGGCTA GAGCGTCTGC TTGCGGTTAT CGGCCCCAAA AGACAGCTAT TAACCACCCT GAGTATGGCC CACAGTAATT GTGGCTCCCG CCTGTTCGAG AACGTCATGC TGCCCCAGGC CCATGCTCAT TTCCGGCGCG CCTTGGAACT CGATCCCCGT AACCGGCTTG CCTGGCGCAA TTTAGTGCTC GCTCAACTCC AGCAAGGAGA CAACCAAGCC GCAGTTGCCA GCGCCCGCCA GGCAAGCATC CTCGATCCGC GGGATCATGA AATCCGTTTG CTGCTAGCGC GGGCCCTCCT CGCTAACCAG CAACACCCCG CGGGACTGGG ATTGCTCGCT ATCCTAACAA ATATTCCCCT CCCTGATGAG ATTGCCCTGG GTGTGGCCGA ACAATGGCTG CTTTACCATC AACCGCAACG TGCCTGGGCG TTGCTAGAGC GGCAACAAAA TATTAGCGCA GACCCCAAGG CATTAATATC TCGCATTATG ATTCTCGCTC GCCGCCATGG GGAAAACTGG CAAGCAGCCC AATGGCTGCG CCGCTGGCTT AAGCGCCACA GCGCTCAGGA AAAACAATGG TTAGCTTTTG CCCGCGTCCT CGACCGGGCC GGCGAGGCCC GAAAAGCCAT GACCGTCTAC CAACATATTC TGACGGCCAA CCCCGATGCC TGGCAAGCAC GGCTAGGCGC GGCCCTCACC CTGCCGGTTG TCTACCATGA TCGCCAGCAC CTGGCCACCA CACGGAGCCG GTACCGAAAA GAACTGCAAG CCCTCAAGGA GTGGCAACCG GCAAGTCCAC CGTGGCTAGA AGATCTGCTC TGGTCCAACT TCTTCCTGGC TTACCAAGGC GGGAATGATG CTTCCCTACA GCGAGATTAC GGTGATTGGC TCCATCATTG GGCCAGCCAT GCCCTGGATG CCCCTAGCCC AGTTCGCTGC AAGCACTCCA GGCCGCGCCG TATCGGCCTA GTGTCAAGCG CCTTTCGCGA CTGCACGGTA GGCCATTACT TTGGTCGCTG GCCGGAGGCC CTGGGGCGAG GGGGCTTTGA AGTCATCGTT TACCAATTAG GCCCAAAGCG GGATCACCAT ACCCGGATCG TGGCCGATTC AGCCAGTAAA TTCCGCTACC TGGACGGCCG CCTTGCCTCT TGCGCGGCCC AAATAGCGGC GGATCGGCTA GATGCCCTCA TTTACCCGGA GCTGGGCATG GATGCCCGGC TGCTGGTGCT GGCCGCCCTT CGCTTAGCCC CGTTTCAGGG CTGTGCTTGG GGCCATCCTG TCACCAGCGG ACTGCCCACC ATGGACATTT ACTTCTCCTG CGCCACCATG GAGCCGCCCG AAGCTCGGAC CCATTACCGT GAGCGATTGC TGTCCCTGCC GGGACTAGGC ACTAGCTACC CAGCCCCACC TGAACCCCCT CCCGCCGACC GGAACGACCT TGGCCTGCCA GAGAAACGCA CTCTCTACCT CCTGCCCCAG TCGCCCTTTA AAATTCATCC GGATGCCGAT GCCCTGGTAG CCCAGTTGCT GGCCGAGGAT AGGCAGGGAA TGTTGGTACT ATTTACCGGT CAGGATCGCC GGGTTACGGA CAAACTGCTG ACGCGCCTGG GGGCGGCCTT AACCCAAGCG GGAGCTGACC CAGAACGCCA ACTGCTGTTG CTGCCTACTA CGTCTCGCGC GCGCTATTTG CAAATCAACC GCTGCTGCGA TCTCATGCTG GATACGCCCC ATTGGTCGGG GGGCAATACT GCCCTGGATG CCCTGGGTTC CGGGCTTCCT CTCATTGCTC TGCCCAGCAC CTACATGCGG GGGCGGCAAA GCGCAGCCAT GCTAAATTTG CTGGAACTTC CTGAGTTGGT TGCCCAGGAT GCCGGAGATT ATGTGCGTAA AGTCCTCCAG TATGGCCGGG ACAAAGCAGC CAACCAAGCC TTGCGAGTGC GGATTCTTGC CCGGCGCAAT CGTCTCTTTG ATCAGCAAGC ACCTCTTGAT GCGTTGACCG CCTTTTTTAA ATCCTTGAGC TAA
|
Protein sequence | MGIEKLLAKA RQLQAANRVQ EGTQIYRQIL AKHPNHSIAL LGLGNAALQN KDFTAAIQWL ERLLAVIGPK RQLLTTLSMA HSNCGSRLFE NVMLPQAHAH FRRALELDPR NRLAWRNLVL AQLQQGDNQA AVASARQASI LDPRDHEIRL LLARALLANQ QHPAGLGLLA ILTNIPLPDE IALGVAEQWL LYHQPQRAWA LLERQQNISA DPKALISRIM ILARRHGENW QAAQWLRRWL KRHSAQEKQW LAFARVLDRA GEARKAMTVY QHILTANPDA WQARLGAALT LPVVYHDRQH LATTRSRYRK ELQALKEWQP ASPPWLEDLL WSNFFLAYQG GNDASLQRDY GDWLHHWASH ALDAPSPVRC KHSRPRRIGL VSSAFRDCTV GHYFGRWPEA LGRGGFEVIV YQLGPKRDHH TRIVADSASK FRYLDGRLAS CAAQIAADRL DALIYPELGM DARLLVLAAL RLAPFQGCAW GHPVTSGLPT MDIYFSCATM EPPEARTHYR ERLLSLPGLG TSYPAPPEPP PADRNDLGLP EKRTLYLLPQ SPFKIHPDAD ALVAQLLAED RQGMLVLFTG QDRRVTDKLL TRLGAALTQA GADPERQLLL LPTTSRARYL QINRCCDLML DTPHWSGGNT ALDALGSGLP LIALPSTYMR GRQSAAMLNL LELPELVAQD AGDYVRKVLQ YGRDKAANQA LRVRILARRN RLFDQQAPLD ALTAFFKSLS
|
| |