Gene Information |
Locus tag | Noc_1851 |
Symbol | |
ID | 3705115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2102755 |
End bp | 2104470 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637738331 |
Product | TPR repeat-containing protein |
Protein accession | YP_343848 |
Protein GI | 77165323 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATC TAGGGTTAAA AATAGGGTTA TTGAGTCTCT TTCTATCATT ATTGGCAGGA TGCGAACAGC GGGTACCCGT CGTTGCCAGC GAAGGCCAGC AGAGCAGCTT TGCCGCTGAT TCGGCCACGA CGGCGAGCGA AGATGCGCCA CCGGGAGATA TTGATCCCCT GTTGGATAAT CTGGGCGATC ATCATCATCC CGTTACTACC TCTTCTTCCC TGGCTCAGCG TTATTTCGAT CAGGGCTTAA CCCTTGCGTT TGCCTTCAAT CATGCCGAGG CTATCCGCTC CTTTAAGGAC GCCGCCACAA TCGATCCGGA TTGCGCTATG TGCTATTGGG GGGTTGCCCT TGCGCTCGGA CCTAATATTA ATGCGCCCAT GGAGGCGGCG GCTGTCCCGC AGGCTTATGA GGCCGTTCAG AAAGCGCTGG CGCTGGCGCC TAAGGCCAAT AAAGCGGAGC AAGCCTATAT CCAAGCGCTG GCGATACGTT ATGGGCCTAC CTCTGGAGCT GATCGGGAAG GACTGGATCG GGCCTATGCC GATGCCATGA GGGAGCTGTC GCGCCGTTAC CCCGATGACT TGGATGGGGC AGTGATTTTT GCCGAGGCCC TGATGAATCT CACGCCATGG GAGTACTGGA CTCCCGCAGG GGAGCCGACG GCCCATACCC AGGAAATCAT AGCCACCCTG GAGTCGGTAT TAGAGCGCGA CCCCAATCAT ATTGGGGCTA ATCATTATTA TATTCATGCT GTCGAGGCAT CCCCGGCGCC GGAGCGGGCC CTTCCTAGCG CCAAGCGGTT AGGGCAGCTA GCGCCGGGCG CTGGTCATCT GGTTCATATG CCTGCCCATA TTTACTGGCG GGTGGGGGAT TACCATGCAG CGGTGACCGC CAATGAACAT GCTATCCATA CGGATGAAGA ATATCTCCCC GACCCAGATG CCGAGGGGTT ATACCGGCTC GGTTATTACC CCCATAACAT CCATTTCCTA TTTGCCGCGG CGCAGATGGA AGGCAACAGC CAGCTGGCCT TGGAAGCGGC CCGCAAACTA GTGGCCAGTA TTCCCGAAGA ATCCTACTCG ACCTTACCTC AATTGGAGGA ATTTAGGCCC ATGCCTCTCT ACGCCTTGGT GCGGTTCGGG AAATGGGATG AAATTCTCCG TGAACCCAAG CCCGGTGCCT TCTTTCGATA CACGCGGGGA ATCTGGCATT GGGCCCGGGG CATGGCGCTC ACGCGTCTGG GTCAGCTTGA TTCCGCAGCC CAGGAATATG AACAATTAAC CAAAATCGGG CAGTCCCAGG CCATGGCTCA ACTAGTTTTT TGGTCTGCTT CTTCCGGCTC AACCTTGCTA GAGATTGCCG CCCATATTCT CGCCGGAGAA TTAGCCGGCG CCCGGGGCCA GACCGAAGCA ATGATCGCCC CCCTCAGGGA AGCGGTGGGC ATTCAGGATA ATCTGCGGTA TATCGAACCG CCGGCTTGGT ATTACCCGGT GCGCCATAAT CTGGGCGCGG CGTTGCTGAA GGCGGATCGG GCCGTAGAAG CCGAAGCCGT TTACCGGAAA GATTTAAAGC AATATCCCCA AAATGGCTGG TCGCTCTTTG GCTTGGCCCA AAGCCTCCGT GAGCAAGGCC AAACCGAAGC GGCGGCAACG GTGGAAAAGC GCTTTGAGGA GGCTTGGCAG CACGCCGATG TGGACTTGAG GGCTTCCCGC TTTTGA
|
Protein sequence | MSYLGLKIGL LSLFLSLLAG CEQRVPVVAS EGQQSSFAAD SATTASEDAP PGDIDPLLDN LGDHHHPVTT SSSLAQRYFD QGLTLAFAFN HAEAIRSFKD AATIDPDCAM CYWGVALALG PNINAPMEAA AVPQAYEAVQ KALALAPKAN KAEQAYIQAL AIRYGPTSGA DREGLDRAYA DAMRELSRRY PDDLDGAVIF AEALMNLTPW EYWTPAGEPT AHTQEIIATL ESVLERDPNH IGANHYYIHA VEASPAPERA LPSAKRLGQL APGAGHLVHM PAHIYWRVGD YHAAVTANEH AIHTDEEYLP DPDAEGLYRL GYYPHNIHFL FAAAQMEGNS QLALEAARKL VASIPEESYS TLPQLEEFRP MPLYALVRFG KWDEILREPK PGAFFRYTRG IWHWARGMAL TRLGQLDSAA QEYEQLTKIG QSQAMAQLVF WSASSGSTLL EIAAHILAGE LAGARGQTEA MIAPLREAVG IQDNLRYIEP PAWYYPVRHN LGAALLKADR AVEAEAVYRK DLKQYPQNGW SLFGLAQSLR EQGQTEAAAT VEKRFEEAWQ HADVDLRASR F
|
| |
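The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines of Python. This is a minimal sketch, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that Gene Length counts the terminal stop codon (the gene sequence shown ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(2102755, 2104470)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content row; the blocked, space-separated layout used in the record is accepted as-is.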