Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oter_4618 |
Symbol | |
ID | 6207951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 5932499 |
End bp | 5935579 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641694286 |
Product | NHL repeat-containing protein |
Protein accession | YP_001821489 |
Protein GI | 182416423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCG CGGGGTTGGC AACGGTCACG CCCTGTCCGG TCGCGTCCGG ATCCGAGAGC GTGGAATCTT ACGTGTTCAC CCGCTACGCG GGGAATCCGA ATGATACGCC CTATGGTAGC ACGGACGGAT TCGGCGATTC GGCGCGTTTC GCCGCTCCGC GAAGCATCGC GGTCGATGCA TCGGGCACTC TCTACGTCGC GGATGCGGCA AGCAGCGTGA TTCGGAAGAT CACGGCGGAA GGTATGGTGA CGACATTTGT CGGGACGGCC GGGCAGCGCG GGAGCGCTGA TGGCATCGGC GCCGCCGCGC GATTCCAAGG AATTGATGGC CTGGCGATCG ATGCCCGCGG AAATCTGTAC GCCGTCGACT TCACCGACCA TACCGTGCGG AAGATAACGC CGGAAGGAGT GGTCACGACG CTGGCAGGTT CGGCTGGAGA CCATGGCACC CAAGTGGGAC ACGGCGGGGA GGCCCGTTTT GATTCTCCCA TGGCAGTCGC GGTGGATCGA TGGGACAATC TCTACGTGGG ACAGATGGGT GACGGCGCGA TCCGCAAGGT TTCGCCCGAT GGAAATGTCA CGATCCTGGC AGGCGCAGGC AAGGCTGGCA GCGCCGACGG AGACAGCGCC AGCGCGCGAT TCAGTGGCAG CGACGGGCTG GCCGTGGATG GGACCGGGAA CGTCTACGTG GCGGACCTGT TCAATCACAC GATTCGCAAG ATTACGCCCG ACGGAGTGGT GACGACCCTG GCGGGGGTCG CCCGCGAGAG CGGCTTTGCG GACGGAGCCG GTGCGGCGGC TAGGTTTTAC TATCCGCGGG AACTGTCGAT CGATGCGTAC GGAAACATCC TGGTGGCGGA CGAGGGGAAC TGCGCCATCC GTAAGGTCAG CCCGTCAGGG GTTGTCTCGA CGGTGGCGGG GAAGACCGGG CTGAGCGGCA GCGATGACGG AGTCGATGCA GCTCGTTTCT CCTTGCCGCG GGGTGTGGCC GTGAGCCGGA CGGGCGATAT CTATGTTGCC GATTCGGGTA ATTCGACCGT GCGGCGGATC GCGGTGGGCG GCGCGGTGAC GACCTTCGCG GGCCGGCCCG GGGGCCCCGG TTATGCGAAC GGGAGCAGTG AGACCGCGCA GTTCTATTTT CCGACCGGAA TCGCGATCGA TCAGAACCGG AACGTCTTCG TCGCGGATTC CTACAACAAC GTGATAAGGA AGATCACGCC TGGCGGCGTG GTGACAACCG TGGCCGGGTT GGGCGGCGTG TTCGGTAGTG CCGAGGGATC CGGAGCGGCC GCTCGTTTCG GGGTTCCCGC CGCGGTCGCG ATCGATGCGG CCGCCAATCT CTACGTTGCA AATCGCCAGA CTCATGTGAT TGCCAAGATC GCGCCTGATG GCGCGGTGAC CTTCTTCGCC GGCAGTCCGG GACTGTCAGG TAGCACGGAT GGCAATGCGC GGACGGAAGC GCGCTTCAAC GGCCCGACCG GGATTGCCGT CGGTCCGTCC GGAACGATCT ACGTTGCGGA CTTCGACAAT CATACGATCC GGCAGATTTC ACCGGCGGGA ATGGTTTCGA CGCTCGCGGG AGCAGCCGGC CAGCCAGGGA CGGCTGATGG TACCGGATCA GCGGCGCGTT TCTACGCACC GGCGGCGGTC ACCGTGGACC GCGCCGGCAT GATTTACGTG GCGGATTCCT GGTCGAGCGC GGTGCGGAAA ATCACCCCGG ATGGCGTGGT CACTACGGTC GTTCGCCAGC CGTACGACGG CGAGCCAGAG CGACTCTACC TTCCGTTCGG CATTGCTGCG GGTCACGATG GAAGCCTTTA CATTGCGGAT ACGGGAAACA GCACAATTCG CCAGATCAGG CCCGACGGCT CGATGGTCAC GATAGGCGGC GGCATGCGGC AGGAGGGCAA GCAAGATGGT CGGGGCGGCG AGGCCCGGTT TCTCAATCCC TACGGGGTGG CGGTGGACGC CGCCGGCCAT CTTTACGTGG CGGACTCGGG CAACAACCTC GTTCGGAAAG GCGTGAAGGT GGCGGCGGGC AAACCGGTGT TCACCGCGAG TCCGATGAAT GCAACCACGC TCGTGGGTCA GTCGGTTCAG TTTGCTTCGT CGGTTTCGCA AAGCGGGGGC GCGATCCTTC AGTGGCGCTG CAACGGACGC GATCTCGTGG GTGAAACTGC TGCGGTGCTG GCGGTGGCCA ATGTGGGCCT GAACGATGTC GGCCTCTATT CCATCACCGC AACACAAGGG CTGGAGATCA CAAGCACCGA AGGGGCGGTG CTCGGAATCG CTTCGTCGTC GAAGGTGACG GGCGCGGGGA GCGAGTTGCA GGCCGACATC CGACATCCGA ATGGGAACGT GTTTGACCAA GTGCTGCTGA CAGGGGAGGC GGAAGCGATC ACCGCTGACT ACGCGCTCAA TCAGATCACC CGGACCTCGT TCGTCGATGT GGACGGCGAC ATCGTGCAAG TGGAGTTCAG CGGACCGGGG ACGCTGTCGC TGGTGCTTGA TGCGTCATCG GGGCCCGCTC GGGCAGAGAA GTATAATCAG GACAGCGTGA GCTACATGAA AGGCCACGCG GGGATCGTGA TCGCCGGCGC GACGGAGCAG ACGAACGTGT CCGTGTTCAG CGTGGGCAAG GCGACGGCCG TCAACCAGGC GCTGTTCAAG GAAGGGGAGA GTTACGATGG CATCGCCGAC CTCGCCTTCA TCGCGATCCT GTCGAGCGAT GGGAAGTTCG GCGGGATCCG GGCAGCGAAT GCCACCTTCT TCGCCGACAA GGGCCACACC GGCATCTACG CGCCGGGGGT GGACTTCTCG GGGCCGGTTT TTGTCGGCGA TGTGAGTGCC TTCGACCAAG CCAAACCAGT GTTGGTGCTC GGCTCGGCCG CGGATGTGCG GATTACGGGT GGCGATCTTC TGCAGGACAA CAATCAGCCC GTCCAAGTGA GCGGGATCAC ACAGCTGCAG TTCCGCGATG GTTCCGACTC GCATGGCAAC GCGTTGCCGG CACAGAAGAA CAAGGCGGTG CTGACCGAAG ACGGCATCGA CGTGACGGCG CAGATCGTCA TCGGGCCTTG A
|
Protein sequence | MASAGLATVT PCPVASGSES VESYVFTRYA GNPNDTPYGS TDGFGDSARF AAPRSIAVDA SGTLYVADAA SSVIRKITAE GMVTTFVGTA GQRGSADGIG AAARFQGIDG LAIDARGNLY AVDFTDHTVR KITPEGVVTT LAGSAGDHGT QVGHGGEARF DSPMAVAVDR WDNLYVGQMG DGAIRKVSPD GNVTILAGAG KAGSADGDSA SARFSGSDGL AVDGTGNVYV ADLFNHTIRK ITPDGVVTTL AGVARESGFA DGAGAAARFY YPRELSIDAY GNILVADEGN CAIRKVSPSG VVSTVAGKTG LSGSDDGVDA ARFSLPRGVA VSRTGDIYVA DSGNSTVRRI AVGGAVTTFA GRPGGPGYAN GSSETAQFYF PTGIAIDQNR NVFVADSYNN VIRKITPGGV VTTVAGLGGV FGSAEGSGAA ARFGVPAAVA IDAAANLYVA NRQTHVIAKI APDGAVTFFA GSPGLSGSTD GNARTEARFN GPTGIAVGPS GTIYVADFDN HTIRQISPAG MVSTLAGAAG QPGTADGTGS AARFYAPAAV TVDRAGMIYV ADSWSSAVRK ITPDGVVTTV VRQPYDGEPE RLYLPFGIAA GHDGSLYIAD TGNSTIRQIR PDGSMVTIGG GMRQEGKQDG RGGEARFLNP YGVAVDAAGH LYVADSGNNL VRKGVKVAAG KPVFTASPMN ATTLVGQSVQ FASSVSQSGG AILQWRCNGR DLVGETAAVL AVANVGLNDV GLYSITATQG LEITSTEGAV LGIASSSKVT GAGSELQADI RHPNGNVFDQ VLLTGEAEAI TADYALNQIT RTSFVDVDGD IVQVEFSGPG TLSLVLDASS GPARAEKYNQ DSVSYMKGHA GIVIAGATEQ TNVSVFSVGK ATAVNQALFK EGESYDGIAD LAFIAILSSD GKFGGIRAAN ATFFADKGHT GIYAPGVDFS GPVFVGDVSA FDQAKPVLVL GSAADVRITG GDLLQDNNQP VQVSGITQLQ FRDGSDSHGN ALPAQKNKAV LTEDGIDVTA QIVIGP
|
| |