Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0351 |
Symbol | |
ID | 4569523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 387683 |
End bp | 390667 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 639764949 |
Product | glycosyl transferase family protein |
Protein accession | YP_910834 |
Protein GI | 119356190 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.792541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAACA AATCTCTTTT TTTAAAACCA GCAAGAATTA TTGAACCAAT CGGTTGGTTA GGGCATATTC CATTCTCATT CTGGATTGTA GAAGCGGCGA AGCCCCAAGT AATTGTTGAA CTTGGTACTC ATTCAGGAAA TTCATATTTC TCTTTTTGCC AAAGCGTCAA GTATAACAGC TTGCCTACAC TCTGTTATGC AATATATTCA TGTTTGAATA ATGAACAAAC CGAATTATAT AATGCCGATA TTATTGATTC ATTATTAACC TATAATAATG AACACTACCA GGCTTTTTCA AATGTTACTG GTAACACTTA TTATAATGCC CTTTCTTTTT TTGAAGACGC TTCTATTGAC ATTTTACATA TAAATAGCGA AGGGACTTAT ACATCAGTAG TAAATTTATT TAATGCTTGG ATTCCAAAAT TATCCTCTTC TGGAATAGTA CTTTTTGATA ATATTAATGC TTCAGAAAAT AATACCGAAG TTTATAAATT ATGGGGATAT GTCAAAAACA TATACCCAAA CCTTACTATT GAGCATTCCA ATGGGCTGGG AGTTATTTTT ACTGGTAATA TGTATAATGA TAATATTAAT GAATTCATTA AAACTATTGA AACAGAGGCT AATAGAAATA TAATAGGAAC TCTATTTGAA AAGATTGGCA ACACAATAGA AATGGATTTT CAAATAAGAC GCCTATTAAA TTCCTTAAAA GAAGCTGATA CAATGATCGA TATGCTGAAT CAAACAATCA GTAATCGAGA GACTGTAATC AACAAACTTA TTCAAACAAT TGAAAAAAAC GATGAAACAA TAAAAAGTTT TAGTAAAAAA CATGATTTTG TGAAAGTGAA ACCTTTGCGA AAACTAAAAA AATCAATAAG AAAAAGAGCT CATACACTAA GTAAGCTTTT TTTCAAAGTC GAAGATAAAA ATCATCTCAG TGTAAATACG CATAACAGTA ATGCAGAATG GTTTCTTAAA GAATATAAAA ATATAGCATG TTCAGATAAT AAACATTGTT TTCATTATAA ATTCGGCATT ATTGATCATC TTGCCGAAAA CAATTCTTTC TTTAAAGAAT CGAAACCGAC TGCTTCTATA ATTATTCCTG TTTATGGAAA TATTTCATAC ACAATAAAAT GTATTGAGTC ACTATTAAAT TTACCTGACT CAACATCCTT TGAAATCATA ATAATCGATG ATCACTCCCC AGATAACTCA TATGAAACAC TTAATAAAAT CAGTCAAATA AAATTAATAA GAAACGACTG CAATAAAGGA TTTATACATT CATGTAATTC AGGCGCATCT CTTGCAACTG GTGAATATCT GATATTTCTC AATAATGATA CAGAAGTACT AAATGCCTGG CTCGATAGTC TTATAGCTCC ATTTATAATA CATGATAATG TCGGCTTAGT AGGTTCTCAA ATCATATATC CTGATGGGCG TTTACAGGAA GCTGGCGGAG TAATCTTGTC AGATGGCAGT GGATTAAATT ACGGAAGATT AAGTGATCCA AATAAACCAG AATACAATTT CCTGAGGGAA GTAGATTATT GCTCGGGATG CTCAATTGCA ATAAAAAAAT CTTTATTCGA TAGCATTGGA GGATTTGACA CTCTATTTAT CCCTGCTTAT TATGAGGATA CAGACATAGC ATTTACAGTT CGGAAAATGG GATATAAAGT ACTATATCAA CCAGCATCAA AGGTAATTCA TCATGAAGGG ATTACTTCTG GTACTGATTT GAAAAAAGGA GTTAAAAAAT TTCAAAATAC AAATAAAATT AAATTTTACA ACAAATGGGA AACACAATTA AAATCTCACA ATATCTTACC AGATAATTAT AGCTTAGCTA AGTGCAAGTA TTATAAAGAT ACTATATTAT TTATAGATGC ATGCACTCCA ACGCCTGATA AAGATTCTGG CTCAGTTGAT GCTTTTTTTC ATCAATATAT TTTCACAAAA TTAAATTTTA AGGTAACATT CATACCAGAT AACCTGATTT TTCTTGATGG TTACACACAG GTTTTACAAC AACTTGGTAT TGAATGCCTG TACAAACCAC ACATAACATC TATAGAAAAT TTTTTAAAAA CCAGAGGTGC TGAATTTAAA TACGTTGTAC TTTCAAGAGT AGGTATTGCA GTAAAACACA TTAACGCAAT AAAGAAATAC TGCCCTAACA GCATTTTAAT ATTTAACCTT GTTGATATTC ACTATATTCG AGAAGCACGA CAAGCAAGAA CGCTGAATTC CATCGAACTT CTTCATAAAG CAAAAAAAAC CAAAGCTACT GAACTCAGTA TCATGAAAAG ATCAGATGTG AATATCATTA TTAGTGAGTC GGAAGTCAAG CATCTTAGCA AGATCGATCC AGAATTAAAA CTTTTTAATC TTCCGTTAAT TCTTGACATG CATCAGCGCA CCAATAATTT TGAAAACAGA AAAAACATCA TGTTTATTGG CGGATTTCAA CATTCCCCCA ATGTAGATGC AGTACTGTAC TTTTCCAGGG AGATCTGGCC ATTGGTCAAA ACCAGGTTAG TTGATGCTCA CTTTATTATC ATCGGTAGTG AAATGCCAAA TGAAATAATA AATCTGCATA ATCAAAATGG AATTCTAAGC ATTGGATATA TTGAAGATTT ATCGCCTTTC TTCAACTCAT GTAAATTTTC TATAGCTCCC TTGAGATATG GAGCAGGGCA AAAAGGTAAG CTTGCTCGAA GTGGCAGCTA TGGACTTCCC TCAGTTGCAA CAAGTATAGC AGTTGAAGGA ATGGGCCTTA AACATGAAAA GCATATACTG ATTGCAGATA AACCTGATGA TTTTGCAAAT TCCATTGAAA GACTTTATCA TGACAGTATA CTTTGGGAAA AGCTTTCAAA AAATATATAT GATTACACTA ATAGTGAGTA CTCAATTACA AAAGGAATAA GCAGAATAAG CAATTTAATT CACGCTTTAA ATTAG
|
Protein sequence | MINKSLFLKP ARIIEPIGWL GHIPFSFWIV EAAKPQVIVE LGTHSGNSYF SFCQSVKYNS LPTLCYAIYS CLNNEQTELY NADIIDSLLT YNNEHYQAFS NVTGNTYYNA LSFFEDASID ILHINSEGTY TSVVNLFNAW IPKLSSSGIV LFDNINASEN NTEVYKLWGY VKNIYPNLTI EHSNGLGVIF TGNMYNDNIN EFIKTIETEA NRNIIGTLFE KIGNTIEMDF QIRRLLNSLK EADTMIDMLN QTISNRETVI NKLIQTIEKN DETIKSFSKK HDFVKVKPLR KLKKSIRKRA HTLSKLFFKV EDKNHLSVNT HNSNAEWFLK EYKNIACSDN KHCFHYKFGI IDHLAENNSF FKESKPTASI IIPVYGNISY TIKCIESLLN LPDSTSFEII IIDDHSPDNS YETLNKISQI KLIRNDCNKG FIHSCNSGAS LATGEYLIFL NNDTEVLNAW LDSLIAPFII HDNVGLVGSQ IIYPDGRLQE AGGVILSDGS GLNYGRLSDP NKPEYNFLRE VDYCSGCSIA IKKSLFDSIG GFDTLFIPAY YEDTDIAFTV RKMGYKVLYQ PASKVIHHEG ITSGTDLKKG VKKFQNTNKI KFYNKWETQL KSHNILPDNY SLAKCKYYKD TILFIDACTP TPDKDSGSVD AFFHQYIFTK LNFKVTFIPD NLIFLDGYTQ VLQQLGIECL YKPHITSIEN FLKTRGAEFK YVVLSRVGIA VKHINAIKKY CPNSILIFNL VDIHYIREAR QARTLNSIEL LHKAKKTKAT ELSIMKRSDV NIIISESEVK HLSKIDPELK LFNLPLILDM HQRTNNFENR KNIMFIGGFQ HSPNVDAVLY FSREIWPLVK TRLVDAHFII IGSEMPNEII NLHNQNGILS IGYIEDLSPF FNSCKFSIAP LRYGAGQKGK LARSGSYGLP SVATSIAVEG MGLKHEKHIL IADKPDDFAN SIERLYHDSI LWEKLSKNIY DYTNSEYSIT KGISRISNLI HALN
|
| |