Gene Information |
Locus tag | CHU_1044 |
Symbol | |
ID | 4184380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 1204891 |
End bp | 1206849 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638071042 |
Product | b-glycosyltransferase |
Protein accession | YP_677661 |
Protein GI | 110637454 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
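
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1204891
end_bp = 1206849
reported_gene_length = 1959     # bp
reported_protein_length = 652   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which does not encode a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, remainder, protein_length)
```

Under these assumptions the computed gene length (1959 bp) and protein length (652 aa) agree with the reported values, and the remainder of 0 confirms the span is a whole number of codons.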
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00623044 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAAACAAT TAAGTATCAT AATTGTAAAT TATAATGTCT GTCATTTTTT AGAACAGGCG CTTATATCCG TATCTAAAGC GATTAAATCT TTAGATGTTG AAGTTTTTGT TGTTGACAAC AATTCTGCAG ATGGCTCTGT TGAAATGGTT CAAACCAAAT TCCCGAACGT ACAATTAATC GTAAACGATA TAAATGTCGG TTTCTCAAAA GCCAATAATC AGGCTATTGA ACAGGCTACA GGTAAATATA TACTGTTACT GAATCCCGAC ACCGTTATTG AAGTTGACAC CTTAGAGAAA TGTATTCACT TTTTGGATAC CCACCCGGAT GGCGGTGGTT TAGGTGTTAA AATGATTGAT GGCAAAGGTG ATTTTTTAGC CGAATCAAAA AGAGGTTTCC CTACGCCATG GGTAGCATTC TACAAAATAT TCGGGTTAGC AAAACTGTTT CCTCATTCTA AAAAATTTGG TCATTATCAT TTAGGTTATT TAGATAAAGA TCAGAACCAC GAAGTGGAAG TATTATCCGG CGCCTTTATG GTGCTTCGGA AATCCATGCT GGACAAAGTG GGCAACTTAG ACGAAGATTA CTTCATGTAT GGAGAAGATA TCGATCTTTC TTACCGCATT ATTAAAGCTG GCTATAAAAA TTATTATCTT TCAGATACCC GCATTATTCA TTACAAAGGA GAAAGCACTA AAAAGACAAG TGTCAATTAC GTGTTCATCT TTTACAAAGC AATGATCATT TTTGCACAGA AACATTTTAC ATCTAAAAGT TCCGGTGCAT TTTCATTACT GATTCATTTA GCGATTTATC TCCGGGCACT CTTAGCCATC AGCAACAGAG TTATTGAAAA GCTTTTTCCT ATAGCATTTG ATGCAGCCTT AATTCTTGCT TCCTTATTCA GCCTGTTCTA TTTTAAAAAT TCAGAAAACG CAATCGGCGA CCAGGGGAAT TCAATCATCT ATAAGCAGAT CATTCCCTTA TTTTCAAGTG TCTGGTTATT ATCCCTGTTA TTTAATGGCG CATACAAAAG CAATGTAACA CTTGCCCGTT TAGCCAGAAG CTTCTTTTTC GGCACATTGA TCATAGCCTC CATATCTTAT TTCATAGACG AATACCGCTA TTCTAAAAAC TTTCTTCTTG AAGGTTCTTT GCTTTCGTTG TTTATGGTAT TCCTGTTCAG AGGGATTGCC CACTGGATCA GAAACGGCCA TTTTGAATTA GGAGAAAGTA AAAATAAAAA AATTGTTATT GTTGGTTCGT ATAAAGAATG TGAACGCATC GATAAACTGC TGCAGGAAAC CAACTACAAA CTAAATGTTC TGGGTTTCAT TACTACAGGA AACAAAGCCG ATGTAAAAGG CAAATATCTG GGCTACACAA AACAATTATT GAATATTGTA CGCTTATATA AAGTAGACGA GATCATATTC TGCTCAAAAG ATCTGCCGGC AAATTCTATT ATAGAATGGA TGACTCAGAT CAACAATACA CTCGTTGACT TTAAAATTGT TCCCGAAGAA AGTAATATTA TTATCGGAAG TAATTCTAAA AACAGACGGG GTGATTTTTA TTCCCTGAAT ATTAACCTGA ACATTATTGA GGAAAACAAC GTTAAAGATA AACGTATACT TGATGTAAGT ACAAGTATAC TGTTCTTATT TATGTATCCG GTAATCTTTT GGTTGATTCA GAACCCTAAA AACTTCTTTA ATAATATCTT AAAAGTATTA TCAGGGAAAA AATCATGGGT TGGTTTTACA AACACCGAAC AGTTGAACTT ACCTAAGATT 
AAAAAAGGTA TTGTCAATCC GAGCTATTAC CTTGAAAAAT CGAACCATCA GCTTCCGCTG AATATTCAGG AACTGAATTT GATTTATGCC CGCGATTACA ATCTGTACAT GGACATTATG CTAATAGTTA AATCGTTTAA ATATCTGGGT AAAAGCTAA
|
Protein sequence | MKQLSIIIVN YNVCHFLEQA LISVSKAIKS LDVEVFVVDN NSADGSVEMV QTKFPNVQLI VNDINVGFSK ANNQAIEQAT GKYILLLNPD TVIEVDTLEK CIHFLDTHPD GGGLGVKMID GKGDFLAESK RGFPTPWVAF YKIFGLAKLF PHSKKFGHYH LGYLDKDQNH EVEVLSGAFM VLRKSMLDKV GNLDEDYFMY GEDIDLSYRI IKAGYKNYYL SDTRIIHYKG ESTKKTSVNY VFIFYKAMII FAQKHFTSKS SGAFSLLIHL AIYLRALLAI SNRVIEKLFP IAFDAALILA SLFSLFYFKN SENAIGDQGN SIIYKQIIPL FSSVWLLSLL FNGAYKSNVT LARLARSFFF GTLIIASISY FIDEYRYSKN FLLEGSLLSL FMVFLFRGIA HWIRNGHFEL GESKNKKIVI VGSYKECERI DKLLQETNYK LNVLGFITTG NKADVKGKYL GYTKQLLNIV RLYKVDEIIF CSKDLPANSI IEWMTQINNT LVDFKIVPEE SNIIIGSNSK NRRGDFYSLN INLNIIEENN VKDKRILDVS TSILFLFMYP VIFWLIQNPK NFFNNILKVL SGKKSWVGFT NTEQLNLPKI KKGIVNPSYY LEKSNHQLPL NIQELNLIYA RDYNLYMDIM LIVKSFKYLG KS
|
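
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with TTG even though the gene is on the minus strand) and uses NCBI translation table 11, in which the amino acid assignments match the standard code but several alternative start codons, including TTG, are read as methionine at the first position. The function and constant names are hypothetical, chosen for illustration only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11; any of them is read as
# methionine when it initiates the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("TTGAAACAAT TAAGTATCAT AATTGTAAAT"))
```

Applied to the first 30 bases of the gene sequence, this yields MKQLSIIIVN, the first block of the listed protein sequence. Passing the full gene sequence should reproduce the whole protein under the same assumptions.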