Gene CHU_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1044 
Symbol 
ID4184380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1204891 
End bp1206849 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content35% 
IMG OID638071042 
Productb-glycosyltransferase 
Protein accessionYP_677661 
Protein GI110637454 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00623044 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAACAAT TAAGTATCAT AATTGTAAAT TATAATGTCT GTCATTTTTT AGAACAGGCG 
CTTATATCCG TATCTAAAGC GATTAAATCT TTAGATGTTG AAGTTTTTGT TGTTGACAAC
AATTCTGCAG ATGGCTCTGT TGAAATGGTT CAAACCAAAT TCCCGAACGT ACAATTAATC
GTAAACGATA TAAATGTCGG TTTCTCAAAA GCCAATAATC AGGCTATTGA ACAGGCTACA
GGTAAATATA TACTGTTACT GAATCCCGAC ACCGTTATTG AAGTTGACAC CTTAGAGAAA
TGTATTCACT TTTTGGATAC CCACCCGGAT GGCGGTGGTT TAGGTGTTAA AATGATTGAT
GGCAAAGGTG ATTTTTTAGC CGAATCAAAA AGAGGTTTCC CTACGCCATG GGTAGCATTC
TACAAAATAT TCGGGTTAGC AAAACTGTTT CCTCATTCTA AAAAATTTGG TCATTATCAT
TTAGGTTATT TAGATAAAGA TCAGAACCAC GAAGTGGAAG TATTATCCGG CGCCTTTATG
GTGCTTCGGA AATCCATGCT GGACAAAGTG GGCAACTTAG ACGAAGATTA CTTCATGTAT
GGAGAAGATA TCGATCTTTC TTACCGCATT ATTAAAGCTG GCTATAAAAA TTATTATCTT
TCAGATACCC GCATTATTCA TTACAAAGGA GAAAGCACTA AAAAGACAAG TGTCAATTAC
GTGTTCATCT TTTACAAAGC AATGATCATT TTTGCACAGA AACATTTTAC ATCTAAAAGT
TCCGGTGCAT TTTCATTACT GATTCATTTA GCGATTTATC TCCGGGCACT CTTAGCCATC
AGCAACAGAG TTATTGAAAA GCTTTTTCCT ATAGCATTTG ATGCAGCCTT AATTCTTGCT
TCCTTATTCA GCCTGTTCTA TTTTAAAAAT TCAGAAAACG CAATCGGCGA CCAGGGGAAT
TCAATCATCT ATAAGCAGAT CATTCCCTTA TTTTCAAGTG TCTGGTTATT ATCCCTGTTA
TTTAATGGCG CATACAAAAG CAATGTAACA CTTGCCCGTT TAGCCAGAAG CTTCTTTTTC
GGCACATTGA TCATAGCCTC CATATCTTAT TTCATAGACG AATACCGCTA TTCTAAAAAC
TTTCTTCTTG AAGGTTCTTT GCTTTCGTTG TTTATGGTAT TCCTGTTCAG AGGGATTGCC
CACTGGATCA GAAACGGCCA TTTTGAATTA GGAGAAAGTA AAAATAAAAA AATTGTTATT
GTTGGTTCGT ATAAAGAATG TGAACGCATC GATAAACTGC TGCAGGAAAC CAACTACAAA
CTAAATGTTC TGGGTTTCAT TACTACAGGA AACAAAGCCG ATGTAAAAGG CAAATATCTG
GGCTACACAA AACAATTATT GAATATTGTA CGCTTATATA AAGTAGACGA GATCATATTC
TGCTCAAAAG ATCTGCCGGC AAATTCTATT ATAGAATGGA TGACTCAGAT CAACAATACA
CTCGTTGACT TTAAAATTGT TCCCGAAGAA AGTAATATTA TTATCGGAAG TAATTCTAAA
AACAGACGGG GTGATTTTTA TTCCCTGAAT ATTAACCTGA ACATTATTGA GGAAAACAAC
GTTAAAGATA AACGTATACT TGATGTAAGT ACAAGTATAC TGTTCTTATT TATGTATCCG
GTAATCTTTT GGTTGATTCA GAACCCTAAA AACTTCTTTA ATAATATCTT AAAAGTATTA
TCAGGGAAAA AATCATGGGT TGGTTTTACA AACACCGAAC AGTTGAACTT ACCTAAGATT
AAAAAAGGTA TTGTCAATCC GAGCTATTAC CTTGAAAAAT CGAACCATCA GCTTCCGCTG
AATATTCAGG AACTGAATTT GATTTATGCC CGCGATTACA ATCTGTACAT GGACATTATG
CTAATAGTTA AATCGTTTAA ATATCTGGGT AAAAGCTAA
 
Protein sequence
MKQLSIIIVN YNVCHFLEQA LISVSKAIKS LDVEVFVVDN NSADGSVEMV QTKFPNVQLI 
VNDINVGFSK ANNQAIEQAT GKYILLLNPD TVIEVDTLEK CIHFLDTHPD GGGLGVKMID
GKGDFLAESK RGFPTPWVAF YKIFGLAKLF PHSKKFGHYH LGYLDKDQNH EVEVLSGAFM
VLRKSMLDKV GNLDEDYFMY GEDIDLSYRI IKAGYKNYYL SDTRIIHYKG ESTKKTSVNY
VFIFYKAMII FAQKHFTSKS SGAFSLLIHL AIYLRALLAI SNRVIEKLFP IAFDAALILA
SLFSLFYFKN SENAIGDQGN SIIYKQIIPL FSSVWLLSLL FNGAYKSNVT LARLARSFFF
GTLIIASISY FIDEYRYSKN FLLEGSLLSL FMVFLFRGIA HWIRNGHFEL GESKNKKIVI
VGSYKECERI DKLLQETNYK LNVLGFITTG NKADVKGKYL GYTKQLLNIV RLYKVDEIIF
CSKDLPANSI IEWMTQINNT LVDFKIVPEE SNIIIGSNSK NRRGDFYSLN INLNIIEENN
VKDKRILDVS TSILFLFMYP VIFWLIQNPK NFFNNILKVL SGKKSWVGFT NTEQLNLPKI
KKGIVNPSYY LEKSNHQLPL NIQELNLIYA RDYNLYMDIM LIVKSFKYLG KS