Gene CHU_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_0894 
Symbol 
ID4186362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1022198 
End bp1023340 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content34% 
IMG OID638070894 
Producta-glycosyltransferase 
Protein accessionYP_677515 
Protein GI110637308 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00308239 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAATC AGAAAAAAAA TCGTAACACG CCTATGAATA TATTGTTTTT ATATATCAAT 
GCATATTCTT CTGTAGGCGG CATACAAAAA TTCAATCAGA ATTTTTTACA TGCCTTAAAG
CAACTGAATG CAGCTGACAA TACTATAACG AGCCTCTCTT TATCTGATAA AACAGAAGAT
CTTCCACATG CCGCACACAC ACATTTTAAC ACTGCGGATG GAAGCCGTAT GAAATTTTTA
ATGAAAGCAG TGAAACTTTC ATTAAAATCG GATGTTGTCG TTTTCGGACA TATAAATTTA
TTTTTTCCTC TGATATTGAT TTTGAAAATT TTATTAAGAA AAAAAATCGT ATTGATTACA
CATGGGATAG AGATTTGGCG ACCATTGGGT TTTTTTACAA AAAAATGCCT GTCTCTGATT
GATACGGTTA TAACAGTAAG TAATTTCACT AAAAATAAAA TAATAGAAAT ACATAATGTT
CCTGCACATA AGATAAAAAT CCTTTGGAAT ACGCTGGATC CGGAATTTGA TGCGTCTGTA
AAAAATGAAA AGCCTGAATA TCTCATGCAA CGTTTTGGAA TTGCTCCGTC AGATAAGGTA
ATTCTGACAG TTTGCAGATT GGTAGCCGGA GAAAAAAACA AAGGCTACGA CAAAGTAATT
CAAAGCCTGA AAGAAATAAA AAAACAAATA CCGGGCGTGA AATATTTGTT AGCCGGCAAA
TACGATCTGA TAGAAAAACA ACGTCTGGAT AATCTCATAG AAGAACATGC CTTACAGCAA
CAGGTGATAT TTTCCGGATA TATTAAAACA GAAGAGCTTC CGGATATATA TAGCTTATGT
GATGTTTTTA TTATGCCTTC TTCTAAAGAA GGTTTTGGAA TTGTATTTCT TGAGGCATTG
GTAAAAGGAA AACCTGTGAT AGCGGGAAAT AAAGACGGCA GTGTGGATGC TTTATTGAAT
GGTGAACTAG GGTTGCTTAT TGATCCGGAG GACCAGTATG CAATTACTAC TGCACTGGTA
GATACATTGA GCATGAAAGC GGATAGTCGC TTTTATAATG TCAATGAACT TCACGATAAA
ACAATCAACA CATTTGGTAT GAAAGCATTC ACCAATCGCA TAAACGATAT TTTAATTAGT
TAA
 
Protein sequence
MLNQKKNRNT PMNILFLYIN AYSSVGGIQK FNQNFLHALK QLNAADNTIT SLSLSDKTED 
LPHAAHTHFN TADGSRMKFL MKAVKLSLKS DVVVFGHINL FFPLILILKI LLRKKIVLIT
HGIEIWRPLG FFTKKCLSLI DTVITVSNFT KNKIIEIHNV PAHKIKILWN TLDPEFDASV
KNEKPEYLMQ RFGIAPSDKV ILTVCRLVAG EKNKGYDKVI QSLKEIKKQI PGVKYLLAGK
YDLIEKQRLD NLIEEHALQQ QVIFSGYIKT EELPDIYSLC DVFIMPSSKE GFGIVFLEAL
VKGKPVIAGN KDGSVDALLN GELGLLIDPE DQYAITTALV DTLSMKADSR FYNVNELHDK
TINTFGMKAF TNRINDILIS