Gene CHU_1736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1736 
Symbol 
ID4183917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2040591 
End bp2042384 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content42% 
IMG OID638071735 
Productsulfatase 
Protein accessionYP_678345 
Protein GI110638136 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.140971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTT TTCTGGCAGG CAGATGCTTT TTTTTGATCT ATCACATTAA GCCTGTTGCA 
AAACTTCCTG TATCAGAAAC GTTATTAGCT TTTATATATG GACTTACCTT AGATATTTCA
TTTGCTTCTT ATATCACGAT TCTGCCGGCA GCGTTATTAC CCGTTCTGCT GCTGTTTAAC
ATTTCTGAAA AAAAAATACG TAGCGGTTTA CTTATATATA GTACTGTCTG GATTATACTG
CTTGCCTTTA TTTATACACT TGATGCAGAG CTTTTTTCAT TCTGGGGTTT CCGCTTAGAT
AATACGCTGT TCCGCTATAC AGGCACAACA GGAGAAATGA TCGGTTCAGC CATGTCGTCT
CCTCTGGTAA TCCTGCTTCC CTTCTATTTC ATTGCAAGTT ATGTTGGCTA TAAATCCTAT
AAAAAATTTC TGTATCCCTA TACCTTTTCG AAGCTGAAAA TCTGGATGCT GCCTCTTATG
CTGCTGCTTG CTGCCGCACT TGTTATCCCT ATCCGCGGCG GCTTACAACA GATTCCCAAT
AATGAAAGTG TTGCGTATTA TTCTACCAAT AATCTGCTGA ACCAGGCAGC ACTGAATCCG
GCGTGGACAT TGGCGCGGTC AGTGATTGAG CAAAACGGCC CGGATCTCAG CCGGTATCAC
TTTTTTGAAT CTGCCGAGGC TGAAAAACAG CTGGCAGCAA TGCTGCGGCA ACAGGAGCTG
GAAGCAGACA CGATTCTGAC TAATTCCCGC CCGAATATTA TTATTATTGT CTGGGAAAGC
CTTACAGCAA AAGTTTTGAA CGATACCGTA ACACCACGCC TGCATGCGCT GCTTACGGAA
GGCATTTATT TTTCAAACTT TTATGCCAGC GGAGACCGCA GTGATAAAGG GCTTGCGGCT
ATCTTAAGCG CATATCCCGC ACAACCGGAC TTTTCGATTA TGACAGAACC GGGCAAATCA
CGGAAACTAC CTTTCATTAC CCAACAGCTT AAGGCTGCAG GATACCAAAG TACTTACCTG
TATGGCGGTG AACTGGAGTT TGCCAATATG CGGTCCTATA TGAATTACAA CGGTTTTGAC
CAGGTGCTGG GCAAATCTGA TTTCCCGAAA GAAACATGGG GAGCCAAATG GGGCGCGCAC
GACGAAGCTA CCTTTGCACG GCTTTTTTCA GAAATAGAAA CCAATACGGA AACTGCCGCA
CCATTTTTTT ATACCTTGTT TACGCTAAGC AGCCATGAGC CTTATGATGT GCCTGTAAAA
CCTGTCATTA CAGGTAGTTC AAAAACTGCA CAGTTTAAAA ATGCACATCA CTACACCGAT
TCGTGTTTTT TTGACTTTAT ACAGAAAGCG AAAAAACAAA CCTGGTGGAA CAATACCTGG
ATCATTGTTG TAGCCGATCA CGGACACGTG TTACCGGGAA GTGAGTATGC CACGCACAAT
CCAAGTGAAT TTAAAATTCC GATGTTATGG CTGGGCGGTG CAATAAAAAC GCACAAAACC
ATTACAAAAA CATATTCGCA GATCAATCTG GCTCCCACGA TTGCTTCTTA CCTTAAACTA
AACCCGGAAG CCTTTACATT TTCAGCTCCT ATATACTTGC AGGATACAAC AAAACGGATG
GCCTGGTATT CGTATAACGA TGGTTTTGCA ATGGTGCAGA ATAATGACAC CTATTTTAAA
TACAATCTTA ACAGTCAGAA ACTTGAAACA AAAAAGGGAT TGTTTGATTC CAAATTCCTT
ACACAGGGAC AAGCGTTCCT TCAGATTTTA ATGGAGGATT TTTACAATAA ATAA
 
Protein sequence
MLLFLAGRCF FLIYHIKPVA KLPVSETLLA FIYGLTLDIS FASYITILPA ALLPVLLLFN 
ISEKKIRSGL LIYSTVWIIL LAFIYTLDAE LFSFWGFRLD NTLFRYTGTT GEMIGSAMSS
PLVILLPFYF IASYVGYKSY KKFLYPYTFS KLKIWMLPLM LLLAAALVIP IRGGLQQIPN
NESVAYYSTN NLLNQAALNP AWTLARSVIE QNGPDLSRYH FFESAEAEKQ LAAMLRQQEL
EADTILTNSR PNIIIIVWES LTAKVLNDTV TPRLHALLTE GIYFSNFYAS GDRSDKGLAA
ILSAYPAQPD FSIMTEPGKS RKLPFITQQL KAAGYQSTYL YGGELEFANM RSYMNYNGFD
QVLGKSDFPK ETWGAKWGAH DEATFARLFS EIETNTETAA PFFYTLFTLS SHEPYDVPVK
PVITGSSKTA QFKNAHHYTD SCFFDFIQKA KKQTWWNNTW IIVVADHGHV LPGSEYATHN
PSEFKIPMLW LGGAIKTHKT ITKTYSQINL APTIASYLKL NPEAFTFSAP IYLQDTTKRM
AWYSYNDGFA MVQNNDTYFK YNLNSQKLET KKGLFDSKFL TQGQAFLQIL MEDFYNK