Gene CHU_1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1766 
Symbol 
ID4186838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2072826 
End bp2073929 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content40% 
IMG OID638071765 
ProductDNA-3-methylpurine glycosylase 
Protein accessionYP_678375 
Protein GI110638166 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0802128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0268874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAT TAAAATACGT TTATTCACCG GCCTTTATAG ATTCGTTAAT TGCTTTTTTG 
AAAAAGGTTC ATCCGTCTTT GAATAAAAAA GCATTCGCTG CTGCTGTTTT TGATGCCGAA
TGGGATAACC GTGAATTGAA GCAGCGGATG AAGCATCTGG CACATGTGCT GCATCAGCAG
CTGCATCAAG CCTATGCAAA GGATATTGAA ACAATTATAG CGTTGGTGCA TTTGTTAAAA
GCGGACAGAG ATAACCATCA GAGTTTCGAA TATTTATTTT TGGCCGAATA TGTTGAAATA
TATGGTCAGC ACGATGTGGT GCTATCCATG AAAGCAATTG AAGAAATTAC ACAATATACC
AGCTGTGAAT TTGCGATCCG TCCTTTTCTG ATCAAACATC CGGAGAAGGT AATGAAGTAC
ATGCTTAAAT GGTCGAAACA TAAACATGCC AGTGTAAGGC GTTTTTCCAG CGAAGGCTGC
CGACCCCGGT TGCCATGGGG TATGGCGCTT CCTGCATTCA AAAAAGACCC GTCCTTGATT
TTACCTGTTC TTGAAAATCT GAAAACAGAT GAATCGTTGT ATGTGCGTAA GAGTGTAGCA
AACAATTTAA ATGATATCGC AAAGGATAAT CCGGAGGTGG TGATTGACCT GATTAAAAAA
TGGCAGGGCG TTTCGCCATA CACAGACTGG ATCATTAAGC ACGGTGCCCG TACACTGCTG
AAAAAAGCAC ATGCAGAAGT GCTGGGTTTA TTTGGCTTAC AGACAACACT TGCTTGTACC
GTTTCAAATC TGACCCTGAT AAAAAATAAG ATCAAAATAG GAGATACGTT GTCTTTCGCT
TTTGATCTGG ATACCGGCTC CAAAGCAGAT GCGAAGCTGC GGATCGAATT TGCCGTTTAT
TATGTAAAAG CAGGCGGGAA GCCCAGCCGC AAACTTTTTA AGATTACAGA AAATACTTAC
CAGAAAGGTA AACGGGTTTC ATTTAACAAA AAACTTTCAT TTAAAGATTT AACTACAAGA
AAACATTATG CGGGGAAGCA TACCATTGCT ATTGTTGTAA ATGGAAATGA ATTGATAGCC
TCCGATTTCC ATCTTCTGGG CTAA
 
Protein sequence
MEPLKYVYSP AFIDSLIAFL KKVHPSLNKK AFAAAVFDAE WDNRELKQRM KHLAHVLHQQ 
LHQAYAKDIE TIIALVHLLK ADRDNHQSFE YLFLAEYVEI YGQHDVVLSM KAIEEITQYT
SCEFAIRPFL IKHPEKVMKY MLKWSKHKHA SVRRFSSEGC RPRLPWGMAL PAFKKDPSLI
LPVLENLKTD ESLYVRKSVA NNLNDIAKDN PEVVIDLIKK WQGVSPYTDW IIKHGARTLL
KKAHAEVLGL FGLQTTLACT VSNLTLIKNK IKIGDTLSFA FDLDTGSKAD AKLRIEFAVY
YVKAGGKPSR KLFKITENTY QKGKRVSFNK KLSFKDLTTR KHYAGKHTIA IVVNGNELIA
SDFHLLG