Gene CHU_1039 details


Gene Information

Locus tag: CHU_1039
Symbol:
ID: 4184375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 1198873
End bp: 1200108
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 40%
IMG OID: 638071037
Product: HD superfamily phosphohydrolase
Protein accession: YP_677656
Protein GI: 110637449
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
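
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes one untranslated stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 1198873
end_bp = 1200108
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 1236 bp corresponds to 412 codons, of which 411 encode amino acids.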


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.914108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAATA AGAAGAAAGT TATCAATGAT CCGGTTTGGG GGTTTATAAA TATCCCAACA 
GATCTTATAT TTGAAATCAT ACAACATCCC TACTTTCAGC GACTGCGCAG GATCAGGCAG
TTAGGGCTAA CGGAAATGGT TTATCCCGGT GCCATACACA CACGTTTTCA CCATGCATTA
GGCGCCATGC ACTTAATGAC CGAAGCGTTG AAGTCATTAC AAAGCAAAGG CCATACTATT
TCTTCCGAAG AATTCGAGGG GGCCCAATTG GCCATTCTGC TGCATGATAT CGGCCACGGG
CCGCTCTCCC ACGCGCTGGA ATATTCCTTA TTAGAAAATA TCAAACACGA AGAGCTTTCC
AGATTTATTA TGGAAAGCCT GAACGTTACA TACAAAGGCA AGCTGGATCT TGCAATCGCA
ATCTTTACCA ATACCTACCA CCGTCCGTTT CTGCACCAAC TGGTTTCAAG CCAGCTGGAT
ATGGACCGGA TGGATTATTT AAGCCGCGAC AGTTTTTTCA CGGGCGTATC CGAAGGAACA
ATTGGAGCAG ACCGTTTGAT CAAAATGCTT GATCTGCATA ATGACGAACT GGTTGTTGAA
GAAAAAGGAA TTTACAGTAT TGAAAATTTT TTAACGGCAC GCCGCTTAAT GTACTGGCAG
GTATATTTAC ATAAGACAAC GGTTAGCGCA GAGAACATGG TGATATCCAT TATAAAACGC
GCTAAGTTTT TAATGAAAAA CAATACGCTG AGTTATAAGC CGCCATTCTT AGAAGTTTTT
CTTCAGACAG AAATTACTTT TGAACAGCTG CTTCAGCAAT CTGATCTCTT AAGCGCATTC
TTAAAACTTG ATGATGTAGA TTTATGGTTC GCTATAAAAC AATGGTCAAA AGAAGAAGAT
GTTGTTTTAT CAACCATCAG TAAAATGATT CTGGACCGCA AGCTCTTTTC CGTACATATT
CAACCGGAAC AAATCGGAGC CAACCAAATC GAAACGAACC AAAAACGCTT GCTTTTGACC
TTTCCGATAA CAACGGCAGA GTTGGATTAT TTTCTGATTC AGGGAACTAT TAGTAATGCG
GCGTACCTTG CTGAAAATAC TCAAATCAAG GTAAAAATGA AGAATAAAGA AATTTTGGAT
GTGGCCATAG CTTCTGATTT ACCCAATATT CAGGCGCTGA GCAAAATTGT AACCAAACAT
TATATTTGCT GTCCAAAAGA TGTATATTTG CAATAA
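
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is supplied as a string in the 10-base grouped layout used above; `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used here for brevity.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGATTAATA AGAAGAAAGT TATCAATGAT CCGGTTTGGG GGTTTATAAA TATCCCAACA"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

The 40% figure in the record applies to the full 1236 bp sequence, so a single 60-base line will generally give a different value; passing the complete sequence to the same function gives the gene-wide figure.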
 
Protein sequence
MINKKKVIND PVWGFINIPT DLIFEIIQHP YFQRLRRIRQ LGLTEMVYPG AIHTRFHHAL 
GAMHLMTEAL KSLQSKGHTI SSEEFEGAQL AILLHDIGHG PLSHALEYSL LENIKHEELS
RFIMESLNVT YKGKLDLAIA IFTNTYHRPF LHQLVSSQLD MDRMDYLSRD SFFTGVSEGT
IGADRLIKML DLHNDELVVE EKGIYSIENF LTARRLMYWQ VYLHKTTVSA ENMVISIIKR
AKFLMKNNTL SYKPPFLEVF LQTEITFEQL LQQSDLLSAF LKLDDVDLWF AIKQWSKEED
VVLSTISKMI LDRKLFSVHI QPEQIGANQI ETNQKRLLLT FPITTAELDY FLIQGTISNA
AYLAENTQIK VKMKNKEILD VAIASDLPNI QALSKIVTKH YICCPKDVYL Q
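
The protein sequence can be checked against the gene sequence by translating codons. The following is a minimal sketch, not the database's own pipeline: it builds a codon table in the conventional TCAG order and translates until the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted; since this gene begins with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence listed above.
first_30 = "ATGATTAATA AGAAGAAAGT TATCAATGAT"
last_line = "TATATTTGCT GTCCAAAAGA TGTATATTTG CAATAA"

print(translate(first_30))   # MINKKKVIND
print(translate(last_line))  # YICCPKDVYLQ
```

Both fragments match the start and end of the listed protein sequence; the final TAA codon is the stop and produces no residue.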