Gene SeHA_C0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0204 
Symbol 
ID6489911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp210214 
End bp211443 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content50% 
IMG OID642740483 
Productpolysaccharide deacetylase domain-containing protein 
Protein accessionYP_002044157 
Protein GI194451211 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTATGC GCGTTGTTCT TATCTTGCTG TTCTTTTTCG CCGGTAATGT GTTGGCTGCC 
TTGCCCGCTC GTTATATGCA AACGACGAAA GATGCCGCCA TCTGGTCGCA GATTGGCGAC
AAAATGGTGA CCGTAGGGAA TATCCGTGCC GGACAAATTC TTTCCGTAAC GCCTGTTGCG
GCTGATTATT ATGCCTTTAA ATTCGGCTTC GGTGTGGGCT TTATCGATAA AGGCCATCTG
GAATCCGTGC AGGGAAAACA AAAAGTGGAA GATGGCCTGG GCGATCTTAA CAAGCCGCTC
AGCAATCAGA ATCTGGTGAC CTGGAAGGAC ACGCCGGTGT ATAACGCGCC GGACATCAGT
AGCGCCCCGT TTGGCGTATT GGTGGATAAT TTGCGTTACC CCATTATTAG CAAGCTGCAA
GGCCGGCTAC ATCAAACCTG GTATCAAATC CGTATTGGCG ACAGGCTGGC TTATGTCAGC
GCCATGGATG CGCAGGAAGA CAACGGCATT CCGATTTTGA CCTATCATCA CATCTTACGT
GATGAAGAGA ATACTCGTTT TCGCCATACG TCCACCACGA CTTCGGTTCG GGCATTCAGC
AACCAAATGA CCTGGCTTCG CGATCGCGGC TATGCCACGT TGACGATGTA CCAACTGGAG
GATTACATCC ATAACCGCGC GAATTTCCCG GCGCGCGCGG TGGTTATCAC CTTTGACGAT
GGCCTTAAAT CGGTGAGTCG CTATGCGTAT CCGGTATTAA AGCAGTACGG TATGAAAGCG
ACGGCATTTA TTATCTCATC GCGTATTAAG CGCCATCCGC AAACATGGAA TCCCAGGTCG
CTGCAATTTA TGAGCGTGTC CGAATTGCGC AAGATAAGCG ATGTTTTTGA TTTTCAGTCG
CATACCCATT TTTTACACCG GGTAGACGGG CATCGCCGCC CGATTTTATA TAGCCGCAGC
TACCATAATA TTCTGTTTGA TTTTGAACGT TCGCGGAGGG CGCTCACACA GTTTACTCCG
CACGTATTTT ATCTTTCTTA TCCCTTTGGC GGCTATAACG CGACCGCGAT CAAAGCAGCA
AAAGACGCCG GTTTCCATCT GGCGGTCACC ACGGTGAGAG GGAAGGTGAA GCCGGGAGAT
AATCCGATGC TGCTCAAAAG GCTGTATATT TTACGCACGG ATTCGCTGGA AACGATGTCG
CGGCTGATAG TCAATCAGCC GCAGGGGTAG
 
Protein sequence
MVMRVVLILL FFFAGNVLAA LPARYMQTTK DAAIWSQIGD KMVTVGNIRA GQILSVTPVA 
ADYYAFKFGF GVGFIDKGHL ESVQGKQKVE DGLGDLNKPL SNQNLVTWKD TPVYNAPDIS
SAPFGVLVDN LRYPIISKLQ GRLHQTWYQI RIGDRLAYVS AMDAQEDNGI PILTYHHILR
DEENTRFRHT STTTSVRAFS NQMTWLRDRG YATLTMYQLE DYIHNRANFP ARAVVITFDD
GLKSVSRYAY PVLKQYGMKA TAFIISSRIK RHPQTWNPRS LQFMSVSELR KISDVFDFQS
HTHFLHRVDG HRRPILYSRS YHNILFDFER SRRALTQFTP HVFYLSYPFG GYNATAIKAA
KDAGFHLAVT TVRGKVKPGD NPMLLKRLYI LRTDSLETMS RLIVNQPQG