Gene SeHA_C4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4031 
Symbol 
ID6487805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3913415 
End bp3914377 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content56% 
IMG OID642744132 
Productdivergent polysaccharide deacetylase 
Protein accessionYP_002047737 
Protein GI194447310 
COG category[S] Function unknown 
COG ID[COG2861] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00871141 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTCAGT TTCGTCGCTC CATTCTCACG CTGGCCACTT TGCTGGCGTT TGCACATCCC 
GTTTTCGCTG GCAAGCTCGC CATCGTGATT GATGATTTTG GCTATCGCCC GCACACGGAA
AACCAGGTTC TGGCGCTGCC GCCAAACATC TCCGTCGCTG TACTGCCCAA CGCGCCGCAC
GCGCGCGAAA TGGCAACTAA AGCGCACAAT AGCGGGCATG AGGTGTTAAT CCATCTGCCG
ATGGCGCCGC TAAGCAAACA GCCGCTGGAG AAGGATACGC TGCGACCAGA TATGAGCAGC
GATGAGATCG AGCGCATTAT CCGCGAGGCG GTAAACAACG TGCCGTATGC CGTCGGGCTT
AATAACCACA TGGGCAGCGC AATGACTTCC AGCCTGTTCG GTATGCAAAA AGTTATGCAG
GCGCTGGAAC ATTACAATCT CTATTTTCTC GACAGCATGA CGATTGGCAA TAGCCAGGCG
ATGCGCGCGG CATCCGGTAC GGGTGTGAAA GTGATCAAGC GCAAAGTGTT CCTCGACGAT
ACGCAAAACG AGGCGGATAT CCGTCGTCAG TTTAATCGCG CTATCGAACT GGCCCGTCGC
AACGGTTCCG CTATCGCGAT TGGTCATCCA CATCCCGCAA CGGTTCGCGT GCTGCAACAG
ATGGTTTATC GCCTGCCGGC GGATATCACC CTGGTACGTC CAGGCAGCCT GCTCAACGAA
CCGCAGGTAG ATACGTCCCG ACCTGGTGTG ACGCCGCAGA AAATTGACGC GCCGCGCAAT
CCCTTCCGCG GCGTAAAGAT GTGCAAGCCG AAAAAACCGC TGCAACCGGT CTACGCTACG
CGCTTTTTCA GCGTCATCGG CGAGAGCATT ACGCAAAGTT CCGTGGTTAC CTGGTTTCAG
CACCAGTGGC AAGGCTGGGG GAAAATCGCC GCGCCTAAAA ACGTGAGCGC TAAGACAGAT
TGA
 
Protein sequence
MPQFRRSILT LATLLAFAHP VFAGKLAIVI DDFGYRPHTE NQVLALPPNI SVAVLPNAPH 
AREMATKAHN SGHEVLIHLP MAPLSKQPLE KDTLRPDMSS DEIERIIREA VNNVPYAVGL
NNHMGSAMTS SLFGMQKVMQ ALEHYNLYFL DSMTIGNSQA MRAASGTGVK VIKRKVFLDD
TQNEADIRRQ FNRAIELARR NGSAIAIGHP HPATVRVLQQ MVYRLPADIT LVRPGSLLNE
PQVDTSRPGV TPQKIDAPRN PFRGVKMCKP KKPLQPVYAT RFFSVIGESI TQSSVVTWFQ
HQWQGWGKIA APKNVSAKTD