Gene SeHA_C2281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2281 
Symbol 
ID6491136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2180369 
End bp2181235 
Gene Length867 bp 
Protein Length288 aa 
Translation table11 
GC content56% 
IMG OID642742471 
Productpropanediol utilization protein 
Protein accessionYP_002046106 
Protein GI194447350 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.0102974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGTTG CGCAATGCCC CGCCTCATGC GGGGAACTTA TCCAGGGATG GATTCTGGGC 
AGTGAGAAAC TGGTCTCCTG CCCCGTTGAC TGGTACAGCA CCGTAGCAGT CACGGCTGCG
CCGCCGTTGA TAAACGAGCG CCCATTGTCG CGGGCGATGG TGGAGCGCGT TCTGGCGCAC
TGGCAGTATC CTGCGCACTG GAGTAACGAG ATTCGCGTCG ATGTGCGTTC GTCAATTCCC
GTTGCCAAAG GCATGGCCAG CAGCACCGCA GATATTGCCG CTACGGCGGT GGCAACGGCG
CATCATCTTG GCCATTCGCT GGATGAAACC ACCCTTGCAC AGCTTTGCGT CTCAATCGAA
CCCACTGATA GCACCGTTTT TCATCAGTTA ACGCTGTTTG ATCATAATAA TGCGGCCACG
CAAATCGCCT GCGAGCCACC GCCGCCAATC GATTTGCTGG TACTGGAAAG CCCGGTCACA
CTGCGCACGC AAGATTACCA CCGTCTCCCT CGCCAGCAGA AATTAATAGC AAGTTCAGCA
ACCTTGCAGC AGGCCTGGAA TCTGGTGCAG GAAGCCTGTA TAACGCAAAA TCCGCTCCGG
CTGGGTGAGG CGGCTACGCT TAGCGCTATC GCCAGCCAGA CGCTGTTACC TAAGCCGGGC
TTTACCGCCC TGCTGTCGCT GGTCGAAGAG TGTGATTTAT ACGGATTGAA CGTGGCGCAT
AGCGGTAGCG TGGTGGGTCT GATGCTGGAC CGGAAACGTC ATGATATTGC GCGTCTGAAA
GGTAAGCTGG CAGAGAAAAA ACTTACCCGA CACTGGCCAA AACAACATTT ACTCAAGATG
GTCACTGGCG GGGTCAAACT GCAGTGA
 
Protein sequence
MAVAQCPASC GELIQGWILG SEKLVSCPVD WYSTVAVTAA PPLINERPLS RAMVERVLAH 
WQYPAHWSNE IRVDVRSSIP VAKGMASSTA DIAATAVATA HHLGHSLDET TLAQLCVSIE
PTDSTVFHQL TLFDHNNAAT QIACEPPPPI DLLVLESPVT LRTQDYHRLP RQQKLIASSA
TLQQAWNLVQ EACITQNPLR LGEAATLSAI ASQTLLPKPG FTALLSLVEE CDLYGLNVAH
SGSVVGLMLD RKRHDIARLK GKLAEKKLTR HWPKQHLLKM VTGGVKLQ