Gene SeHA_C3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3024 
SymbolgutQ 
ID6489844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2956914 
End bp2957879 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content57% 
IMG OID642743179 
ProductD-arabinose 5-phosphate isomerase 
Protein accessionYP_002046798 
Protein GI194450477 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.00000207652 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGATG CACTACTAAA CGCGGGCCGT CAGACCTTAA TGCTGGAGCT ACAGGAAGCC 
AGCCGTCTGC CGGAGCGTCT GGGCGATGAT TTTGTCCGCG CCGCCAATAT CATTATTCAC
TGTGAAGGCA AAGTGATCGT TTCCGGTATT GGCAAGTCAG GTCATATTGG TAAAAAAATC
GCCGCGACGC TTGCCAGTAC CGGTACTCCC GCTTTTTTTG TTCATCCGGC GGAAGCACTG
CATGGCGATC TGGGGATGAT TGAAAGCCGC GACGTGATGT TATTTATCTC CTATTCCGGC
GGCGCAAAAG AACTCGACCT CATCATCCCA CGTCTGGAAG ATAAATCCGT CGCGCTGCTG
GCGATGACCG GTAAACCTCA CTCTCCGCTG GGGCGAGCGG CAAAAGCCGT TCTGGATATT
TCCGTCGAGC GTGAAGCCTG CCCGATGCAT CTGGCGCCAA CGTCCAGTAC CGTCAATACG
CTGATGATGG GCGATGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGTTT TAACGAAGAA
GATTTCGCCC GTTCGCATCC GGCTGGCGCA CTGGGCGCGC GTTTGCTCAA TAATGTGCAT
CACCTGATGC GCCAGGGCGA TGCAATACCG CAGGTGATGC TTGCCACCAG CGTGATGGAT
GCCATGCTGG AACTTAGCCG TACCGGGCTG GGGCTGGTGG CGGTTTGCGA TGAGCAACAT
GTTGTGAAAG GCGTCTTTAC CGACGGCGAC CTGCGTCGCT GGCTGGTGGG CGGCGGCGCG
CTCACCACGC CGGTAAGCGA AGCCATGACG CCCAACGGTA TTACGCTCCA GGCGCAAAGC
CGCGCCATTG ACGCCAAAGA GCTCCTGATG AAACGCAAAA TTACCGCCGC GCCGGTGGTC
GATGAAAACG GCAAACTCAC CGGCGCCATT AACCTGCAGG ATTTCTACCA GGCGGGGATT
ATCTAA
 
Protein sequence
MSDALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIIIH CEGKVIVSGI GKSGHIGKKI 
AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSVALL
AMTGKPHSPL GRAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE
DFARSHPAGA LGARLLNNVH HLMRQGDAIP QVMLATSVMD AMLELSRTGL GLVAVCDEQH
VVKGVFTDGD LRRWLVGGGA LTTPVSEAMT PNGITLQAQS RAIDAKELLM KRKITAAPVV
DENGKLTGAI NLQDFYQAGI I