Gene SeD_A3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3147 
SymbolgutQ 
ID6874751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3026793 
End bp3027758 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content57% 
IMG OID642786168 
ProductD-arabinose 5-phosphate isomerase 
Protein accessionYP_002216809 
Protein GI198244302 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00650637 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.0214142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG CACTACTAAA CGCGGGCCGT CAGACCTTAA TGCTGGAGCT ACAAGAAGCC 
AGCCGTCTGC CGGAGCGTCT GGGCGATGAT TTTGTCCGCG CCGCCAATAT CATTATTCAC
TGTGAAGGCA AAGTGATCGT TTCCGGTATT GGTAAATCAG GTCATATTGG TAAAAAAATC
GCCGCGACGC TTGCCAGTAC CGGTACTCCC GCTTTTTTTG TTCATCCGGC GGAAGCACTG
CATGGCGATC TGGGGATGAT TGAAAGCCGC GATGTGATGT TATTTATCTC CTATTCCGGC
GGCGCAAAAG AACTCGACCT CATCATCCCG CGTCTGGAAG ATAAATCCGT CGCGCTGCTG
GCGATGACCG GTAAACCTCT CTCTCCGCTG GGGCGAGCGG CAAAAGCCGT TCTGGATATT
TCCGTCGAGC GTGAAGCCTG TCCGATGCAT CTGGCGCCGA CATCCAGTAC CGTCAATACG
CTGATGATGG GCGATGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGTTT TAACGAAGAA
GATTTCGCCC GTTCGCATCC GGCTGGCGCA CTGGGCGCGC GTTTGCTCAA TAATGTGCAT
CACCTGATGC GCCAGGGCGA TGCAATACCG CAGGTGATGC TCGCCACCAG CGTGATGGAT
GCCATGCTGG AACTTAGCCG TACCGGGCTG GGGCTGGTGG CGGTTTGCGA TGAGCAACAT
GTTGTGAAAG GCGTCTTTAC CGACGGCGAC CTGCGCCGCT GGCTGGTGGG CGGCGGCGCG
CTCACCACGC CGGTAAGCGA AGCCATGACG CCCAACGGTA TTACGCTCCA GGCGCAAAGC
CGCGCCATTG ACGCCAAAGA GCTCCTGATG AAGCGCAAAA TTACCGCCGC GCCAGTGGTC
GATGAAAACG GCAAACTCAC CGGCGCCATT AACCTACAGG ATTTCTACCA GGCGGGGATT
ATCTAA
 
Protein sequence
MSDALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIIIH CEGKVIVSGI GKSGHIGKKI 
AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSVALL
AMTGKPLSPL GRAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE
DFARSHPAGA LGARLLNNVH HLMRQGDAIP QVMLATSVMD AMLELSRTGL GLVAVCDEQH
VVKGVFTDGD LRRWLVGGGA LTTPVSEAMT PNGITLQAQS RAIDAKELLM KRKITAAPVV
DENGKLTGAI NLQDFYQAGI I