Gene SNSL254_A3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3040 
SymbolgutQ 
ID6485273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2958904 
End bp2959869 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content57% 
IMG OID642738355 
ProductD-arabinose 5-phosphate isomerase 
Protein accessionYP_002042079 
Protein GI194442564 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.00853446 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGATG CACTACTAAA CGCGGGCCGT CAGACCTTAA TGCTGGAGCT ACAGGAAGCC 
AGCCGTCTGC CGGAGCGTCT GGGCGATGAT TTTGTCCGCG CCGCCAATAT CATTATTCAC
TGTGAAGGCA AAGTGATCGT TTCCGGTATT GGTAAATCAG GTCATATTGG TAAAAAAATC
GCCGCGACGC TTGCCAGTAC CGGTACTCCC GCTTTTTTTG TTCATCCGGC GGAAGCACTG
CATGGCGATC TGGGGATGAT TGAAAGCCGC GACGTGATGT TATTTATCTC CTATTCCGGC
GGCGCAAAAG AACTCGACCT CATCATCCCG CGTCTGGAAG ATAAATCCGT CGCGCTGCTG
GCGATGACCG GTAAACTTCA CTCTCCGCTG GGGCGAGCGG CAAAAGCCGT TCTGGATATT
TCCGTCGAGC GTGAAGCCTG CCCGATGCAT CTGGCGCCGA CATCCAGTAC CGTCAATACG
CTGATGATGG GCGATGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGTTT TAACGAAGAA
GATTTCGCCC GTTCGCATCC GGCTGGCGCA CTGGGCGCGC GTTTGCTCAA TAATGTGCAT
CACCTGATGC GCCAGGGCGA TGCAATACCG CAGGTGATGC TTGCCACCAG CGTGATGGAT
GCCATGCTGG AACTTAGCCG TACCGGGCTG GGGCTGGTGG CGGTTTGCGA TGAGCAACAT
GTTGTGAAAG GCGTCTTTAC CGACGGCGAC CTGCGTCGCT GGCTGGTGGG CGGCGGCGCG
CTCACCACGC CGGTAAGCGA AGCCATGACG CCCAACGGTA TTACGCTCCA GGCGCAAAGC
CGCGCCATTG ACGCCAAAGA GCTCCTGATG AAACGCAAAA TTACCGCCGC GCCAGTGGTC
GATGAAAACG GCAAACTCAC CGGCGCCATT AACCTACAGG ATTTCTACCA GGCGGGGATT
ATCTAA
 
Protein sequence
MSDALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIIIH CEGKVIVSGI GKSGHIGKKI 
AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSVALL
AMTGKLHSPL GRAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE
DFARSHPAGA LGARLLNNVH HLMRQGDAIP QVMLATSVMD AMLELSRTGL GLVAVCDEQH
VVKGVFTDGD LRRWLVGGGA LTTPVSEAMT PNGITLQAQS RAIDAKELLM KRKITAAPVV
DENGKLTGAI NLQDFYQAGI I