Gene SeHA_C1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1942 
SymbolgalU 
ID6489462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1895632 
End bp1896540 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content50% 
IMG OID642742152 
ProductUTP--glucose-1-phosphate uridylyltransferase subunit GalU 
Protein accessionYP_002045797 
Protein GI194450744 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1210] UDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01099] UTP-glucose-1-phosphate uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.966248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCC TTAATTCGAA AGTCAAAAAA GCCGTTATCC CGGTCGCGGG ATTGGGAACC 
AGGATGCTGC CAGCGACCAA AGCTATCCCG AAAGAGATGC TGCCGCTGGT TGATAAGCCA
TTAATTCAGT ACGTCGTGAA CGAATGTATC GCTGCTGGCA TTACTGAAAT CGTGCTTGTT
ACGCACTCGT CTAAAAACTC TATTGAAAAC CACTTTGATA CCAGTTTTGA GCTGGAAGCG
ATGCTGGAAA AACGCGTTAA GCGTCAGCTT CTGGAGGAGG TTCAGTCTAT TTGCCCTCCG
CATGTCACTA TTATGCAGGT ACGTCAAGGG CTGGCAAAAG GCCTGGGCCA TGCCGTATTG
TGCGCGCATC CCGTTGTCGG AAACGAACCT GTCGCTGTTA TTCTGCCAGA CGTTATTCTT
GACGAATATG AGTCCGACCT GTCTCAGGAT AACCTGGCTG AAATGATCCG CCGTTTTGAC
GAAACCGGCA ATAGCCAGAT TATGGTTGAG CCGGTAGAAG ATGTGACTGC ATACGGCGTG
GTCGATTGTA AAGGCGTTGA GCTGGCGCCG GGCGAAAGTG TGCCGATGGT TGGCGTGGTT
GAAAAACCAA AAGCGGATGT CGCGCCGTCT AACCTTGCGA TTGTCGGGCG TTATGTGTTG
AGCGCGGATA TCTGGGCGTT GCTGGCGAAA ACCCCTCCGG GCGCCGGGGA TGAAATTCAG
TTGACCGATG CTATCGATAT GCTGATCGAA AAAGAAACGG TTGAAGCCTA CCACATGAAG
GGTAAAAGCC ATGACTGTGG TAATAAGTTA GGATATATGC AGGCATTCGT TGAATATGGC
ATCCGTCATA ATTCGCTGGG TGCTGAATTT AAAGCCTGGC TTGAAGAAGA AATGGGTATT
AAGAAGTAA
 
Protein sequence
MAALNSKVKK AVIPVAGLGT RMLPATKAIP KEMLPLVDKP LIQYVVNECI AAGITEIVLV 
THSSKNSIEN HFDTSFELEA MLEKRVKRQL LEEVQSICPP HVTIMQVRQG LAKGLGHAVL
CAHPVVGNEP VAVILPDVIL DEYESDLSQD NLAEMIRRFD ETGNSQIMVE PVEDVTAYGV
VDCKGVELAP GESVPMVGVV EKPKADVAPS NLAIVGRYVL SADIWALLAK TPPGAGDEIQ
LTDAIDMLIE KETVEAYHMK GKSHDCGNKL GYMQAFVEYG IRHNSLGAEF KAWLEEEMGI
KK