Gene SeHA_C2339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2339 
Symbol 
ID6490034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2243855 
End bp2245072 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content56% 
IMG OID642742528 
Productputative glycosyl transferase 
Protein accessionYP_002046163 
Protein GI194448431 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.225754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.701653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC TGCAATTTAA TGTACGGCTG GCGGAAGGCG GCGCGGCGGG AGTGGCGTTG 
GATCTCCATC TGCGGGCGCT GCAAAAAGGG CTGACGTCGC GTTTTGTCTA TGGTTATGGC
AAAGGCGGAA AAAAAAGCGT CAGCCACCAC CGTTATCCGC AGGTGATAAA ACAGACGCCG
CGCGGCACGG CAATCGCTAA TATCGCGCTG TTTCGTTTCC TGAATCGCGA TCTGTTTGGC
AATCTCGACA ATCTTTACCG CACGGTTATC CAGACATCCG GCCCGCTGGT GCTGCATTTT
CATGTTCTCC ACAGTTACTG GCTAAACCTG GCGGACATCG TGACGTTTTG CGAAAAAGTC
AAAGCGCAAA AACCAGACGT CACGCTGGTC TGGACGCTGC ACGATCACTG GAGCGTCACC
GGGCGTTGCG CCTTCACCGA CGGTTGCGAG GGTTGGAAAA GCGGCTGCCA AAAATGCCCG
ACCCTAAGCA ATTATCCGCC GGTCAGGGTG GATCGGGCGC ACCAGCTTAT TGACGGCAAA
CGTCAGCGCT TTCGGGACAT GCTGCGGCTG GGCTGCCGGT TTATTTCGCC GAGCCAGCAC
GTGGCCGAGG CCTTTAACAG CGTTTATGGC GCGGGGCGCT GCCAGATTAT TAACAACGGT
ATCGATCTGG CGACCGAGGC GATTCTCGCG CAGCTATCAC CTGTGCCGCT GAATCCGGGC
AAACCGCGGA TCGCCATTGT GGCGCATGAC TTGCGTTATG ACGGCAAAAC TGACCAGCGT
CTGGTACACG ACATGATGGC GCTGGGCGAA AAAATTGAAC TGCACACCTT CGGTAAATTT
TCGCCTTTTA CCGGCCAAAA CGTTGTTAAT CACGGTTTTG AAACCGATAA GCGCAAATTA
ATGAGCGCAC TCAATGAGAT GGATGCGCTG GTCTTTAGCT CGCGGGTCGA TAACTATCCG
CTGATCTTGT GTGAAGCGCT CTCGATCGGC GTACCGGTGA TCGCCACCCA CAGCGAGGCG
GCGCAGGAGG TGCTGGCGAA ATCCGGCGGC CAGACCTTTG CCGCTACAGA TGTACTGCGC
CTGGCGCAGC GGCGTAAGCC AGAGATTGCT CAGGCGGTAT TTGGCGCCAC GCTGGACGCC
TTTCGTATGC GTAGCCGCGT CGCGTACAGC GGTCAACAGA TGCTGGAGGA GTATGTCTCG
TTCTATCAGA ATCTGTAG
 
Protein sequence
MNILQFNVRL AEGGAAGVAL DLHLRALQKG LTSRFVYGYG KGGKKSVSHH RYPQVIKQTP 
RGTAIANIAL FRFLNRDLFG NLDNLYRTVI QTSGPLVLHF HVLHSYWLNL ADIVTFCEKV
KAQKPDVTLV WTLHDHWSVT GRCAFTDGCE GWKSGCQKCP TLSNYPPVRV DRAHQLIDGK
RQRFRDMLRL GCRFISPSQH VAEAFNSVYG AGRCQIINNG IDLATEAILA QLSPVPLNPG
KPRIAIVAHD LRYDGKTDQR LVHDMMALGE KIELHTFGKF SPFTGQNVVN HGFETDKRKL
MSALNEMDAL VFSSRVDNYP LILCEALSIG VPVIATHSEA AQEVLAKSGG QTFAATDVLR
LAQRRKPEIA QAVFGATLDA FRMRSRVAYS GQQMLEEYVS FYQNL