Gene SeHA_C4047 details

Gene Information

Locus tag: SeHA_C4047
Symbol:
ID: 6492576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3929748
End bp: 3930872
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 48%
IMG OID: 642744148
Product: lipopolysaccharide core biosynthesis protein RfaG
Protein accession: YP_002047753
Protein GI: 194449220
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
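
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3929748, 3930872
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1125
print(protein_length(length_bp))  # 374
```

Both results agree with the Gene Length and Protein Length fields listed in the record.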


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.40753
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGAGTTG CCTTTTGCTT ATATAAATAT TTTCCTTTTG GCGGTCTGCA GCGTGATTTT 
ATGCGTATTG CTCAAACCGT GGCGGCGCGA GGTCATCAGG TTCGTGTTTA TACTCAGTCA
TGGGAAGGGG AATGCCCGGA TAACTTTGAA TTAATCCGCG TGCCGGTTAA ATCCCGGACG
AACCACGGTC GTAACGCAGA ATATTATGCC TGGGTGCAAC ACCATTTGCG CGACCACCCT
GTCGATCGGG TGGTTGGATT CAATAAGATG CCGGGCCTTG ATGTGTATTA CGCCGCAGAC
GTGTGTTACG CCGAAAAAGT CGCACAGGAA AAAGGATTTT TCTATCGCCT GACATCACGC
TATCGGCATT ATGCTGCTTT TGAACGCGCC ACGTTTGAAC ACGGCAAGCC GACGCAACTA
TTAATGCTGA CGAATAAGCA GATTGCTGAC TTCCAAAAAC ATTATCAGAC TGAAGCGGAG
CGTTTCCATA TTCTTCCTCC GGGGATTTAC CCGGACAGAA AATATAGCCA ACAGATCCCA
AACAGTCGTC AAATTTATCG TCAGAAAAAT GGTATCTCAG AACAGCAAAA ATTACTGTTG
CAAGTAGGGT CTGACTTTAC CCGTAAAGGT GTGGATCGCT CTATTGAAGC GCTGGCATCG
CTACCCGAAT CTTTACGGCA AAATACGGTG CTCTATGTTG TCGGGCAGGA TAAGCCGAAG
AAGTTTGCAG CACTGGCTGA AAGAAGCGGC GTCGGCACGA ATGTGCATTT TTTCTCCGGA
CGTAATGATA TCGCGGAGTT AATGGCGGCA GCCGACCTTT TACTGCATCC AGCCTATCAG
GAAGCTGCTG GTATTGTTTT GCTGGAAGCC ATTACTGCTG GTTTGCCGGT GCTGACAACT
GCGGTGTGCG GTTATGCACA TTATATTGTG GATGCAAACT GTGGCGAAGC GATGACTGAA
CCATTCCGTC AGGATGCGCT AAATGAGGTT TTACTCAAAG CGCTGACACA GCCTTCCTTA
CGCAACGCCT GGGCTGAAAA TGCGCGGTAT TATGCTGATA CCCAGGATTT ATACAGCTTA
CCGGAGAAGG CCACGGATAT TATTACAGGT GATTTAGATG GTTGA
 
Protein sequence
MRVAFCLYKY FPFGGLQRDF MRIAQTVAAR GHQVRVYTQS WEGECPDNFE LIRVPVKSRT 
NHGRNAEYYA WVQHHLRDHP VDRVVGFNKM PGLDVYYAAD VCYAEKVAQE KGFFYRLTSR
YRHYAAFERA TFEHGKPTQL LMLTNKQIAD FQKHYQTEAE RFHILPPGIY PDRKYSQQIP
NSRQIYRQKN GISEQQKLLL QVGSDFTRKG VDRSIEALAS LPESLRQNTV LYVVGQDKPK
KFAALAERSG VGTNVHFFSG RNDIAELMAA ADLLLHPAYQ EAAGIVLLEA ITAGLPVLTT
AVCGYAHYIV DANCGEAMTE PFRQDALNEV LLKALTQPSL RNAWAENARY YADTQDLYSL
PEKATDIITG DLDG
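
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the database's own method. It builds the codon table from the conventional TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as initiators, so the table is sufficient for this check. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so sequences formatted in blocks of
    ten bases (as in the record above) can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAGAGTTG CCTTTTGCTT ATATAAATAT"))  # MRVAFCLYKY
# Last 15 bases, ending in the TGA stop codon.
print(translate("GATTTAGATG GTTGA"))  # DLDG
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above; passing the full gene sequence to `translate` would allow the same comparison over all 374 residues.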