Gene SeHA_C4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4037 
SymbolrfaC 
ID6488596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3920307 
End bp3921260 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content55% 
IMG OID642744138 
ProductADP-heptose:LPS heptosyl transferase I 
Protein accessionYP_002047743 
Protein GI194450269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTT TGATCGTTAA AACATCATCG ATGGGCGACG TATTACATAC CCTGCCTGCG 
CTTACCGACG CGCAACAGGC GATTCCGGGG ATTCAATTTG ATTGGGCTGT CGAAGAAGGG
TTTGCACAAA TTCCGTCCTG GCACAGTGCT GTCGATCGCG TGATTCCCGT CGCTATTCGC
CGTTGGCGCA AGGCCTGGTT TTCCGCGCCC ATCAAAGCGG AACGCACAGC CTTTCGTCGG
GCGGTATGCG CAAACCAATA CGACGCTGTG ATTGATGCGC AGGGGCTGGT AAAAAGCGCG
GCGCTGGTGA CGCGTCTGGC GCATGGGATA AAGCACGGTA TGGACTGGAG TACCGCCCGC
GAACCGCTGG CCAGCCTGTT CTATAACCGT AAACACCATA TCGCAAAGCA ACAACATGCG
GTTGAACGGA CGCGCGAGCT GTTCGCCAAA AGCCTGGGAT ACGATAAACC GCAGTCGCAG
GGCGATTATG CCATCGCAAA ACATTTTCTG CATTGCCAGC AGGCGGTTAG CGATCCGTAT
GCGGTGTTTT TACATGCCAC GACCCGCGAT GATAAACACT GGCCGGAAGC AAACTGGCGC
GAGCTTATCG GCCTGGTGGG CAACACCGGA TTACGGATAA AGCTTCCCTG GGGCGCGCCT
CATGAGGAGG CCCGGGCTAA ACGACTGGCC GAAGGCTTTG ACTATGTGGA TGTGTTACCG
CGCATGAGCC TGGAGGAGGT CGCCAGAGTG CTGGCTGGCG CAAAATTTGT CGTATCGGTT
GATACCGGCC TGAGCCATCT CACCGCCGCG CTCGACAGAC CGAATATTAC GCTATATGGC
CCAACGGACC CTGGGTTAAT TGGAGGTTAT GGGAAGAACC AAATGGCATG CTGCTCACCA
GAACAGAACC TGGCGAATTT AGATGCCACA AGCGTATTTG GAAAGATTCA TTAA
 
Protein sequence
MRVLIVKTSS MGDVLHTLPA LTDAQQAIPG IQFDWAVEEG FAQIPSWHSA VDRVIPVAIR 
RWRKAWFSAP IKAERTAFRR AVCANQYDAV IDAQGLVKSA ALVTRLAHGI KHGMDWSTAR
EPLASLFYNR KHHIAKQQHA VERTRELFAK SLGYDKPQSQ GDYAIAKHFL HCQQAVSDPY
AVFLHATTRD DKHWPEANWR ELIGLVGNTG LRIKLPWGAP HEEARAKRLA EGFDYVDVLP
RMSLEEVARV LAGAKFVVSV DTGLSHLTAA LDRPNITLYG PTDPGLIGGY GKNQMACCSP
EQNLANLDAT SVFGKIH