Gene SeD_A4104 details


Gene Information

Locus tag: SeD_A4104
Symbol:
ID: 6875589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3952185
End bp: 3953198
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 35%
IMG OID: 642787051
Product: lipopolysaccharide 1,2-glucosyltransferase
Protein accession: YP_002217678
Protein GI: 198242856
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
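
The coordinates and lengths listed above agree with one another, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. A minimal sketch of that arithmetic follows; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3952185
end_bp = 3953198

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

If either assumption does not hold for a given record (for example, 0-based coordinates or a partial CDS), the computed values will differ from the listed ones by a small offset.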


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.049978
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCAT TTCCTGAGAT AGAAATAGCT GAATATAAAG TTTTTGATGA AAGTAATAAT 
AATAATGATG ATAACGTATT AAACATTTCT TATGGTGTTG ATGAAAACTA TCTTGATGGT
GTGGGGGTAT CAATCGCTTC AGTTGTATTA AACAATAATA TCCCGCTCGC TTTTCACATT
ATTTGTGATT CATACTCCCC GTGTTTTGTA AAATATATAG AGCGTTTAGC CGTACAGCAT
CACATAAAAA TTTCTCTTTA TCTTATTAAA GTAGAAAGCC TTGAGGTATT GCCTCAAACT
AAAGTATGGT CGAGAGCAAT GTATTTTCGT TTATTTGCTT TCGATTATCT CAGCAAGAAG
GTAAATACCT TACTTTATTT GGATGCCGAT GTTGTATGCA AAGGATCTTT GCAAGATCTT
CTACGGCTTG ATTTGACAGA GAAGATTGCT GCGGTCGTAA AAGATGTTGA TTCCATCCAG
AATAAGGTAA ATGAGAGATT AAGCGCTTTT AATTTACAAG GTGGTTATTT TAACTCCGGC
GTGGTTTTTG TTAACCTGAA ATTATGGAAA GAGAATGCCT TAACCAAAAA GGCATTTTTA
CTTTTGGCAG GTAAAGAGGC TGACTCTTTT AAATATCCCG ATCAGGATGT TTTGAATATT
CTCCTACAGG ATAAAGTCAT TTTTCTACCG CGACCATATA ATACCATTTA TACTATCAAA
AGTGAGTTGA AAGATAAGTC ACATAAAAAA TATAGCAATA TAATTAATGA TAATACTGTT
TTAATTCATT ATACGGGCGC TACAAAACCA TGGCATGCCT GGGCAAATTA TCCTTCAGTT
ATCTATTATA AAAATGCACG ACTGAACTCG CCCTGGAAAG ATTCTCCTGC AAAAGATGCG
CGTACCATAG TCGAATTTAA GAAGCGATAT AAACATCTTC TCGTGCAGGG TCATTATTTT
AAAGGCCTTA TGGCTGGAAG CGCATATCTT TATCGTAAAC TTTTCCACAA ATAA
 
Protein sequence
MDSFPEIEIA EYKVFDESNN NNDDNVLNIS YGVDENYLDG VGVSIASVVL NNNIPLAFHI 
ICDSYSPCFV KYIERLAVQH HIKISLYLIK VESLEVLPQT KVWSRAMYFR LFAFDYLSKK
VNTLLYLDAD VVCKGSLQDL LRLDLTEKIA AVVKDVDSIQ NKVNERLSAF NLQGGYFNSG
VVFVNLKLWK ENALTKKAFL LLAGKEADSF KYPDQDVLNI LLQDKVIFLP RPYNTIYTIK
SELKDKSHKK YSNIINDNTV LIHYTGATKP WHAWANYPSV IYYKNARLNS PWKDSPAKDA
RTIVEFKKRY KHLLVQGHYF KGLMAGSAYL YRKLFHK
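
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The sketch below is hedged: it uses only the first 60 bases of the gene sequence above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `translate` and `gc_fraction` are hypothetical and introduced only for illustration.

```python
# Build the codon -> amino acid map in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence (60 bases), spaces kept as in the record.
first_60 = "ATGGATTCAT TTCCTGAGAT AGAAATAGCT GAATATAAAG TTTTTGATGA AAGTAATAAT"

print(translate(first_60))    # compare with the start of the protein sequence
print(gc_fraction(first_60))  # GC fraction of this 60-base fragment only
```

Pasting the full gene sequence into `first_60` would allow the whole protein and the overall GC content to be compared with the values in the record; the fragment's GC fraction on its own is not expected to match the gene-wide figure.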