Gene SeD_A2440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2440 
Symbol 
ID6872234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2313095 
End bp2314315 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID642785529 
Productputative colanic acid biosynthesis glycosyltransferase WcaL 
Protein accessionYP_002216187 
Protein GI198243356 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0000788378 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGTCA GCTTTTTTCT GCTGAAATTT CCACTCTCAT CGGAAACCTT TGTGCTGAAT 
CAGATTACTG CGTTTATTGA TATGGGCCAT GAGGTGGAGA TTGTCGCGTT ACAAAAAGGC
GATACCCAAC ATACTCACGC CGCCTGGGAG AAGTATGGCC TGGCGGCGAA AACCCGCTGG
TTACAGGATG AGCCCCAGGG ACGGCTGGCG AAACTGCGCT ACCGGGCATG TAAAACGCTG
CCGGGGCTGC ATCGGGCGGC GACCTGGAAA GCGCTCAATT TTACCCGCTA TGGCGATGAA
TCACGCAATT TGATCCTTTC CGCGATTTGC GCGCAGGTGA GCCAGCCTTT TGGGGCGGAT
GTGTTCATCG CACACTTTGG TCCGGCGGGC GTGACGGCGG CCAAACTACG CGAACTGGGC
GTGCTTCGCG GCAAAATCGC GACTATTTTC CACGGGATTG ATATTTCCAG CCGCGAAGTG
CTCAGTCATT ACACGCCGGA GTATCAGCAG TTGTTTCGTC GTGGCGATCT GATGCTGCCC
ATCAGCGAAC TGTGGGCCGG TCGCCTGAAA AGTATGGGCT GTCCGCCGGA AAAGATTGCC
GTTTCGCGCA TGGGCGTCGA CATGACGCGT TTTACCCATC GTCCGGTGAA AGCGCCAGGG
ATGCCGCTGG AGATGATTTC CGTCGCGCGC CTGACTGAGA AAAAAGGCCT GCATGTGGCG
ATTGAAGCCT GTCGGCAACT GAAAGCGCAG GGCGTGGCGT TTCGCTACCG CATTCTGGGC
ATTGGCCCGT GGGAACGTCG GCTGCGCACG CTCATCGAGC AGTATCAGCT AGAGGATGTC
ATTGAGATGC CGGGGTTTAA ACCGAGCCAT GAAGTGAAGG CGATGCTGGA TGACGCCGAT
GTTTTTTTGC TGCCGTCGAT TACCGGTACG GATGGCGATA TGGAAGGTAT TCCGGTAGCG
CTGATGGAGG CGATGGCGGT AGGGATTCCC GTGGTATCTA CCGTGCATAG CGGTATTCCG
GAACTGGTGG AGGCCGGCAA ATCCGGCTGG CTGGTGCCGG AAAACGATGC GCAGGCGCTG
GCGGCCCGAC TCGCTGAGTT CAGCCGGATT GACCACGACA CGCTGGAGTC GGTGATCACG
CGCGCCCGTG AAAAAGTGGC GCAAGATTTT AATCAGCAGG CGATTAATCG CCAGCTAGCC
AGCCTGCTAC AAACGATATA A
 
Protein sequence
MKVSFFLLKF PLSSETFVLN QITAFIDMGH EVEIVALQKG DTQHTHAAWE KYGLAAKTRW 
LQDEPQGRLA KLRYRACKTL PGLHRAATWK ALNFTRYGDE SRNLILSAIC AQVSQPFGAD
VFIAHFGPAG VTAAKLRELG VLRGKIATIF HGIDISSREV LSHYTPEYQQ LFRRGDLMLP
ISELWAGRLK SMGCPPEKIA VSRMGVDMTR FTHRPVKAPG MPLEMISVAR LTEKKGLHVA
IEACRQLKAQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMPGFKPSH EVKAMLDDAD
VFLLPSITGT DGDMEGIPVA LMEAMAVGIP VVSTVHSGIP ELVEAGKSGW LVPENDAQAL
AARLAEFSRI DHDTLESVIT RAREKVAQDF NQQAINRQLA SLLQTI