Gene SeD_A4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4070 
Symbol 
ID6871271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3915515 
End bp3917431 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content55% 
IMG OID642787019 
ProductPTS system mannitol-specific transporter subunit EIICBA 
Protein accessionYP_002217646 
Protein GI198244431 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2213] Phosphotransferase system, mannitol-specific IIBC component
[COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain 
TIGRFAM ID[TIGR00851] PTS system, mannitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.293886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCG ATATTAAGAT CAAAGTGCAA AGCTTTGGTC GATTCCTCAG CAATATGGTG 
ATGCCTAATA TCGGCGCGTT TATCGCGTGG GGTATTATCA CCGCATTATT TATTCCAACA
GGGTGGTTGC CTAACGAAAC GCTGGCGAAA CTGGTTGGTC CGATGATCAC CTACCTGTTG
CCGCTGCTCA TCGGTTATAC CGGCGGTCGT CTGGTTGGCG GCGAACGCGG CGGCGTAGTG
GGCGCTATCA CTACCATGGG CGTTATTGTC GGCGCGGATA TGCCGATGTT CCTCGGCTCG
ATGATCGCCG GTCCATTAGG CGGCTACTGC ATTAAGAAAT TCGACAGTTG GGTAGACGGT
AAGATCAAAT CCGGTTTTGA GATGCTGGTG AACAACTTCT CCGCTGGCAT CATCGGTATG
ATCCTCGCCA TCCTGGCGTT CCTCGGTATT GGTCCGGCGG TTGAAGTCCT GTCCAAAATT
CTGGCGGCGG GCGTTAACTT CATGGTGGCG CACGATATGC TGCCGCTGGC GTCCATTTTT
GTTGAACCGG CGAAAATCCT GTTCCTCAAT AACGCGATTA ACCACGGCAT CTTCTCACCG
CTGGGTATCC AGCAGTCCCA TGAGATGGGT AAATCCATCT TCTTCCTGAT TGAAGCTAAC
CCGGGTCCGG GGATGGGCGT CCTGCTGGCG TACATGTTCT TCGGTCGCGG CAGCGCTAAA
CAGTCTGCGG GCGGCGCGGC CATCATCCAC TTCCTGGGCG GTATCCACGA AATTTACTTC
CCGTACGTGC TGATGAACCC ACGCCTGATT CTGGCCGTTA TCCTTGGCGG TATGACCGGC
GTATTCACCC TGACCATCCT GAACGGCGGT CTGGTCTCTC CGGCGTCTCC GGGTTCCATT
CTGGCGGTAC TGGCGATGAC GCCAAAAGGC GCTTACTTCG CTAACATCGC TGCCATCGTG
GCGGCAATGG CGGTCTCCTT CGTGGTTTCT GCAATTCTGC TGAAAACCAG CAAAGTGAAA
GAAGAAGATG ACATTGAAGC GGCCACCCGT CGTATGCATG ACATGAAAGC GGAATCTAAA
GGCGCCTCTC CGCTGGCGGC TGGCGACGTG ACCAACGACC TGAGCCATGT GCGTAAAATC
ATCGTGGCTT GCGATGCCGG TATGGGTTCC AGCGCGATGG GCGCAGGGGT ACTGCGTAAG
AAAGTACAGG ATGCAGGCCT GAGCCAGATC TCCGTCACCA ACAGCGCCAT TAACAATCTG
CCGCCGGATG TGGATCTAGT CATTACTCAC CGTGACCTGA CCGAGCGCGC GATGCGTCAG
GTACCGCAGG CGCAGCATAT TTCGCTGACC AACTTCCTGG ATAGCGGCCT GTACACCAGC
CTGACCGAAC GTCTGGTTGC CGCACAGCGC CATAATACTA ATGAAGAGAA AGTCCGCGAC
CATCTGAAAG ACAGCTTTGA GGAGGGTGAT AACAACCTGT TCAAACTGGG TGCGGAGAAT
ATCTTCCTGG GCCGTAAAGC CGCTAACAAA GAAGAGGCGA TTCGCTTTGC CGGTGAACAA
CTGGTGAAAG GCGGCTATGT TGAGCCGGAA TACGTCGAGG CGATGCTGGA CCGCGAAAAG
CTGACGCCGA CTTACCTCGG TGAATCCATC GCAGTACCGC ACGGTACAGT AGAAGCTAAA
GACCGCGTGC TGAAAACCGG CGTGGTGTTC TGCCAGTATC CGGAAGGCGT ACGCTTCGGT
GAAGAAGAAG ACGATATTGC CCGTCTGGTC ATTGGTATCG CCGCACGTAA CAATGAGCAC
ATTCAAGTGA TTACCAGCCT GACCAACGCG CTAGATGATG AATCCGTGAT CGAACGTCTG
GCGCACACCA CCAGCGTGGA TGAAGTGCTG GAACTGCTGG CAGGTAAAAA AGCTTAA
 
Protein sequence
MSSDIKIKVQ SFGRFLSNMV MPNIGAFIAW GIITALFIPT GWLPNETLAK LVGPMITYLL 
PLLIGYTGGR LVGGERGGVV GAITTMGVIV GADMPMFLGS MIAGPLGGYC IKKFDSWVDG
KIKSGFEMLV NNFSAGIIGM ILAILAFLGI GPAVEVLSKI LAAGVNFMVA HDMLPLASIF
VEPAKILFLN NAINHGIFSP LGIQQSHEMG KSIFFLIEAN PGPGMGVLLA YMFFGRGSAK
QSAGGAAIIH FLGGIHEIYF PYVLMNPRLI LAVILGGMTG VFTLTILNGG LVSPASPGSI
LAVLAMTPKG AYFANIAAIV AAMAVSFVVS AILLKTSKVK EEDDIEAATR RMHDMKAESK
GASPLAAGDV TNDLSHVRKI IVACDAGMGS SAMGAGVLRK KVQDAGLSQI SVTNSAINNL
PPDVDLVITH RDLTERAMRQ VPQAQHISLT NFLDSGLYTS LTERLVAAQR HNTNEEKVRD
HLKDSFEEGD NNLFKLGAEN IFLGRKAANK EEAIRFAGEQ LVKGGYVEPE YVEAMLDREK
LTPTYLGESI AVPHGTVEAK DRVLKTGVVF CQYPEGVRFG EEEDDIARLV IGIAARNNEH
IQVITSLTNA LDDESVIERL AHTTSVDEVL ELLAGKKA