Gene SeD_A3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3884 
Symbol 
ID6872791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3713869 
End bp3716262 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content55% 
IMG OID642786846 
Productmaltodextrin phosphorylase 
Protein accessionYP_002217474 
Protein GI198241813 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGC CAACCTTCAA TAAAGATCAA TTCCAGGCCG CCCTGACGCG TCAGTGGCAG 
CGTTTCGGTT TACTGTCGGC GTCCGACATG ACGCCTCGTC AGTGGTGGCA GGCCGTTAGC
GGCGCGCTGG CGGAATTACT GAGCGCGCAA CCGGTAGCGC AGCCGACAAA AGGCCAGCGT
CATGTGAACT ACATTTCGAT GGAATTTTTG ATTGGTCGCC TGACGGGAAA TAACCTGTTA
AATCTGGGAT GGTACCAGGA CGTTAGCGAT GTACTGAAAG CGCACGATAT TAATCTGACC
GATCTGCTGG AAGAGGAAGT CGATCCGGCG CTCGGCAACG GTGGTCTGGG ACGTCTGGCC
GCCTGCTTCC TCGATTCGAT GGCCACCGTC GGGCAGTCCG CTACGGGCTA TGGTCTGAAC
TACCAGTATG GGCTGTTCCG TCAGTCATTT GTTGAAGGCA AGCAGATGGA AGCGCCGGAT
GACTGGCATC GCGGCAGCTA TCCGTGGTTC CGCCACAACG AGGCGCTGGA CGTCCAGGTT
GGGATCGGCG GTAAAGTCAC CAAAGAAGGG CGCTGGGAGC CAGGTTTTGT GATTACAGGC
CAGGCCTGGG ATCTGCCGGT GTTAGGCTAT CGTAACGGCG TCGCGCAACC GCTGCGTTTG
TGGCAGGCGA CCCACGCGCA TCCGTTTGAT CTGACCAAAT TCAACGACGG CGCTTTCCTG
CGGGCGGAAC AGCAGGGTAT CGATGCGGAA AAACTGACGA AGGTGCTTTA TCCCAACGAT
AACCACACGG CGGGCAAAAA ACTGCGTCTG ATGCAGCAAT ACTTCCAGTG CGCCTGCTCG
GTAGCGGATA TTCTGCGTCG CCACCATCTG GCGGGCCGTA AGCTGCACGA ACTGGCCGAT
TTCGAAGTCA TTCAGTTGAA CGATACTCAC CCGACCATCG CCATTCCGGA ACTGCTGCGC
GTGCTGATTG ACGAGCACCA ACTGAGCTGG GATGACGCCT GGGCTATCAC CAGCAAAACC
TTCGCCTACA CCAACCATAC CCTGATGCCG GAAGCGCTGG AGTGCTGGGA CGAGAGGTTA
ATCAAAGCGC TGTTGCCGCG TCATATGCAG ATTATCAAGC AGATTAACGA CCGCTTTAAG
AAACTGGTCG ATAACACCTG GCCTGGCGAT AAGCAGGTAT GGACAAAACT GGCGGTGGTG
CATGACCGTC AGGTGCGCAT GGCCAATATG TGCGTGGTCA GCGGCTTTGC GGTCAACGGC
GTGGCGGCGC TGCACTCCGA TCTGGTGGTG AAAGATCTGT TCCCGGAATA TCACCAGCTT
TGGCCGAACA AATTCCACAA TGTCACCAAC GGCATTACGC CGCGTCGCTG GATTAAACAG
TGCAATCCGC AGCTTGCGGC GTTGCTGGAT AAAACGCTGA AAAAAGAGTG GGCTAACGAT
CTCGACCAGT TGATCAACCT CGAAAAATAC GCTGACGATG CGAAGTTCCG TCAGCAGTAT
CGCGACATCA AACGGGCGAA CAAAGAACGG CTGGTGAAAT TCATCCAGGC CCGTACCGGG
ATTGAGATTT CAAGCAACGC GATTTTTGAT ATTCAGATCA AACGCCTGCA CGAGTACAAG
CGTCAGCACC TGAACCTGTT GCATATTCTG GCGCTGTACA AAGAGATCCG CGAAAACCCG
CAGGCTGATC GCGTACCGCG CGTATTTCTG TTTGGCGCGA AGGCGGCGCC GGGCTATTAC
CTGGCGAAGA ACATCATTTT TGCTATCAAT AAGGTTGCGG AAGCCATTAA TAACGACCCG
GCGGTGGGTG ATAAGCTGAA GGTGGTTTTC CTGCCGGATT ACTGCGTCTC GGCGGCGGAA
ATGCTCATTC CGGCGGCGGA TATTTCCGAG CAAATTTCTA CTGCCGGGAA AGAGGCGTCC
GGCACCGGCA ACATGAAACT GGCGCTGAAC GGGGCGTTGA CTGTGGGAAC GCTGGACGGC
GCTAACGTTG AAATCGCTGA GAAGGTGGGT GAAGAGAATA TCTTTATCTT TGGCCATACT
GTGGAAGAGG TCAAGGCGCT CAAAGCCAAA GGCTACGATC CGGTGAAATG GCGTAAAAAA
GACAAAGTGC TGGATGCTGT GCTAAAAGAG CTGGAAAGCG GTCAATACAG CGATGGCGAT
AAACATGCCT TTGACCAGAT GCTGCATAGC CTCGGCAAAC AGGGGGGCGA TCCGTACCTG
GTCATGGCGG ACTTCGCCGC TTATGTCGAG GCGCAAAAGC AGGTGGATGC GCTGTATCGC
GACCAGGAAG CGTGGACGCG CGCCGCGATC CTCAATACCG CGCGCTGCGG TATGTTCAGT
TCCGATCGCT CTATTCGCGA TTATCAGGCC CGTATCTGGC AGGCAAAACG CTAA
 
Protein sequence
MSQPTFNKDQ FQAALTRQWQ RFGLLSASDM TPRQWWQAVS GALAELLSAQ PVAQPTKGQR 
HVNYISMEFL IGRLTGNNLL NLGWYQDVSD VLKAHDINLT DLLEEEVDPA LGNGGLGRLA
ACFLDSMATV GQSATGYGLN YQYGLFRQSF VEGKQMEAPD DWHRGSYPWF RHNEALDVQV
GIGGKVTKEG RWEPGFVITG QAWDLPVLGY RNGVAQPLRL WQATHAHPFD LTKFNDGAFL
RAEQQGIDAE KLTKVLYPND NHTAGKKLRL MQQYFQCACS VADILRRHHL AGRKLHELAD
FEVIQLNDTH PTIAIPELLR VLIDEHQLSW DDAWAITSKT FAYTNHTLMP EALECWDERL
IKALLPRHMQ IIKQINDRFK KLVDNTWPGD KQVWTKLAVV HDRQVRMANM CVVSGFAVNG
VAALHSDLVV KDLFPEYHQL WPNKFHNVTN GITPRRWIKQ CNPQLAALLD KTLKKEWAND
LDQLINLEKY ADDAKFRQQY RDIKRANKER LVKFIQARTG IEISSNAIFD IQIKRLHEYK
RQHLNLLHIL ALYKEIRENP QADRVPRVFL FGAKAAPGYY LAKNIIFAIN KVAEAINNDP
AVGDKLKVVF LPDYCVSAAE MLIPAADISE QISTAGKEAS GTGNMKLALN GALTVGTLDG
ANVEIAEKVG EENIFIFGHT VEEVKALKAK GYDPVKWRKK DKVLDAVLKE LESGQYSDGD
KHAFDQMLHS LGKQGGDPYL VMADFAAYVE AQKQVDALYR DQEAWTRAAI LNTARCGMFS
SDRSIRDYQA RIWQAKR