Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4472 |
Symbol | |
ID | 6873328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4310460 |
End bp | 4311995 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642787388 |
Product | putative ABC transporter ATP-binding protein ego |
Protein accession | YP_002217999 |
Protein GI | 198243982 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATCA GTCACAATAC TGCATCCCCT CTGATTTGTG TGCAGAACAT TTATAAAAGT TATTCCGGCG TCGAAGTACT AAAGGGAATT GACTTTACTC TGCATGCGGG AGAGGTGCAC GCATTGCTTG GCGGCAATGG TGCGGGTAAA TCAACATTAA TGAAGATTAT TGCCGGTATA GTCCCGCCAG ATGGAGGGAC TATCGATATT GCTGGTGTGC GTTGCAGTCA TTTAACGCCT CTGAAGGCGC ACCAGTATGG CATTTACCTG GTTCCCCAGG AGCCTCTGTT ATTTCCGAGT TTATCTGTGC GGGAAAATAT CTTGTTTGGC TTGCAGGGAC GTCAGGCCTC CACGGAAAAA ATGCAGCAGC TATTAAAGGC GATGGGATGC CAACTCGATC CGGCGAGCGC TGCGGGTACG CTTGATGTTG CAGACCGCCA GATCGTTGAA ATTATGCGCG GCTTGATGCG CGACTCGCGA ATCTTAATTC TTGATGAGCC CACGGCGTCG TTAACGCCAG CCGAAACTGA CCGGTTATTT ACGCGTCTGC AAGAGTTGCT GAAAAAGGGT GTCGGAATTG TATTTATTTC TCATAAGCTA CCAGAAATTA GACAGTTAGC TCACTGCGTT AGCGTGATGC GTGACGGTAA AATCGCATTA TTCGGAAAAA CGCATGACCT TTCTACCGAC GAGATTATTC AAGCTATCAC CCCGGCAACG CAGGGCGTCA GTCTTTCCGC GAATCAAAAG TTGTGGCTGG AATTGCCTGG CAGCCGCCCG CAGAACGAAC GTGGTGCGAC GGTATTAGCG CTGGAGTCAC TGACGGGCGA AGGTTTTATG AATATCAACC TTGAGGTGCG GGCAGGCGAA ATCCTTGGTC TGGCCGGGTT GGTCGGCGCG GGACGCACAG AACTGGCTGA AACGCTGTAC GGTATTAGAC CGGTCAATGC GGGGCGGATG CTGTTCAATG GCGAAGAAAT TAACGCCCTG ACAACCCAAC AGCGGTTGCA GCTCGGCCTG GTCTATTTGC CGGAAGATCG GCAGTCATCC GGGCTGTATC TTGACGCTTC CCTGGCATGG AATGTCTGTT CGCTGACCCA CAACCAAAAA GGATTTTGGA TAAAGCCCCA GCGGGATAAC GCCACCCTTG AACGTTACCA CCGCGCGTTA AATATCAAAC TCAATAATGC CGAACAGGCG GCGCGTACTT TATCCGGCGG TAACCAGCAA AAAGTATTGA TTGCCAAATG CCTGGAAGCC TCTCCGCAAT TACTGATTGT CGATGAACCG ACCCGCGGTG TCGATGTCTC CGCCCGCAGC GATATTTATC AGCTGTTGCG CAGTATCGCG CAACAAAATG TCGCGGTGCT GTTTATTTCC TCCGATCTGG AAGAGATAGA GCAGATGGCC GATCGCGTAT ATGTCATGCA CCAGGGGGAA CTGGGGGGGC CTGCGTTATG CGGCGAGGAA ATTAACGTTG ATACCATCAT GCACGTTGCG TTTGGCGAAC ATGGTGCGTC GGAGGCAACA TGTTGA
|
Protein sequence | MQISHNTASP LICVQNIYKS YSGVEVLKGI DFTLHAGEVH ALLGGNGAGK STLMKIIAGI VPPDGGTIDI AGVRCSHLTP LKAHQYGIYL VPQEPLLFPS LSVRENILFG LQGRQASTEK MQQLLKAMGC QLDPASAAGT LDVADRQIVE IMRGLMRDSR ILILDEPTAS LTPAETDRLF TRLQELLKKG VGIVFISHKL PEIRQLAHCV SVMRDGKIAL FGKTHDLSTD EIIQAITPAT QGVSLSANQK LWLELPGSRP QNERGATVLA LESLTGEGFM NINLEVRAGE ILGLAGLVGA GRTELAETLY GIRPVNAGRM LFNGEEINAL TTQQRLQLGL VYLPEDRQSS GLYLDASLAW NVCSLTHNQK GFWIKPQRDN ATLERYHRAL NIKLNNAEQA ARTLSGGNQQ KVLIAKCLEA SPQLLIVDEP TRGVDVSARS DIYQLLRSIA QQNVAVLFIS SDLEEIEQMA DRVYVMHQGE LGGPALCGEE INVDTIMHVA FGEHGASEAT C
|
| |