Gene Information |
Locus tag | SeD_A1720 |
Symbol | mdoD |
ID | 6872457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1657160 |
End bp | 1658779 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642784857 |
Product | glucan biosynthesis protein D |
Protein accession | YP_002215525 |
Protein GI | 198246242 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
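
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1657160, 1658779)
print(length)                  # 1620
print(protein_length(length))  # 539
```

Both results agree with the Gene Length (1620 bp) and Protein Length (539 aa) fields listed in the record.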
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.29319 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCG TGTGCGGTTC CAGCGGTATT GCTTCCCTCT TTTCTCAGGC GGCGTTTGCC GCGGAATCCG ATATTGCGGA TGGTAAAATT GTCCGTTTTG ATTTTGCTGG TCTGCAATCA ATGGCCCAGG CGTTAGCGAA AAAGCCCTGG GGTGGCGCGC CGGGACCCTT GCCGGATACG CTCGCCAATC TGACGCCGCA GGCCTATAAC AGCATTCAGT ATGACGCGGC GCATTCACTC TGGAACGGTG TTGCCAACCG GCAGCTCGAT ATTCAGTTTT TCCACGTAGG GATGGGCTTT CGTCGTCGCG TACGCATGTT TTCCGTTGAT ACGACGACGC ATCTTGCTCG CGAGATTCAT TTTCGCCCGG AACTGTTTAA ATACAACGAT GCCGGCGTCG ATACGACGCA ACTGGAAGGG CAGAGCGATC TCGGTTTCGC CGGTTTCCGT GTCTTTAAAG CGCCGGAACT GGCGCGGCGC GATGTCGTCT CCTTCCTGGG CGCCAGTTAT TTCCGGGCGG TAGATGATAC TTATCAGTAT GGCCTGTCGG CTCGCGGGCT GGCGATAGAT ACCTATACTG ACGGCCAGGA AGAGTTCCCT GACTTCACCG CATTCTGGTT TGACACCGCG AAGCCGGGCG ATACTACGTT TACCGTTTAC GCTCTGCTGG ACAGCGCCAG CGTGACGGGC GCGTATAAAT TTGTGATCCA TTGCGAAAAA ACGCAGGTGA TCATGGATGT AGAAAACCAT CTCTACGCCC GTAAAGATAT AAAGCAACTT GGCATTGCGC CGATGACCAG TATGTTTAGC TGTGGGAATA ATGAACGTCG GGTATGCGAC ACCATTCACC CGCAAATTCA CGACTCCGAT CGGCTGGCGA TGTGGCGGGG TAACGGCGAG TGGATTTGCC GCCCGCTGAA TAATCCGCAG AAATTGCAGT TCAATGCATA TATGGACGAT AACCCAAAAG GGTTCGGCCT GCTGCAACTC GATCGCGATT TCTCGCATTA TCAGGATGTG ATGGGCTGGT ACAACAAACG TCCGAGCCTG TGGGTGGAGC CGCGCAGTAA GTGGGGGAAA GGCGCGGTTA GCCTGATGGA GATCCCAACC ACTGGTGAAA CTCTGGATAA TGTGGTCTGT TTCTGGCAGC CGGAAAAAGC GATCAAAGCC GGGGATACGC TGGCGTTTAA TTATCGTTTG TACTGGAGCG CGCAGCCGCC GGTACAATCT CCGCTTGCGC GGGTCATGGC GACCCGTACA GGGATGGGCG GCTTTCCCGA AGGTTGGGCG CCGGGCGAAC ATTACTCAGA TAAATGGGCG CGCCGTTTTG CTATTGATTT TGTCGGCGGC GATCTGAAAG CGGCCGCGCC AAAAGGCATT GAGCCGGTAA TTACGCTCTC CAGCGGTGAG GCGAAGCAGA TTGAGATCCT CTACGTTGAG CCTTTCGACG GTTATCGTAT CCAGTTTGAC TGGTATCCGA CCTCGGATTC TACGGCACCG GTGGATATGC GTATGTTCCT GCGCTGCCAG AGGGAGGCTA TCAGCGAAAC CTGGCTGTAT CAGTATTTCC CGCCCGCGCC GGATAAGCGC CGTTATGTCG ACGATCGTAT CATGCGTTAG
Protein sequence | MAAVCGSSGI ASLFSQAAFA AESDIADGKI VRFDFAGLQS MAQALAKKPW GGAPGPLPDT LANLTPQAYN SIQYDAAHSL WNGVANRQLD IQFFHVGMGF RRRVRMFSVD TTTHLAREIH FRPELFKYND AGVDTTQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY GLSARGLAID TYTDGQEEFP DFTAFWFDTA KPGDTTFTVY ALLDSASVTG AYKFVIHCEK TQVIMDVENH LYARKDIKQL GIAPMTSMFS CGNNERRVCD TIHPQIHDSD RLAMWRGNGE WICRPLNNPQ KLQFNAYMDD NPKGFGLLQL DRDFSHYQDV MGWYNKRPSL WVEPRSKWGK GAVSLMEIPT TGETLDNVVC FWQPEKAIKA GDTLAFNYRL YWSAQPPVQS PLARVMATRT GMGGFPEGWA PGEHYSDKWA RRFAIDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYVE PFDGYRIQFD WYPTSDSTAP VDMRMFLRCQ REAISETWLY QYFPPAPDKR RYVDDRIMR
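
The sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be normalized and its GC fraction computed. The helper names are hypothetical, and the excerpt uses only the first and last 10-base blocks of the gene sequence as a short demonstration, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last 10-base blocks of the gene sequence in this record.
excerpt = clean_sequence("ATGGCCGCCG CATGCGTTAG")

print(len(excerpt))               # 20
print(excerpt.startswith("ATG"))  # True: start codon
print(excerpt.endswith("TAG"))    # True: stop codon
print(gc_fraction(excerpt))
```

Pasting the full Gene sequence field into `clean_sequence` allows the resulting length and GC fraction to be compared with the Gene Length and GC content fields of the record.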