Gene SeD_A1720 details


Gene Information

Locus tag: SeD_A1720
Symbol: mdoD
ID: 6872457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1657160
End bp: 1658779
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 55%
IMG OID: 642784857
Product: glucan biosynthesis protein D
Protein accession: YP_002215525
Protein GI: 198246242
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
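
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption about this record's convention, though the arithmetic supports it) and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and exist only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1657160, 1658779)
print(length, protein_length(length))  # prints: 1620 539
```

Both values match the Gene Length and Protein Length fields of the record.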


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.29319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTTC CAGCGGTATT GCTTCCCTCT TTTCTCAGGC GGCGTTTGCC 
GCGGAATCCG ATATTGCGGA TGGTAAAATT GTCCGTTTTG ATTTTGCTGG TCTGCAATCA
ATGGCCCAGG CGTTAGCGAA AAAGCCCTGG GGTGGCGCGC CGGGACCCTT GCCGGATACG
CTCGCCAATC TGACGCCGCA GGCCTATAAC AGCATTCAGT ATGACGCGGC GCATTCACTC
TGGAACGGTG TTGCCAACCG GCAGCTCGAT ATTCAGTTTT TCCACGTAGG GATGGGCTTT
CGTCGTCGCG TACGCATGTT TTCCGTTGAT ACGACGACGC ATCTTGCTCG CGAGATTCAT
TTTCGCCCGG AACTGTTTAA ATACAACGAT GCCGGCGTCG ATACGACGCA ACTGGAAGGG
CAGAGCGATC TCGGTTTCGC CGGTTTCCGT GTCTTTAAAG CGCCGGAACT GGCGCGGCGC
GATGTCGTCT CCTTCCTGGG CGCCAGTTAT TTCCGGGCGG TAGATGATAC TTATCAGTAT
GGCCTGTCGG CTCGCGGGCT GGCGATAGAT ACCTATACTG ACGGCCAGGA AGAGTTCCCT
GACTTCACCG CATTCTGGTT TGACACCGCG AAGCCGGGCG ATACTACGTT TACCGTTTAC
GCTCTGCTGG ACAGCGCCAG CGTGACGGGC GCGTATAAAT TTGTGATCCA TTGCGAAAAA
ACGCAGGTGA TCATGGATGT AGAAAACCAT CTCTACGCCC GTAAAGATAT AAAGCAACTT
GGCATTGCGC CGATGACCAG TATGTTTAGC TGTGGGAATA ATGAACGTCG GGTATGCGAC
ACCATTCACC CGCAAATTCA CGACTCCGAT CGGCTGGCGA TGTGGCGGGG TAACGGCGAG
TGGATTTGCC GCCCGCTGAA TAATCCGCAG AAATTGCAGT TCAATGCATA TATGGACGAT
AACCCAAAAG GGTTCGGCCT GCTGCAACTC GATCGCGATT TCTCGCATTA TCAGGATGTG
ATGGGCTGGT ACAACAAACG TCCGAGCCTG TGGGTGGAGC CGCGCAGTAA GTGGGGGAAA
GGCGCGGTTA GCCTGATGGA GATCCCAACC ACTGGTGAAA CTCTGGATAA TGTGGTCTGT
TTCTGGCAGC CGGAAAAAGC GATCAAAGCC GGGGATACGC TGGCGTTTAA TTATCGTTTG
TACTGGAGCG CGCAGCCGCC GGTACAATCT CCGCTTGCGC GGGTCATGGC GACCCGTACA
GGGATGGGCG GCTTTCCCGA AGGTTGGGCG CCGGGCGAAC ATTACTCAGA TAAATGGGCG
CGCCGTTTTG CTATTGATTT TGTCGGCGGC GATCTGAAAG CGGCCGCGCC AAAAGGCATT
GAGCCGGTAA TTACGCTCTC CAGCGGTGAG GCGAAGCAGA TTGAGATCCT CTACGTTGAG
CCTTTCGACG GTTATCGTAT CCAGTTTGAC TGGTATCCGA CCTCGGATTC TACGGCACCG
GTGGATATGC GTATGTTCCT GCGCTGCCAG AGGGAGGCTA TCAGCGAAAC CTGGCTGTAT
CAGTATTTCC CGCCCGCGCC GGATAAGCGC CGTTATGTCG ACGATCGTAT CATGCGTTAG
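
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence; paste the full block for the whole gene.
first_line = "ATGGCCGCCG TGTGCGGTTC CAGCGGTATT GCTTCCCTCT TTTCTCAGGC GGCGTTTGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same functions on the complete sequence should give a length equal to the Gene Length field.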
 
Protein sequence
MAAVCGSSGI ASLFSQAAFA AESDIADGKI VRFDFAGLQS MAQALAKKPW GGAPGPLPDT 
LANLTPQAYN SIQYDAAHSL WNGVANRQLD IQFFHVGMGF RRRVRMFSVD TTTHLAREIH
FRPELFKYND AGVDTTQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDGQEEFP DFTAFWFDTA KPGDTTFTVY ALLDSASVTG AYKFVIHCEK
TQVIMDVENH LYARKDIKQL GIAPMTSMFS CGNNERRVCD TIHPQIHDSD RLAMWRGNGE
WICRPLNNPQ KLQFNAYMDD NPKGFGLLQL DRDFSHYQDV MGWYNKRPSL WVEPRSKWGK
GAVSLMEIPT TGETLDNVVC FWQPEKAIKA GDTLAFNYRL YWSAQPPVQS PLARVMATRT
GMGGFPEGWA PGEHYSDKWA RRFAIDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYVE
PFDGYRIQFD WYPTSDSTAP VDMRMFLRCQ REAISETWLY QYFPPAPDKR RYVDDRIMR
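
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal, dependency-free illustration and not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acid to every codon as the standard table and differs only in which codons may act as start codons. Because this gene starts with ATG, no special start-codon handling is included, which is a simplifying assumption. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCCGCCG TGTGCGGTTC CAGCGGTATT"))  # prints: MAAVCGSSGI
```

The printed residues match the first block of the protein sequence above.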