Gene SeHA_C1800 details

Gene Information

Locus tag: SeHA_C1800
Symbol: mdoD
ID: 6487777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1757920
End bp: 1759539
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 55%
IMG OID: 642742016
Product: glucan biosynthesis protein D
Protein accession: YP_002045661
Protein GI: 194447357
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
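
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the values listed here, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1757920
END_BP = 1759539
STOP_CODON_BP = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1620
print(protein_length(length_bp))  # 539
```

Both results match the Gene Length and Protein Length fields of the record.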


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.947764
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTTC CAGCGGTATT GCTTCCCTCT TTTCTCAGGC GGCGTTTGCC 
GCGGAATCCG ATATTGCGGA TGGTAAAATT GTCCGTTTTG ATTTTGCTGG TCTGCAATCA
ATGGCCCAGG CGTTAGCGAA AAAGCCCTGG GGTGGCGCGC CGGGACCCTT GCCGGATACG
CTCGCCAATC TGACGCCGCA GGCCTATAAC AGCATTCAGT ATGACGCGGC GCATTCACTC
TGGAACGGTG TTGCCAACCG GCAGCTCGAT ATTCAGTTTT TCCACGTAGG GATGGGCTTT
CGTCGTCGCG TACGCATGTT TTCCGTTGAT ACGACGACGC ATCTTGCTCG CGAGATTCAT
TTTCGCCCGG AACTGTTTAA ATACAACGAT GCCGGCGTCG ACACGACGCA ACTGGAAGGG
CAGAGCGATC TCGGTTTCGC CGGTTTCCGT GTCTTTAAAG CGCCGGAACT GGCGCGGCGC
GATGTCGTCT CCTTCCTGGG CGCCAGTTAT TTCCGGGCGG TAGATGATAC TTATCAGTAT
GGCCTGTCGG CTCGCGGGCT GGCGATAGAT ACCTATACTG ACGGTCAGGA AGAGTTCCCT
GACTTCACCG CATTCTGGTT TGACACCGCG AAGCCGGGCG ATACTACGTT TACCGTTTAC
GCTCTGCTGG ACAGCGCCAG CGTGACGGGC GCGTATAAAT TTGTGATTCA TTGCGAAAAA
TCGCAGGTGA TCATGGATGT AGAAAACCAT CTCTACGCCC GTAAAGATAT TAAGCAACTT
GGCATTGCGC CGATGACCAG TATGTTTAGC TGTGGGAATA ATGAACGCCG GGTATGCGAC
ACCATTCACC CGCAAATTCA CGACTCCGAT CGGCTGGCGA TGTGGCGGGG TAACGGCGAG
TGGATTTGCC GCCCGCTGAA TAATCCGCAG AAATTGCAGT TCAATGCATA TATGGACGAT
AACCCAAAAG GGTTCGGCCT GCTGCAACTC GATCGCGATT TCTCGCATTA TCAGGATGTG
ATGGGCTGGT ACAACAAACG TCCGAGCCTG TGGGTGGAGC CGCGCAGTAA GTGGGGGAAA
GGCGCGGTTA GCCTGATGGA GATCCCAACC ACTGGCGAAA CTCTGGATAA TGTGGTCTGT
TTCTGGCAGC CGGAAAAAGC GATCAAAGCC GGGGATACGC TGGCGTTTAA TTATCGTTTG
TACTGGAGCG CGCAGCCGCC GGTACAATCG CCGCTTGCGC GGGTCATGGC GACCCGTACA
GGGATGGGCG GCTTTCCCGA GGGTTGGGCG CCGGGCGAAC ATTACCCGGA TAAATGGGCG
CGCCGTTTTG CTATTGATTT TGTCGGCGGC GATCTGAAAG CGGCCGCGCC AAAAGGCATT
GAGCCGGTAA TTACGCTCTC CAGCGGTGAG GCGAAGCAGA TTGAGATCCT CTACGTTGAG
CCTTTCGACG GTTATCGTAT CCAGTTTGAC TGGTATCCGA CCTCGGATTC TACGGCACCG
GTGGATATGC GTATGTTCCT GCGCTGCCAG GGGGAGGCTA TCAGCGAAAC CTGGCTGTAT
CAGTATTTCC CGCCCGCGCC GGATAAGCGC CGTTATGTTG ACGATCGTAT CATGCGTTAG
 
Protein sequence
MAAVCGSSGI ASLFSQAAFA AESDIADGKI VRFDFAGLQS MAQALAKKPW GGAPGPLPDT 
LANLTPQAYN SIQYDAAHSL WNGVANRQLD IQFFHVGMGF RRRVRMFSVD TTTHLAREIH
FRPELFKYND AGVDTTQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDGQEEFP DFTAFWFDTA KPGDTTFTVY ALLDSASVTG AYKFVIHCEK
SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGNNERRVCD TIHPQIHDSD RLAMWRGNGE
WICRPLNNPQ KLQFNAYMDD NPKGFGLLQL DRDFSHYQDV MGWYNKRPSL WVEPRSKWGK
GAVSLMEIPT TGETLDNVVC FWQPEKAIKA GDTLAFNYRL YWSAQPPVQS PLARVMATRT
GMGGFPEGWA PGEHYPDKWA RRFAIDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYVE
PFDGYRIQFD WYPTSDSTAP VDMRMFLRCQ GEAISETWLY QYFPPAPDKR RYVDDRIMR
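
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step plus a GC-fraction calculation; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example runs only on the first line of the gene sequence rather than the full gene.

```python
def join_blocks(grouped: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(grouped.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCCGCCG TGTGCGGTTC CAGCGGTATT GCTTCCCTCT TTTCTCAGGC GGCGTTTGCC"
seq = join_blocks(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this 60 bp excerpt
```

Passing the whole gene sequence through `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field.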