Gene Ent638_1971 details

Gene Information

Locus tag: Ent638_1971
Symbol: mdoD
ID: 5113387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2143300
End bp: 2144955
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 55%
IMG OID: 640492159
Product: glucan biosynthesis protein D
Protein accession: YP_001176698
Protein GI: 146311624
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
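
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2143300
end_bp = 2144955
gene_length_bp = 1656
protein_length_aa = 551

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1
codons = span // 3

coords_match_length = span == gene_length_bp
in_frame = span % 3 == 0
# Assumption: one codon is the stop codon and is not part of the protein.
protein_matches = codons - 1 == protein_length_aa

print(span, codons, coords_match_length, in_frame, protein_matches)
# prints: 1656 552 True True True
```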


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGCA GACGTTTTTT ACAGGGCTCG CTGGCAATGG CCGCACTGAG CGGTACAACC 
GGACTCTCAA CGCTCTTTTC CCGCGCGGCC TTTGCCGCAG ATTCCGATAT TGCCGACGGT
CAGAGTCGTC GCTTTGACTT CTCCGTTCTG CAATCCATGG CGCACGATCT GGCGCAAACT
CCGTGGGGTG GTGCGCCGCG TCCACTGCCA AATACGCTGG CGACCATGAC GCCGCAGGCA
TATAACGCCA TTCAATACGA TGCGAAACAG TCGCTGTGGA ACAATATTGA AGACCGTCAG
CTGGACGTGC AATTCTTCCA TATGGGAATG GGTTTTCGCC GCCGGGTAAG AATGTTCTCG
CTGGATTCCG CCTCGTCCCA GGCGCGTGAA ATTCATTTCC GTCCTGAATT GTTTAACTAC
AACGATGCGG GTGTGGATAC GAAACAGCTC GAAGGGCAAA GCGACTTAGG GTTTGCCGGG
TTCCGCGCGT TCAAAGCGCC CGAACTGGCG CGTCGCGATA TCGTCTCGTT CCTTGGCGCG
AGCTATTTCC GTGCGGTAGA TGACACGTAT CAGTACGGTC TTTCCGCGCG CGGTCTGGCG
ATTGATACCT TTACGGATAC GCCCGAAGAG TTCCCTGATT TCACCTCGTT CTGGTTTGAA
ACCGTGAAAC CGGGCGATAC CACCTTTACC GTCTACACGC TGCTCGACAG CCCAAGTATC
ACTGGCGCAT ATAAATTCGT GATTCATTGC GAGAAGAGCC AGGTGATCAT GGAGGTGGAT
AACCACCTGT ATGCACGTAA AGATATCAAA CAGCTCGGCA TTTCTCCGAT GACCAGTATG
TTTGCCTGCG GCAATAACGA GCGTCGCATG TGCGATACCA TTCACCCGCA AATCCACGAC
TCCGATCGTC TGGCGATGTG GCGTGGGAAT GGGGAATGGA TTTGCCGTCC GCTGAATAAT
CCTCAGAAAT TACAGTTCAA CGCTTACCAG GACAAAAACC CGAAAGGCTT TGGTCTCCTC
CAGCTCGACC GCGATTTCTC CCACTATCAG GACATCATGG GCTGGTACAA CAAGCGCCCG
AGTCTGTGGG TTGAACCGCG CAACCAGTGG GGCAAAGGCA GCGTAGGCCT GATGGAGATC
CCGACCACGG GTGAAACGCT GGATAACGTT GTGTGCTTCT GGCAGCCAGA AAAACCAGTG
AAAGCGGGCG ACGAGCTGGA CTTCAAATAT CGCCTGTACT GGAGCGCGCA GCCGCCGGTT
CGTTCTCCGC TGGCAAATGT GTATGCCACG CGGACGGGGA TGGGCGGCTT CCCGGAAGGC
TGGGCACCGG GCGAAAACTA CCCGAAAACC TGGGCGCGTC GTTTTGCCAT TGATTTCGTT
GGCGGTGATT TAAAGGCCGC CGCGCCGAAA GGTATCGAAC CGGTGATTAC GCTCTCTAAC
GGTGAAGCGC GCCAGGTTGA AATCCTTTAT GTTGAACCGT TCGATGGCTA TCGCATTTTG
TTCGACTGGT ACCCAACAAA CGATTCAACC GATCCGATCG ACATGCGTAT GTTCCTGCGC
TGTCAGGGCG ATGCGATCAG CGAAACCTGG CTGTATCAGT ATTTCCCGCC TGCGCCTGAC
AAACGTGTCT ATGTTGATGA CCGCGTAATG CGCTAA
 
Protein sequence
MNRRRFLQGS LAMAALSGTT GLSTLFSRAA FAADSDIADG QSRRFDFSVL QSMAHDLAQT 
PWGGAPRPLP NTLATMTPQA YNAIQYDAKQ SLWNNIEDRQ LDVQFFHMGM GFRRRVRMFS
LDSASSQARE IHFRPELFNY NDAGVDTKQL EGQSDLGFAG FRAFKAPELA RRDIVSFLGA
SYFRAVDDTY QYGLSARGLA IDTFTDTPEE FPDFTSFWFE TVKPGDTTFT VYTLLDSPSI
TGAYKFVIHC EKSQVIMEVD NHLYARKDIK QLGISPMTSM FACGNNERRM CDTIHPQIHD
SDRLAMWRGN GEWICRPLNN PQKLQFNAYQ DKNPKGFGLL QLDRDFSHYQ DIMGWYNKRP
SLWVEPRNQW GKGSVGLMEI PTTGETLDNV VCFWQPEKPV KAGDELDFKY RLYWSAQPPV
RSPLANVYAT RTGMGGFPEG WAPGENYPKT WARRFAIDFV GGDLKAAAPK GIEPVITLSN
GEARQVEILY VEPFDGYRIL FDWYPTNDST DPIDMRMFLR CQGDAISETW LYQYFPPAPD
KRVYVDDRVM R
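
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, translated, and checked against the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation and differs in which start codons it allows; since this gene begins with ATG, the sketch does not handle alternative start codons.

```python
# Standard codon assignments, listed in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAATCGCA GACGTTTTTT ACAGGGCTCG CTGGCAATGG CCGCACTGAG CGGTACAACC"
last_row = "AAACGTGTCT ATGTTGATGA CCGCGTAATG CGCTAA"

print(translate(first_row))  # MNRRRFLQGSLAMAALSGTT
print(translate(last_row))   # KRVYVDDRVMR
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence, Protein Length, and GC content fields of this record.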