Gene ECD_01045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01045 
SymbolmdoG 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1111902 
End bp1113437 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content50% 
IMG OID 
Productglucan biosynthesis protein, periplasmic 
Protein accessionACT42940 
Protein GI253977270 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0678082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC 
TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CTTTAGCTGG GAAAGGCTAC
GAGACGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG
CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC
GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC
ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT
GACAAAGATA CGGTAAAAGA CCTTGGCTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC
AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT
GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGCCTGG CAATTGATAC CGCCTTGCCA
TCGGGTGAAG AATTTCCGCG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCGACTGAT
AAACGTTTAA CCATCTATGC ATTGCTTGAC TCGCCGCGTG CGACAGGTGC TTACAAATTC
GTGGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT
AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA
CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT CTCTATCCAT
GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC
AGCTTCTCCA TGGAAAACCC GCAAGGCTTT GGTCTGTTGC AGCGCGGTCG TGATTTCTCC
CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTCCGTCCAA GCGCATGGGT GACTCCGAAA
GGGGAGTGGG GTAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC
GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC
TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA
TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG
CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA
GAGGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC
ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA
GATGCCAAGA AAACCACTGA AATGCGTGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT
GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
 
Protein sequence
MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY ETPKSNLPSV FRDMKYADYQ 
QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH
DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP
SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD
KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS
SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN
DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ
PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK
DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E