Gene EcHS_A1170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1170 
SymbolmdoG 
ID5591802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1173008 
End bp1174543 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content50% 
IMG OID640920329 
Productglucan biosynthesis protein G 
Protein accessionYP_001457892 
Protein GI157160574 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0753618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA TGCGTTGGTT GAGTGCTGCA GTAATGTTAA CCCTGTATAC ATCTTCAAGC 
TGGGCTTTCA GTATTGATGA TGTCGCAAAG CAAGCTCAAT CCTTAGCCGG GAAAGGCTAT
GAGGCGCCCA AAAGCAACTT GCCCTCCGTT TTCCGCGATA TGAAATACGC GGACTATCAG
CAGATCCAGT TTAATCATGA CAAAGCGTAC TGGAACAATC TGAAGACCCC ATTCAAACTC
GAGTTCTACC ATCAGGGTAT GTACTTCGAT ACCCCGGTCA AAATAAATGA AGTGACTGCC
ACCGCAGTCA AACGAATCAA ATACAGCCCG GATTATTTCA CTTTCGGCGA TGTTCAGCAT
GACAAAGACA CGGTAAAAGA CCTTGGTTTT GCCGGTTTTA AAGTGCTTTA CCCGATCAAC
AGCAAAGATA AAAACGATGA AATCGTCAGC ATGCTCGGGG CCAGCTATTT CCGCGTGATT
GGTGCAGGTC AGGTTTATGG CCTTTCTGCC CGCGGCCTGG CAATTGATAC CGCCTTGCCA
TCGGGTGAAG AATTTCCACG CTTCAAAGAG TTCTGGATCG AGCGTCCAAA ACCGACTGAT
AAACGTTTAA CCATTTATGC ATTGCTTGAC TCGCCGCGCG CGACAGGTGC TTACAAATTC
GTGGTTATGC CAGGGCGTGA CACGGTTGTG GATGTGCAGT CGAAAATCTA TCTGCGCGAT
AAAGTCGGCA AACTGGGGGT TGCACCGTTA ACCAGTATGT TCCTGTTTGG GCCGAACCAA
CCGTCGCCTG CAAATAACTA TCGTCCGGAG TTGCACGACT CTAACGGTCT GTCTATCCAT
GCCGGTAATG GCGAATGGAT CTGGCGTCCG TTGAATAACC CGAAACATTT AGCGGTCAGC
AGCTTCTCGA TGGAGAACCC GCAAGGCTTC GGTCTGTTGC AGCGCGGTCG TGATTTCTCC
CGCTTTGAAG ATCTCGATGA TCGTTACGAT CTTCGTCCAA GCGCATGGGT GACTCCGAAA
GGGGAGTGGG GTAAAGGCAG CGTTGAGCTG GTGGAAATTC CAACCAACGA TGAAACCAAC
GATAACATCG TCGCTTACTG GACGCCGGAT CAGCTGCCGG AGCCGGGTAA AGAGATGAAC
TTTAAATACA CCATCACCTT CAGCCGTGAT GAAGACAAAC TGCATGCGCC AGATAACGCA
TGGGTGCAAC AAACGCGTCG TTCAACGGGG GATGTGAAGC AGTCGAACCT GATTCGCCAG
CCTGACGGTA CTATCGCCTT TGTGGTCGAT TTTACCGGCG CAGAGATGAA AAAACTGCCA
GAGGATACCC CGGTCACAGC GCAAACCAGC ATTGGTGATA ATGGTGAGAT AGTTGAAAGC
ACGGTGCGCT ATAACCCGGT TACCAAAGGC TGGCGTCTGG TGATGCGTGT GAAAGTGAAA
GATGCCAAGA AAACCACTGA AATGCGTGCT GCGCTGGTGA ATGCCGATCA GACGTTGAGT
GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
 
Protein sequence
MMKMRWLSAA VMLTLYTSSS WAFSIDDVAK QAQSLAGKGY EAPKSNLPSV FRDMKYADYQ 
QIQFNHDKAY WNNLKTPFKL EFYHQGMYFD TPVKINEVTA TAVKRIKYSP DYFTFGDVQH
DKDTVKDLGF AGFKVLYPIN SKDKNDEIVS MLGASYFRVI GAGQVYGLSA RGLAIDTALP
SGEEFPRFKE FWIERPKPTD KRLTIYALLD SPRATGAYKF VVMPGRDTVV DVQSKIYLRD
KVGKLGVAPL TSMFLFGPNQ PSPANNYRPE LHDSNGLSIH AGNGEWIWRP LNNPKHLAVS
SFSMENPQGF GLLQRGRDFS RFEDLDDRYD LRPSAWVTPK GEWGKGSVEL VEIPTNDETN
DNIVAYWTPD QLPEPGKEMN FKYTITFSRD EDKLHAPDNA WVQQTRRSTG DVKQSNLIRQ
PDGTIAFVVD FTGAEMKKLP EDTPVTAQTS IGDNGEIVES TVRYNPVTKG WRLVMRVKVK
DAKKTTEMRA ALVNADQTLS ETWSYQLPAN E