Gene EcHS_A2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2193 
Symbol 
ID5594349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2175221 
End bp2176357 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content44% 
IMG OID640921326 
Productpolysaccharide biosynthesis/export protein 
Protein accessionYP_001458865 
Protein GI157161547 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00000597973 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA AAATTGTTAG ATTTTCGGCA TTAGCGTTGG CAATTGGGTT TTTATCGGGT 
TGTACCATTA TCCCTGGTCA GGGATTAAAC AGTCTGCGGA AAAACGTGGT TGAGCTACCG
GACAGTGACT ACGATCTGGA TAAACTGGTT AATGTGTACC CTATGACTCC TGGGCTTATC
GATCAACTTC GTCCAGAGAC CGTTCTGGCG AGACCTAACC CACAATTAGA TAATTTGCTC
CGAAGCTATG AATATCGCAT TGGTGTGGGC GATGTATTGA TGGTTACGGT ATGGGATCAC
CCGGAACTGA CAACGCCAGC AGGTCAGTAC CGTAGCGCCA GCGACACTGG TAACTGGGTT
AACTCTGACG GTACCATTTT CTATCCATAT ATTGGTAAGG TGCAGGTGGC GGGCAAAACG
CTTAGCCAGG TACGCCAGGA TATAGCCAAC CGATTGGCCA CTTATATTGA AAGCCCACAG
GTTGATGTTA GCGTTGCTGC GTTTCGTTCT CAAAAGGTTT ACGTGACAGG CGAAGTGACA
AAATCAGGCC AGCAACCTAT TACCAATATT CCTTTAACGG TTATGGATGC AATAAATGCC
GCTGGTGGGC TGGCACCAGA CGCAGATTGG CGTAATGTTG TGCTGACTCA TAATGGTAAA
GATACAAAAG TATCACTTTA TGCATTAATG CAAAAAGGGG ATTTAACACA AAATCATATG
TTATATCCTG GAGATATTCT CTTTGTACCA AGGAATGACG ATCTTAAAGT GTTTGTCATG
GGAGAGGTTG GCAAGCAGAG CACATTGAAG ATGGATCGTA GTGGAATGAC ATTAGCAGAG
GCAATCGGGA ATGCGGAAGG CATGTCTCAA GCGTACAGTG ATGCCACGGG AGTCTTCGTT
ATTCGCCAAC TGAAAGGTGA TAAACAAGGT AAAATTGCTA ATATCTATCA GTTGAACGCG
CAAGATGCCT CCGCGATGGT TCTTGGTACA GAATTTGAAT TACAACCTTA TGATATCGTC
TATGTCACAT CGGCTCCATT AGTACGTTGG AATCGTGTAA TTTCCCAACT TGTACCTACC
ATTACTGGAG TACATGATAT GACAGAAACT GTAAGATATA TTAGGACCTG GCCATAA
 
Protein sequence
MKKKIVRFSA LALAIGFLSG CTIIPGQGLN SLRKNVVELP DSDYDLDKLV NVYPMTPGLI 
DQLRPETVLA RPNPQLDNLL RSYEYRIGVG DVLMVTVWDH PELTTPAGQY RSASDTGNWV
NSDGTIFYPY IGKVQVAGKT LSQVRQDIAN RLATYIESPQ VDVSVAAFRS QKVYVTGEVT
KSGQQPITNI PLTVMDAINA AGGLAPDADW RNVVLTHNGK DTKVSLYALM QKGDLTQNHM
LYPGDILFVP RNDDLKVFVM GEVGKQSTLK MDRSGMTLAE AIGNAEGMSQ AYSDATGVFV
IRQLKGDKQG KIANIYQLNA QDASAMVLGT EFELQPYDIV YVTSAPLVRW NRVISQLVPT
ITGVHDMTET VRYIRTWP