Gene EcHS_A3727 details

Gene Information

Locus tag: EcHS_A3727
Symbol:
ID: 5593756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3717602
End bp: 3719662
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 53%
IMG OID: 640922842
Product: AsmA family protein
Protein accession: YP_001460321
Protein GI: 157163003
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
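
The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon; both are assumptions, although they are the ones under which the listed numbers match. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3717602
END_BP = 3719662
GENE_LENGTH_BP = 2061
PROTEIN_LENGTH_AA = 686


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2061
    print(expected_protein_length(GENE_LENGTH_BP))  # 686
```

With these assumptions, 3719662 - 3717602 + 1 gives 2061 bp, and 2061 / 3 - 1 gives 686 aa, matching the Gene Length and Protein Length fields.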


Plasmid Coverage information

Num covering plasmid clones: 64
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGG CAGGCAAAAT AACCGCTGCG ATTTCAGGGG CTTTCTTGTT GTTGATTGTC 
GTGGCGATCA TTTTGATTGC AACTTTTGAC TGGAATCGAC TCAAACCGAC TATCAACCAG
AAAGTCTCTG CGGAGTTGAA TCGTCCGTTC GCTATCCGTG GCGATCTGGG CGTGGTGTGG
GAGCGGCAAA AACAAGAAAC TGGCTGGCGC AGCTGGGTGC CGTGGCCCCA TGTACACGCG
GAAGACATCA TTCTTGGCAA TCCACCGGAT ATTCCCGAAG TCACGATGGT GCATTTGCCA
CGCGTAGAGG CAACGCTGGC CCCGCTGGCG CTGCTGACCA AAACGGTCTG GCTGCCGTGG
ATCAAGCTCG AAAAGCCCGA CGCGCGCCTG ATTCGCCTCT CTGAAAAGAA CAATAACTGG
ACGTTTAATC TTGCCAACGA TGATAACAAA GACGCGAATG CAAAGCCGTC GGCATGGTCG
TTTCGGCTGG ATAATATTCT TTTCGATCAA GGGCGGATCG CCATTGATGA CAAAGTAAGC
AAAGCGGATC TGGAGATTTT TGTTGATCCC TTAGGCAAGC CGCTGCCGTT CAGCGAAGTT
ACTGGATCGA AAGGTAAAGC GGATAAAGAA AAGGTGGGCG ATTACGTTTT TGGCCTGAAG
GCGCAGGGAC GATATAACGG TGAACCGCTC ACGGGTACGG GAAAAATAGG CGGTATGCTG
GCGCTGCGTG GCGAAGGGAC GCCGTTTCCG GTACAGGCTG ATTTCCGCTC TGGTAACACC
CGTGTTGCTT TTGATGGCGT CGTGAATGAC CCAATGAAGA TGGGCGGTGT CGATTTACGG
CTTAAATTTT CTGGCGATTC ACTGGGTGAT CTCTATGAAC TGACGGGCGT TCTGCTGCCC
GATACCCCGC CGTTTGAAAC GGATGGTCGG CTGGTAGCGA AAATCGACAC TGAAAAATCG
TCGGTCTTTG ATTATCGCGG TTTTAATGGG CGAATTGGTG ATAGCGATAT CCACGGTTCT
CTGGTCTACA CCACCGGAAA GCCACGACCA AAACTGGAAG GTGATGTCGA GTCGCGGCAA
TTGCGGCTGG CGGACCTGGG ACCGTTGATT GGCGTTGATT CCGGGAAAGG GGCAGAAAAG
TCGAAACGGT CTGAACAGAA GAAGGGCGAA AAAAGCGTTC AGCCTGCGGG CAAAGTGCTG
CCTTATGACC GCTTCGAAAC CGATAAATGG GACGTTATGG ATGCCGATGT TCGCTTCAAA
GGGCGGCGCA TTGAGCATGG CAGTAGCCTG CCGATTAGCG ATCTTTCTAC TCATATCATC
CTCAAAAATG CTGACCTGCG CCTGCAACCG CTGAAATTTG GCATGGCGGG CGGCAGCATT
GCGGCGAATA TTCATCTGGA AGGCGATAAA AAGCCGATGC AGGGGCGGGC AGATATTCAG
GCTCGTCGAC TGAAACTGAA AGAACTGATG CCCGATGTGG AACTGATGCA GAAGACGCTG
GGGGAAATGA ACGGTGACGC GGAACTACGC GGTAGCGGTA ACTCGGTGGC GGCACTTTTA
GGCAACAGTA ACGGCAACCT GAAACTGTTG ATGAATGACG GGCTGGTGAG CCGCAACCTG
ATGGAGATTG TTGGGCTGAA TGTCGGCAAC TACATTGTCG GTGCGATATT TGGTGATGAT
GAGGTGCGGG TGAACTGCGC GGCGGCGAAT CTGAATATTG CCAACGGCGT GGCGCGCCCG
CAGATTTTTG CTTTCGATAC TGAGAACGCG TTGATTAATG TTACCGGCAC GGCAAGTTTT
GCTTCGGAAC AGCTGGATTT GACTATTGAT CCGGAGAGTA AAGGAATTCG GATTATCACA
CTGCGTTCGC CGCTGTATGT GCGGGGGACG TTTAAAAATC CGCAGGCTGG GGTGAAAGCC
GGACCGCTGA TTGCCCGTGG TGCTGTTGCT GCGGCACTGG CAACGCTGGT AACACCGGCG
GCGGCGTTAC TGGCACTGAT CTCACCTTCC GAAGGGGAGG CTAATCAGTG TCGGACGATT
TTGTCGCAGA TGAAGAAGTG A
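
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below strip the layout whitespace and compute a GC percentage of the kind reported in the GC content field. The function names are hypothetical, and how the database itself rounds or computes that field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as sample input.
    first_line = (
        "ATGAGCAAGG CAGGCAAAAT AACCGCTGCG "
        "ATTTCAGGGG CTTTCTTGTT GTTGATTGTC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through clean_sequence and gc_percent would give a value to compare against the GC content field.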
 
Protein sequence
MSKAGKITAA ISGAFLLLIV VAIILIATFD WNRLKPTINQ KVSAELNRPF AIRGDLGVVW 
ERQKQETGWR SWVPWPHVHA EDIILGNPPD IPEVTMVHLP RVEATLAPLA LLTKTVWLPW
IKLEKPDARL IRLSEKNNNW TFNLANDDNK DANAKPSAWS FRLDNILFDQ GRIAIDDKVS
KADLEIFVDP LGKPLPFSEV TGSKGKADKE KVGDYVFGLK AQGRYNGEPL TGTGKIGGML
ALRGEGTPFP VQADFRSGNT RVAFDGVVND PMKMGGVDLR LKFSGDSLGD LYELTGVLLP
DTPPFETDGR LVAKIDTEKS SVFDYRGFNG RIGDSDIHGS LVYTTGKPRP KLEGDVESRQ
LRLADLGPLI GVDSGKGAEK SKRSEQKKGE KSVQPAGKVL PYDRFETDKW DVMDADVRFK
GRRIEHGSSL PISDLSTHII LKNADLRLQP LKFGMAGGSI AANIHLEGDK KPMQGRADIQ
ARRLKLKELM PDVELMQKTL GEMNGDAELR GSGNSVAALL GNSNGNLKLL MNDGLVSRNL
MEIVGLNVGN YIVGAIFGDD EVRVNCAAAN LNIANGVARP QIFAFDTENA LINVTGTASF
ASEQLDLTID PESKGIRIIT LRSPLYVRGT FKNPQAGVKA GPLIARGAVA AALATLVTPA
AALLALISPS EGEANQCRTI LSQMKK
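
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The sketch below builds the codon table from the usual compact 64-letter string in TCAG order; table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The translate function is a hypothetical helper, not a database or library routine, and it raises KeyError on any character other than A, C, G, or T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGAGCAAGG CAGGCAAAAT AACCGCTGCG"))  # MSKAGKITAA
```

The first ten codons give MSKAGKITAA, the opening block of the protein sequence, and the final codons ATG AAG AAG TGA give MKK followed by the stop.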