Gene EcHS_A0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0437 
Symbol 
ID5593254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp456918 
End bp458315 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content49% 
IMG OID640919622 
Productouter membrane autotransporter 
Protein accessionYP_001457207 
Protein GI157159889 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGATTC AGAATGACAT CTCCAGCGTA TTTGCAACTG ATACTCTGGA ACTGACCAGT 
GGCAACGTTA AAGATAACAA CGGTAATGTT TACGCAGGTG TGTTTGACAT TCATAGTAAT
GACTACATCC TGAATGCAGA CCTCGTCAAC GATCGTACCA ACGACACCAG CAAAGCTAAC
TATGGTTATG GTGTTATTGC GATGAACTCT GATGGTCACT TGACCGTTAA TGGTAACAAC
GACATTAATA ATGGCGACGA AGTTGACAAC AGCTCTGTTG ACAACGTTGT AGCTGCAACC
GGTAACTACA AAGTTCGTAT CGACAACTCC ACTGGTGCTG GCGCGATTGC TGATTACGCA
GGTAAGCAGC TCATCTATAT CGATGATACC AAAACCAACG CGACCTTCTC TGCTGCTAAC
AAAGCTGACC TGGGTGCATA CACCTATCAG GCTGAACAGC GCGGTAACAC CGTTGTTCTG
CAACAGATGG AGCTGACCGA CTACGCTAAC ATGGCGCTGA GCATCCCTTC TGCGAACACC
AATATCTGGA ACCTGGAACA AGACACCGTT GGTACTCGTC TGACCAACTC TCGTCATGGC
CTGGCTGATA ACGGCGGCGC ATGGGTAAGC TACTTCGGTG GTAACTTCAA CGGCGACAAC
GGCACCATCA ACTATGATCA GGATGTTAAC GGCATCATGG TCGGTGTTGA TACCAAAATT
GACGGTAACA ACGCTAAGTG GATCGTCGGT GCGGCTGCAG GCTTCGCTAA AGGTGACATG
AATGACCGTT CTGGTCAGGT AGATCAAGAC AGCCAGACTG CCTACATCTA CTCTTCTGCT
CACTTCGCGA ACAACGTCTT TGTTGATGGT AGCTTGAGCT ACTCTCACTT CAACAACGAC
CTGTCTGCAA CCATGAGCAA CGGTACTTAC GTTGACGGTA GCACCAACTC CGACGCTTGG
GGCTTCGGCT TGAAAGCCGG TTACGACTTC AAACTGGGTG ATGCTGGTTA TGTGACTCCT
TACGGCAGCA TTTCTGGTCT GTTCCAGTCT GGTGATGACT ACCAGCTGAG CAACGACATG
AAAGTTGACG GTCAGTCTTA CGACAGCATG CGTTATGAAC TGGGTGTAGA TGCAGGTTAT
ACCTTCACCT ACAGCGAAGA CCAGGCTCTG ACTCCGTACT TCAAACTGGC TTACGTCTAC
GACGACTCTA ACAACGATAA CGATGTGAAC GGTGATTCCA TCGATAACGG TACTGAAGGG
TCTGCGGTAC GTGTTGGTCT GGGTACTCAG TTCAGCTTCA CCAAGAACTT CAGCGCCTAT
ACCGATGCTA ACTACCTCGG TGGTGGTGAC GTAGATCAAG ATTGGTCCGC GAACGTGGGT
GTTAAATATA CCTGGTAA
 
Protein sequence
MAIQNDISSV FATDTLELTS GNVKDNNGNV YAGVFDIHSN DYILNADLVN DRTNDTSKAN 
YGYGVIAMNS DGHLTVNGNN DINNGDEVDN SSVDNVVAAT GNYKVRIDNS TGAGAIADYA
GKQLIYIDDT KTNATFSAAN KADLGAYTYQ AEQRGNTVVL QQMELTDYAN MALSIPSANT
NIWNLEQDTV GTRLTNSRHG LADNGGAWVS YFGGNFNGDN GTINYDQDVN GIMVGVDTKI
DGNNAKWIVG AAAGFAKGDM NDRSGQVDQD SQTAYIYSSA HFANNVFVDG SLSYSHFNND
LSATMSNGTY VDGSTNSDAW GFGLKAGYDF KLGDAGYVTP YGSISGLFQS GDDYQLSNDM
KVDGQSYDSM RYELGVDAGY TFTYSEDQAL TPYFKLAYVY DDSNNDNDVN GDSIDNGTEG
SAVRVGLGTQ FSFTKNFSAY TDANYLGGGD VDQDWSANVG VKYTW