Gene EcE24377A_0396 details

Gene Information

Locus tag: EcE24377A_0396
Symbol:
ID: 5589161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 424630
End bp: 427398
Gene Length: 2769 bp
Protein Length: 922 aa
Translation table: 11
GC content: 45%
IMG OID: 640924120
Product: outer membrane autotransporter
Protein accession: YP_001461547
Protein GI: 157157315
COG category: [M] Cell wall/membrane/envelope biogenesis
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR03304] outer membrane insertion C-terminal signal
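
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(424630, 427398)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths match the listed Gene Length (2769 bp) and Protein Length (922 aa).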


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.381376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCTGGAG ATTCAGGGGG GCAGTCTAGC AACTATGTAA ACTATAGTGG TTTTGTCTAT 
TACAACAACA CCAATGGTGA TTTCGATCAG TCCTTTAACG GCGATACCGT TAACGGGACA
ATCTCAACCT ATTATTTGAA CCATGATTAT GCAGACAGTA CTGCTAATCA GCTTGATATC
AGTAATTCAG TGATTCACGG TTCGATTACT TCTATGCTGC CTGGCGGTTA TTATGATCGT
TTTGATGCAG ATGGTAATAA TCTGGGTGGA TATGATTTTT ACACTGATGC GGTTGTTGAT
ACACACTGGC GTGATGGTGA TGTTTTCACT TTGAACATTG CTAACACTAC TATTGATGAT
GATTATGAAG CTCTTTACTT CACTGATTCT TATAAAGATG GTGATGTAAC CAAGCACACA
AATGAGACAT TTGATACAAG TGAAGGCGTT GCTGTTAATC TTGATGTAGA AAGTAACATC
AATATTTCCA ATAACTCCCG CGTTGCAGGT ATTGCATTAT CTCAAGGTAA TACTTACAAC
GAAACCTACA CTACCGAATC TCATACTTGG GATAACAATA TCTCTGTAAA AGATTCCACA
GTGACTTCGG GTTCAAATTA TATCCTGGAT AGCAATACTT ATGGCAAAAC TGGTCACTTT
GGCAATTCTG ATGAACCGAG TGATTATGCT GGCCCGGGTG ATGTTGCAAT GTCCTTTACT
GCTTCAGGTT CCGACTATGC GATGAAGAAC AATGTATTCC TCAGCAATTC AACGCTGATG
GGTGATGTTG CCTTTACCAG CACCTGGAAT AGTAATTTTG ATCCGAATGG TCATGATTCC
AACGGTGACG GGGTGAAAGA TACCAACGGG GGTTGGACTG ATGATAGCCT CAACGTTGAT
GAACTAAATC TCACTCTCGA TAACGGAAGC AAGTGGGTTG GTCAGGCAAT TTATAACGTT
GCTGAAACGT CAGCAATGTA TGATGTTGCT ACAAACAGCC TTACTCCTGA TGCAACATAT
GAAAACAATG ACTGGAAACG TGTTGTTGAT GACAAGGTCT TCCAGAGCGG TGTATTTAAC
GTAGCGTTGA ATAACGGTTC TGAATGGGAT ACTACAGGTC GTTCCATCGT TGATACCTTG
ACAGTTAATA ATGGTTCTCA GGTTAATGTT TCGGAATCTA AATTAACTTC AGATACTATC
GATTTAACTA ACGGTTCTTC GCTGAACATT GGTGAAGATG GCTACGTTGA TACCGATCAT
CTGACTATTA ACTCCTACAG TACTGTTGCG TTGACCGAAT CTACTGGGTG GGGGGCTGAT
TACAACCTGT ACGCCAATAC TATCACCGTA ACTAACGGTG GTGTATTGGA TGTGAACGTT
GATCAGTTCG ATACTGAAGC TTTCCGTACT GACAAACTGG AACTGACCAG CGGCAACATC
GCTGACCATA ACGGTAACGT AGTATCTGGT GTGTTCGATA TCCATAGCAG CGATTACGTT
CTGAACGCTG ATCTGGTGAA CGACCGTACG TGGGATACTT CCAAGTCTAA CTACGGTTAC
GGTATTGTTG CTATGAACTC TGACGGTCAC CTGACTATCA ATGGTAACGG CGACGTAGAC
AACGGTACTG AACTGGATAA CAGCTCTGTT GATAACGTTG TTGCTGCAAC CGGTAACTAC
AAAGTTCGTA TCGACAACGC AACTGGCGCT GGCGCTATCG CTGATTACAA AGATAAAGAA
ATTATCTACG TAAACGACGT CAACACCAAC GCGACCTTCT CTGCTGCTAA CAAAGCTGAC
CTGGGTGCAT ACACCTATCA GGCTGAACAG CGCGGTAACA CCGTTGTTCT GCAACAGATG
GAGCTGACCG ACTACGCTAA CATGGCGCTG AGCATCCCGT CTGCGAACAC CAATATCTGG
AACCTGGAAC AAGACACCGT TGGTACTCGT CTGACCAACT CTCGTCATGG CCTGGCTGAT
AACGGCGGCG CATGGGTAAG CTACTTCGGT GGTAACTTCA ACGGCGACAA CGGCACCATC
AACTATGATC AGGATGTTAA CGGCATCATG GTCGGTGTTG ATACCAAAAT TGACGGTAAC
AACGCTAAGT GGATCGTCGG TGCGGCTGCA GGCTTCGCTA AAGGTGACAT GAATGACCGT
TCTGGTCAGG TGGATCAAGA CAGCCAGACT GCCTACATCT ACTCTTCTGC TCACTTCGCG
AACAACGTCT TTGTTGATGG TAGCTTGAGT TACTCTCACT TCAACAACGA CCTGTCTGCA
ACCATGAGCA ACGGTACTTA CGTTGACGGT AGCACCAACT CCGACGCTTG GGGCTTCGGT
TTGAAAGCCG GTTACGACTT CAAACTGGGT GATGCTGGTT ACGTGACTCC TTACGGCAGC
ATTTCTGGTC TGTTCCAGTC TGGTGATGAC TACCAGCTGA GCAACGACAT GAAAGTTGAC
GGTCAGTCTT ACGACAGCAT GCGTTATGAA CTGGGTGTAG ATGCAGGTTA TACCTTCACC
TACAGCGAAG ACCAGGCTCT GACTCCGTAC TTCAAACTGG CTTACGTCTA CGACGACTCT
AACAACGATA ACGATGTGAA CGGTGATTCC ATCGATAACG GTACTGAAGG GTCTGCGGTA
CGTGTTGGTC TGGGTACTCA GTTCAGCTTC ACCAAGAACT TCAGCGCCTA TACCGATGCT
AACTACCTCG GTGGTGGTGA CGTAGATCAA GACTGGTCCG CGAACGTGGG TGTTAAATAT
ACCTGGTAA
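
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as text in the 10-base blocks shown here. The function name is illustrative, and how the source database rounds its percentage is not stated on this page.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the final line of the gene sequence only, not the whole gene.
print(round(gc_percent("ACCTGGTAA"), 1))
```

Passing the full 2769 bp sequence instead of a single line gives the figure to compare against the listed GC content.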
 
Protein sequence
MSGDSGGQSS NYVNYSGFVY YNNTNGDFDQ SFNGDTVNGT ISTYYLNHDY ADSTANQLDI 
SNSVIHGSIT SMLPGGYYDR FDADGNNLGG YDFYTDAVVD THWRDGDVFT LNIANTTIDD
DYEALYFTDS YKDGDVTKHT NETFDTSEGV AVNLDVESNI NISNNSRVAG IALSQGNTYN
ETYTTESHTW DNNISVKDST VTSGSNYILD SNTYGKTGHF GNSDEPSDYA GPGDVAMSFT
ASGSDYAMKN NVFLSNSTLM GDVAFTSTWN SNFDPNGHDS NGDGVKDTNG GWTDDSLNVD
ELNLTLDNGS KWVGQAIYNV AETSAMYDVA TNSLTPDATY ENNDWKRVVD DKVFQSGVFN
VALNNGSEWD TTGRSIVDTL TVNNGSQVNV SESKLTSDTI DLTNGSSLNI GEDGYVDTDH
LTINSYSTVA LTESTGWGAD YNLYANTITV TNGGVLDVNV DQFDTEAFRT DKLELTSGNI
ADHNGNVVSG VFDIHSSDYV LNADLVNDRT WDTSKSNYGY GIVAMNSDGH LTINGNGDVD
NGTELDNSSV DNVVAATGNY KVRIDNATGA GAIADYKDKE IIYVNDVNTN ATFSAANKAD
LGAYTYQAEQ RGNTVVLQQM ELTDYANMAL SIPSANTNIW NLEQDTVGTR LTNSRHGLAD
NGGAWVSYFG GNFNGDNGTI NYDQDVNGIM VGVDTKIDGN NAKWIVGAAA GFAKGDMNDR
SGQVDQDSQT AYIYSSAHFA NNVFVDGSLS YSHFNNDLSA TMSNGTYVDG STNSDAWGFG
LKAGYDFKLG DAGYVTPYGS ISGLFQSGDD YQLSNDMKVD GQSYDSMRYE LGVDAGYTFT
YSEDQALTPY FKLAYVYDDS NNDNDVNGDS IDNGTEGSAV RVGLGTQFSF TKNFSAYTDA
NYLGGGDVDQ DWSANVGVKY TW
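
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first line of the gene sequence. It builds the standard codon assignments by hand rather than using a library, and the initiation codon set is written from the NCBI description of table 11 and should be checked against the current table. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (first, second, third base over TCAG).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "TTGTCTGGAG ATTCAGGGGG GCAGTCTAGC AACTATGTAA ACTATAGTGG TTTTGTCTAT"
print(translate(first_line))  # MSGDSGGQSSNYVNYSGFVY
```

The output matches the first 20 residues of the protein sequence above. Translating the final gene line, ACCTGGTAA, gives TW followed by a stop, which matches the end of the protein.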