Gene EcHS_A0363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0363 
Symbol 
ID5592457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp376124 
End bp378421 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content46% 
IMG OID640919548 
Productputative outer membrane autotransporter 
Protein accessionYP_001457134 
Protein GI157159816 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.497355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA GTAAGGCATT TTACCGCAGC GCATTAGCGA CAGCTATTGT TATGGCTCTT 
TCTGCACCAG CATTCGCTGC TGATAACGCG GTATCAACTA CACCGGTTAC ACTGGATAAA
GATAAGACAA CTCTGGATCA AGATGTTGTT ATTAGCAATA CAGCAGACCA ACAGATTACA
GCCGTAACAA TTAATGCGGC AGATAAAGAT CTTAATGTTA CTTTTGGCGG TCATGATATT
ACTGCTGAAT CAACGGCAGA CAAAAAATTC CTTGAAGGTG TAAAAGTTAG CGGTGACAAA
AATGTTGTGA TTAATGCTAC AGGCTCCACC ATCACAGCTC AAGGTGAAGG CACCTATGTC
CGGACTGCAA TGGTCATTTC TTCAACTGGC GATGTTGTTG TTAATGGCGG TAATTTCGTT
GCAAAAAATG AAAAAAGTAG TGCGACAGGA ATATCTCTGG AAGGGGCCAC GGGAAATAAT
GTAACGCTAA ATGGTACAAC CATAAATGCT CAAGGTAATA AGAGTTCCAG CAACGGCTCT
ACGGCAATTT TTGCTCAAAA GGGTAGCGTA TTGAATGGTT TTAAAGGTGA TGCAACCGAC
AACATTACCC TTGCTGGCTC AAATATTATT AATGGCCGGA TTGAAACAAT AGTTATTGCC
AAGGAGAATA CGGGAACTCA TACAGTCAAT CTGAATATTA AGGATGGCTC AGTAATTGGG
GCGGCTAATA ATAAACAAAC AATTTATGCT TCTGCTTCGG CACAAGGAGC AGGTTCAGCA
ACGCAAAATT TAAATTTATC TGTCGCCGAT TCAACCATCT ACTCTGATAT CCATGCCCTT
TCTGCAAGCG AGAATTCAGC CGGTACCACA ACAAATGTAA ACATGAACGT TGCCCGCTCT
TACTGGGAAG GTAATGCTTA TACCTTCAAT AGCGGCGATA AAGCGGGTAG TAATCTGGAT
ATAAATCTTT CCGATAGCTC AGTCTGGAAA GGCAAAGTTT CAGGGGCAGG AGATGCCAGT
GTATCTCTGC AAAACGGGTC TGTCTGGAAT GTTACGGGAT CCTCAACTGT TGATGCTCTG
GCAGTAAAAG ACAGTACGGT TAATATCACG AAGGCTACAG TCAATACTGG CACGTTTGCT
TCTCAGAACG GCACTCTGAT TGTTGATGCC TCTTCTGAAA ACACTCTGGA TATCAGCGGA
AAAGCGAGCG GTGACTTGCG TGTTTACAGT GCGGGTTCAT TGGATCTTAT CAATGAACAA
ACGGCATTTA TTTCTACCGG CAAAGACAGC ACTCTAAAAG CCACAGGCAC AACGGAAGGT
GGTCTGTATC AATATGACCT GACACAGGGA GCTGATGGTA ACTTTTATTT CGTAAAAAAC
ACGCATAAAG CATCCAACGC CAGCTCCCTG ATTCAGGCAA TGGCAGCAGC TCCGGCTAAC
GTTGCTAATC TGCAGGCTGA CACGCTCTCA GCCCGTCAGG ATGCTGTCCG TCTGAGCGAA
AATGACAAGG GCGGCGTATG GATTCAGTAC TTTGGCGGTA AACAGAAACA TACCACCGCG
GGAAATGCAT CCTATGACCT GGATGTAAAC GGTGTAATGC TGGGTGGTGA TACCCGCTTC
ATGACTGAAG ATGGTAGCTG GCTGGCAGGT GTAGCGATGT CTTCTGCGAA AGGTGACATG
ACCACCATGC AGAGCAAAGG CGACACTGAA GGTTACAGCT TCCACGCTTA CCTGAGCCGC
CAGTATAACA ACGGTATCTT CATTGATACT GCTGCACAGT TTGGTCACTA CAGCAACACG
GCAGATGTTC GCCTGATGAA TGGTGGCGGT ACCATCAAAG CTGACTTTAA CACCAATGGT
TTCGGTGCGA TGGTTAAAGG CGGTTACACA TGGAAAGACG GTAATGGCCT GTTTATTCAG
CCATATGCCA AACTGTCTGC GCTGACGCTG GAAGGTGTGG ATTATCAGCT CAACGGCGTG
GACGTTCATT CTGACAGCTA TAACTCTGTG CTGGGTGAGG CCGGTACGCG CGTGGGTTAT
GACTTCGCTG TGGGCAACTC GACCGTTAAA CCTTATCTGA ATCTGGCCGC ACTGAACGAA
TTCTCTGATG GCAACAAAGT CCGTCTGGGT GATGAGTCTG TCAATGCCAG CATTGACGGT
GCAGCATTCC GCGTGGGTGC AGGTGTACAG GCTGATATCA CCAAAAACAT GGGAGCATAT
GCAAGCCTTG ACTACACCAA AGGTGACGAC ATTGAGAACC CGCTACAGGG TGTAGTTGGT
ATCAATGTGA CCTGGTAA
 
Protein sequence
MKNSKAFYRS ALATAIVMAL SAPAFAADNA VSTTPVTLDK DKTTLDQDVV ISNTADQQIT 
AVTINAADKD LNVTFGGHDI TAESTADKKF LEGVKVSGDK NVVINATGST ITAQGEGTYV
RTAMVISSTG DVVVNGGNFV AKNEKSSATG ISLEGATGNN VTLNGTTINA QGNKSSSNGS
TAIFAQKGSV LNGFKGDATD NITLAGSNII NGRIETIVIA KENTGTHTVN LNIKDGSVIG
AANNKQTIYA SASAQGAGSA TQNLNLSVAD STIYSDIHAL SASENSAGTT TNVNMNVARS
YWEGNAYTFN SGDKAGSNLD INLSDSSVWK GKVSGAGDAS VSLQNGSVWN VTGSSTVDAL
AVKDSTVNIT KATVNTGTFA SQNGTLIVDA SSENTLDISG KASGDLRVYS AGSLDLINEQ
TAFISTGKDS TLKATGTTEG GLYQYDLTQG ADGNFYFVKN THKASNASSL IQAMAAAPAN
VANLQADTLS ARQDAVRLSE NDKGGVWIQY FGGKQKHTTA GNASYDLDVN GVMLGGDTRF
MTEDGSWLAG VAMSSAKGDM TTMQSKGDTE GYSFHAYLSR QYNNGIFIDT AAQFGHYSNT
ADVRLMNGGG TIKADFNTNG FGAMVKGGYT WKDGNGLFIQ PYAKLSALTL EGVDYQLNGV
DVHSDSYNSV LGEAGTRVGY DFAVGNSTVK PYLNLAALNE FSDGNKVRLG DESVNASIDG
AAFRVGAGVQ ADITKNMGAY ASLDYTKGDD IENPLQGVVG INVTW