Gene EcHS_A0351 details

Gene Information

Locus tag: EcHS_A0351
Symbol:
ID: 5594736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 361814
End bp: 366067
Gene Length: 4254 bp
Protein Length: 1417 aa
Translation table: 11
GC content: 52%
IMG OID: 640919536
Product: putative intimin
Protein accession: YP_001457122
Protein GI: 157159804
COG category:
COG ID:
TIGRFAM ID:
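
The coordinates and lengths in the table above are mutually consistent, and the short Python sketch below shows the arithmetic. It assumes that Start bp and End bp are 1-based, inclusive positions and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 361814
END_BP = 366067


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4254
print(protein_length(length_bp))  # 1417
```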


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.751888
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCGTT ATAAAACAGA TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC 
TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC
CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT
ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG
ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ACTTTATTAC CGGAATGGCC
ACAGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC TGCGCGCGTC
AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG
ATTTATGATA CGCCAACAAA TATGTTGTTC ACTCAGGGAG CAATACATCG TACCGACGAT
CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG
GGGGTGAACA CCTTTATCGA CCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT
GCGGAATACT GGCGCGATTA TCTGAAACTG AGCGCCAATG GTTATATTCG GGCTTCTGGC
TGGAAAAAAT CGCCGGATAT TGAGGATTAT CAGGAACGCC CGGCGAATGG TTGGGATATC
CGCGCAGAGG GCTATTTACC TGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG
TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT
ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG
CAGGGCAAGA GCGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAATTA TCGGATTGGC
GAACCTCTGG AGAAACAACT CGATACAGAC AGCATTCGCG AGCGTCGAAT GCTGGCAGGC
AGCCGCTATG ACCTGGTTGA GCGTAATAAC AACATCGTTC TTGAGTACCG CAAATCTGAA
GTGATCCGTA TTGCTCTGCC TGAGCGTATT GAAGGTAAGG GCGGTCAGAC ACTTTCCCTG
GGGCTTGTGG TCAGCAAAGC AACTCACGGA CTGAAAAATG TGCAGTGGGA AGCGCCGTCA
TTACTGGCTG AAGGTGGCAA AATTACCGGT CAGGGTAGTC AGTGGCAAGT AACGCTCCCG
GCTTATCGTC CAGGCAAAGA CAATTATTAT GCGATTTCTG CGGTTGCCTA CGATAACAAA
GGCAATGCCT CAAAACGCGT GCAGACAGAG GTGGTCATTA CCGGAGCAGG TATGAGTGCC
GATCGCACGG CGTTAACGCT TGACGGTCAG AGCCGTATTC AAATGCTTGC TAACGGTAAT
GAGCAAAGAC CGCTGGTGCT GTCTCTGCGC GACGCCGAGG GCCAGCCAGT CACGGGCATG
AAAGATCAGA TCAAGACTGA ACTAACTTTC AAACCGGCTG GAAATATTGT GACTCGTACC
CTGAAGGCCA CTAAATCACA GGCAAAGCCA ACACTGGGTG AGTTCACCGA AACTGAAGCC
GGGGTGTATC AGTCTGTCTT TACTACCGGA ACGCAGTCAG GTGAGGCAAC GATTACTGTT
AGTGTTGATG GCATGAGCAA AACCGTCACT GCAGAACTGC GGGCCACGAT GATGGATGTG
GCAAACTCCA CCCTGAGCGC TAACGAGCCG TCAGGTGACG TGGTTGCTGA TGGTCAGCAA
GCCTACACGC TGACGCTGAC TGCGGTGGAT ACTGATGGTA ACCCGGTGAC GGGAGAGGCC
AGCCGCTTGC GATTTGTTCC GCAAGACACT AATGGTGTCA CCATTGGTAC AATTTCGGAG
ATAAAACCAG GCGTTTACAG CGCCACGGTT TCTTCGACCC GTGCCGGAAA CGTTGTTGTG
CGTGCTTTCA GCGAGCAGTA TCAGCTGGGC ACATTACAAC AAACGCTGAA GTTTGTTGCC
GGGCCGCTTG ATGCAGCACA TTCGTCCATC ACACTGAATC CTGATAAACC GGTGGTTGGC
GGTACAGTTA CGGCAATCTG GACGGCAAAA GATGCTAATG ACAACCCTGT AACTGGTCTC
AATCCGGATG CACCGTCATT ATCGGGCGCA GCTGCTGCTG GTTCTACGGC ATCAGGCTGG
ACGGATAATG GCGACGGGAC CTGGACTGCG CAGATTTCTC TCGGCACTAC GGCGGGTGAA
TTAGAGGTTA TTCCGAAGCT AAATGGACAG GATGCGGCAG CAAATGCGGC AAAAGTAACC
GTGGTGGCTG ATGCGTTATC TTCAAACCAG TCGAAAGTCT CTGTCGCAGA AGATCACGTA
AAAGCCGGCG AAAGCACAAC CGTGACGCTT ATTGCAAAAG ATGCACATGG CAACGCTATC
AGTGGTCTTT CCCTGTCGGC AAGCCTGACG GGTGCTGCGT CTGAAGGGGC GACTGTTTCT
GGTTGGACCG AAAAAGGTGA TGGTTCCTAT GTCGCTACGC TGACAACAGG TGGAAAGACG
GGTGAGCTTC TCGTCATGCC GCTATTCAAC GGCCAGCCAG CAGCCACCGA AGCCGCGCAG
TTGACTGTCA TTGCGGGGGA GATGTCATCA GCGAACTCTA CGCTTGTTGC TGACAATAAG
GCTCCGACCG TCAAAACGAC GACGAAACTC ACCTTCACCG TGAAGGATGC GTACGGGAAC
CTTGTCACCG GGCTGAAGCC AGATGCACCG CAGTTTAGTG GTGCCGCCAG CACGGGGACA
GAGCGACCTT CAACAGGAGA CTGGACAGAA ACAAGTAATG GGGTCTACGT GGCGACCTTG
ACTCTGGGAT CTGCCGCGGG CCAGTTGTCT GTGATGCCGC GAGTGAACGG CCAAAATGCC
GTTGCTCAGC CACTGGTGCT GAATGTTGCT GGTGACGCAT CTAAGGCTGA GATTCGTGAT
ATGACGGTGA AGGTTGATAA CCAGCTGGCT AATGGACAAT CGACTAACCA GGTAACCCTG
ACCGTTGTGG ACACCTATGG TAACCCGTTG CAGGGACAAA ATGTGACGCT GACTCTGCCG
AAAGGTGTGA CCAGCAAGAC GGGGAATACG GTAACAACCG ATGCGGCAGG TAAAGCCGAC
ATTGAGCTGA TGTCAACGGT TGCCGGGGAA CACAGCATCA CGGCCTCAGT GAATAATGCT
CAGAAGACGG TTACGGTGAA ATTCAAGGCG GATTTCAGTA CCGGTCAGGC GAGTCTGGAG
GTTGATAGCG CCGCGCCAAA AGTAGCAAAC GGCAAAGATG CCTTTACGCT GACGGCGACC
GTTGAGGATA AAAATGGTAA CCCTGTTCCA GGGAGCCTGG TGACCTTTAA TCTGCCCCGG
GGTGTCAAGC CGCTTACAGG CGATAATGTC TGGGTGAAAG CCAACGATGA GGGGAAAGCA
GAGTTGCAGG TGGTTTCAGT GACTGCCGGA ACGTATGAGA TCACGGCATC GGCGGGGAAT
AGCCAGCCTT CGGATACGCA GACTATAACG TTTGTAGCCG ATAAGGCTAC CGCAACCGTC
TCCGGTATTG AGGTGATTGG CAACTATGCG CTGGCGGACG GCAAAGCCAA ACAAACGTAT
AAAGTTACGG TGACTGATGC CAATAACAAT TTGGTGAAAG ATAGCGACGT GACGCTGACT
GCCAGCCCGG CTTCGTTAAA CCTGGAACCG AATGGCACTG CGACAACGAA TGAGCAAGGG
CAGGCTATTT TCACCGCTAC CACTACTGTT GCGGCGACAT ACACACTCAA GGCGCAAGTG
AGTCAGACCA ACGGTCAGGT ATCAACGAAA ACTGCCGAAT CTAAATTCGT TGCGGATGAT
AAAAACGCGG TACTCACCGC ATCATCTGAT ATGCAATCTC TGGTGGCGGA TGGGAAATCG
ACTGCGAAGC TGGAGGTGAC ACTGATGTCG GCAAACAACC CCGTTGGCGG GAATATGTGG
GTCGACATTC AGACGCCGGA AGGGGTGACG GAGAAGGATT ATCAGTTCCT GCCGTCGAAA
AATGACCATT TCGTGAGCGG AAAAATCACG CGTAAATTTA GTACCAGCAA GCCTGGTGTC
TATACGTTCA CATTTAACGC CCTGACCTAT GGCGGGTACG AAATGAAGCC AGTGACGGTG
ACCATTACCG CGGTGGATGC CGATACGGCA AAGGACGAGG AGGCGATGAA ATAA
 
Protein sequence
MSRYKTDHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT 
TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV
KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA
GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDIEDY QERPANGWDI
RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK
QGKSGENDTR FGLEVNYRIG EPLEKQLDTD SIRERRMLAG SRYDLVERNN NIVLEYRKSE
VIRIALPERI EGKGGQTLSL GLVVSKATHG LKNVQWEAPS LLAEGGKITG QGSQWQVTLP
AYRPGKDNYY AISAVAYDNK GNASKRVQTE VVITGAGMSA DRTALTLDGQ SRIQMLANGN
EQRPLVLSLR DAEGQPVTGM KDQIKTELTF KPAGNIVTRT LKATKSQAKP TLGEFTETEA
GVYQSVFTTG TQSGEATITV SVDGMSKTVT AELRATMMDV ANSTLSANEP SGDVVADGQQ
AYTLTLTAVD TDGNPVTGEA SRLRFVPQDT NGVTIGTISE IKPGVYSATV SSTRAGNVVV
RAFSEQYQLG TLQQTLKFVA GPLDAAHSSI TLNPDKPVVG GTVTAIWTAK DANDNPVTGL
NPDAPSLSGA AAAGSTASGW TDNGDGTWTA QISLGTTAGE LEVIPKLNGQ DAAANAAKVT
VVADALSSNQ SKVSVAEDHV KAGESTTVTL IAKDAHGNAI SGLSLSASLT GAASEGATVS
GWTEKGDGSY VATLTTGGKT GELLVMPLFN GQPAATEAAQ LTVIAGEMSS ANSTLVADNK
APTVKTTTKL TFTVKDAYGN LVTGLKPDAP QFSGAASTGT ERPSTGDWTE TSNGVYVATL
TLGSAAGQLS VMPRVNGQNA VAQPLVLNVA GDASKAEIRD MTVKVDNQLA NGQSTNQVTL
TVVDTYGNPL QGQNVTLTLP KGVTSKTGNT VTTDAAGKAD IELMSTVAGE HSITASVNNA
QKTVTVKFKA DFSTGQASLE VDSAAPKVAN GKDAFTLTAT VEDKNGNPVP GSLVTFNLPR
GVKPLTGDNV WVKANDEGKA ELQVVSVTAG TYEITASAGN SQPSDTQTIT FVADKATATV
SGIEVIGNYA LADGKAKQTY KVTVTDANNN LVKDSDVTLT ASPASLNLEP NGTATTNEQG
QAIFTATTTV AATYTLKAQV SQTNGQVSTK TAESKFVADD KNAVLTASSD MQSLVADGKS
TAKLEVTLMS ANNPVGGNMW VDIQTPEGVT EKDYQFLPSK NDHFVSGKIT RKFSTSKPGV
YTFTFNALTY GGYEMKPVTV TITAVDADTA KDEEAMK
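
The sequences above are printed in blocks of 10 characters, so they need cleaning before use. The Python sketch below strips the spacing, computes a GC fraction, and translates DNA to protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names are illustrative assumptions, and the sketch is not the tool that generated this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGTCGCGTT ATAAAACAGA TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC"
print(translate(first_line))  # MSRYKTDHKQPRFRYSVLAR
```

The 20 residues printed here match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` gives a way to check the listed protein sequence and GC content.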