Gene Information |
Locus tag | EcHS_A0351 |
Symbol | |
ID | 5594736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 361814 |
End bp | 366067 |
Gene Length | 4254 bp |
Protein Length | 1417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640919536 |
Product | putative intimin |
Protein accession | YP_001457122 |
Protein GI | 157159804 |
COG category | |
COG ID | |
TIGRFAM ID | |
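The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(361814, 366067)
print(length)                     # 4254
print(protein_length_aa(length))  # 1417
```

Both values match the Gene Length and Protein Length rows of the record.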
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.751888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGTT ATAAAACAGA TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ACTTTATTAC CGGAATGGCC ACAGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC TGCGCGCGTC AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG ATTTATGATA CGCCAACAAA TATGTTGTTC ACTCAGGGAG CAATACATCG TACCGACGAT CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG GGGGTGAACA CCTTTATCGA CCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT GCGGAATACT GGCGCGATTA TCTGAAACTG AGCGCCAATG GTTATATTCG GGCTTCTGGC TGGAAAAAAT CGCCGGATAT TGAGGATTAT CAGGAACGCC CGGCGAATGG TTGGGATATC CGCGCAGAGG GCTATTTACC TGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG CAGGGCAAGA GCGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAATTA TCGGATTGGC GAACCTCTGG AGAAACAACT CGATACAGAC AGCATTCGCG AGCGTCGAAT GCTGGCAGGC AGCCGCTATG ACCTGGTTGA GCGTAATAAC AACATCGTTC TTGAGTACCG CAAATCTGAA GTGATCCGTA TTGCTCTGCC TGAGCGTATT GAAGGTAAGG GCGGTCAGAC ACTTTCCCTG GGGCTTGTGG TCAGCAAAGC AACTCACGGA CTGAAAAATG TGCAGTGGGA AGCGCCGTCA TTACTGGCTG AAGGTGGCAA AATTACCGGT CAGGGTAGTC AGTGGCAAGT AACGCTCCCG GCTTATCGTC CAGGCAAAGA CAATTATTAT GCGATTTCTG CGGTTGCCTA CGATAACAAA GGCAATGCCT CAAAACGCGT GCAGACAGAG GTGGTCATTA CCGGAGCAGG TATGAGTGCC GATCGCACGG CGTTAACGCT TGACGGTCAG AGCCGTATTC AAATGCTTGC TAACGGTAAT GAGCAAAGAC CGCTGGTGCT GTCTCTGCGC GACGCCGAGG GCCAGCCAGT CACGGGCATG AAAGATCAGA TCAAGACTGA ACTAACTTTC AAACCGGCTG GAAATATTGT GACTCGTACC CTGAAGGCCA CTAAATCACA GGCAAAGCCA ACACTGGGTG AGTTCACCGA AACTGAAGCC GGGGTGTATC AGTCTGTCTT TACTACCGGA ACGCAGTCAG GTGAGGCAAC GATTACTGTT AGTGTTGATG GCATGAGCAA AACCGTCACT GCAGAACTGC GGGCCACGAT GATGGATGTG GCAAACTCCA CCCTGAGCGC TAACGAGCCG TCAGGTGACG TGGTTGCTGA TGGTCAGCAA 
GCCTACACGC TGACGCTGAC TGCGGTGGAT ACTGATGGTA ACCCGGTGAC GGGAGAGGCC AGCCGCTTGC GATTTGTTCC GCAAGACACT AATGGTGTCA CCATTGGTAC AATTTCGGAG ATAAAACCAG GCGTTTACAG CGCCACGGTT TCTTCGACCC GTGCCGGAAA CGTTGTTGTG CGTGCTTTCA GCGAGCAGTA TCAGCTGGGC ACATTACAAC AAACGCTGAA GTTTGTTGCC GGGCCGCTTG ATGCAGCACA TTCGTCCATC ACACTGAATC CTGATAAACC GGTGGTTGGC GGTACAGTTA CGGCAATCTG GACGGCAAAA GATGCTAATG ACAACCCTGT AACTGGTCTC AATCCGGATG CACCGTCATT ATCGGGCGCA GCTGCTGCTG GTTCTACGGC ATCAGGCTGG ACGGATAATG GCGACGGGAC CTGGACTGCG CAGATTTCTC TCGGCACTAC GGCGGGTGAA TTAGAGGTTA TTCCGAAGCT AAATGGACAG GATGCGGCAG CAAATGCGGC AAAAGTAACC GTGGTGGCTG ATGCGTTATC TTCAAACCAG TCGAAAGTCT CTGTCGCAGA AGATCACGTA AAAGCCGGCG AAAGCACAAC CGTGACGCTT ATTGCAAAAG ATGCACATGG CAACGCTATC AGTGGTCTTT CCCTGTCGGC AAGCCTGACG GGTGCTGCGT CTGAAGGGGC GACTGTTTCT GGTTGGACCG AAAAAGGTGA TGGTTCCTAT GTCGCTACGC TGACAACAGG TGGAAAGACG GGTGAGCTTC TCGTCATGCC GCTATTCAAC GGCCAGCCAG CAGCCACCGA AGCCGCGCAG TTGACTGTCA TTGCGGGGGA GATGTCATCA GCGAACTCTA CGCTTGTTGC TGACAATAAG GCTCCGACCG TCAAAACGAC GACGAAACTC ACCTTCACCG TGAAGGATGC GTACGGGAAC CTTGTCACCG GGCTGAAGCC AGATGCACCG CAGTTTAGTG GTGCCGCCAG CACGGGGACA GAGCGACCTT CAACAGGAGA CTGGACAGAA ACAAGTAATG GGGTCTACGT GGCGACCTTG ACTCTGGGAT CTGCCGCGGG CCAGTTGTCT GTGATGCCGC GAGTGAACGG CCAAAATGCC GTTGCTCAGC CACTGGTGCT GAATGTTGCT GGTGACGCAT CTAAGGCTGA GATTCGTGAT ATGACGGTGA AGGTTGATAA CCAGCTGGCT AATGGACAAT CGACTAACCA GGTAACCCTG ACCGTTGTGG ACACCTATGG TAACCCGTTG CAGGGACAAA ATGTGACGCT GACTCTGCCG AAAGGTGTGA CCAGCAAGAC GGGGAATACG GTAACAACCG ATGCGGCAGG TAAAGCCGAC ATTGAGCTGA TGTCAACGGT TGCCGGGGAA CACAGCATCA CGGCCTCAGT GAATAATGCT CAGAAGACGG TTACGGTGAA ATTCAAGGCG GATTTCAGTA CCGGTCAGGC GAGTCTGGAG GTTGATAGCG CCGCGCCAAA AGTAGCAAAC GGCAAAGATG CCTTTACGCT GACGGCGACC GTTGAGGATA AAAATGGTAA CCCTGTTCCA GGGAGCCTGG TGACCTTTAA TCTGCCCCGG GGTGTCAAGC CGCTTACAGG CGATAATGTC TGGGTGAAAG CCAACGATGA GGGGAAAGCA GAGTTGCAGG TGGTTTCAGT GACTGCCGGA ACGTATGAGA TCACGGCATC GGCGGGGAAT AGCCAGCCTT CGGATACGCA GACTATAACG TTTGTAGCCG ATAAGGCTAC CGCAACCGTC TCCGGTATTG 
AGGTGATTGG CAACTATGCG CTGGCGGACG GCAAAGCCAA ACAAACGTAT AAAGTTACGG TGACTGATGC CAATAACAAT TTGGTGAAAG ATAGCGACGT GACGCTGACT GCCAGCCCGG CTTCGTTAAA CCTGGAACCG AATGGCACTG CGACAACGAA TGAGCAAGGG CAGGCTATTT TCACCGCTAC CACTACTGTT GCGGCGACAT ACACACTCAA GGCGCAAGTG AGTCAGACCA ACGGTCAGGT ATCAACGAAA ACTGCCGAAT CTAAATTCGT TGCGGATGAT AAAAACGCGG TACTCACCGC ATCATCTGAT ATGCAATCTC TGGTGGCGGA TGGGAAATCG ACTGCGAAGC TGGAGGTGAC ACTGATGTCG GCAAACAACC CCGTTGGCGG GAATATGTGG GTCGACATTC AGACGCCGGA AGGGGTGACG GAGAAGGATT ATCAGTTCCT GCCGTCGAAA AATGACCATT TCGTGAGCGG AAAAATCACG CGTAAATTTA GTACCAGCAA GCCTGGTGTC TATACGTTCA CATTTAACGC CCTGACCTAT GGCGGGTACG AAATGAAGCC AGTGACGGTG ACCATTACCG CGGTGGATGC CGATACGGCA AAGGACGAGG AGGCGATGAA ATAA
|
Protein sequence | MSRYKTDHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDIEDY QERPANGWDI RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK QGKSGENDTR FGLEVNYRIG EPLEKQLDTD SIRERRMLAG SRYDLVERNN NIVLEYRKSE VIRIALPERI EGKGGQTLSL GLVVSKATHG LKNVQWEAPS LLAEGGKITG QGSQWQVTLP AYRPGKDNYY AISAVAYDNK GNASKRVQTE VVITGAGMSA DRTALTLDGQ SRIQMLANGN EQRPLVLSLR DAEGQPVTGM KDQIKTELTF KPAGNIVTRT LKATKSQAKP TLGEFTETEA GVYQSVFTTG TQSGEATITV SVDGMSKTVT AELRATMMDV ANSTLSANEP SGDVVADGQQ AYTLTLTAVD TDGNPVTGEA SRLRFVPQDT NGVTIGTISE IKPGVYSATV SSTRAGNVVV RAFSEQYQLG TLQQTLKFVA GPLDAAHSSI TLNPDKPVVG GTVTAIWTAK DANDNPVTGL NPDAPSLSGA AAAGSTASGW TDNGDGTWTA QISLGTTAGE LEVIPKLNGQ DAAANAAKVT VVADALSSNQ SKVSVAEDHV KAGESTTVTL IAKDAHGNAI SGLSLSASLT GAASEGATVS GWTEKGDGSY VATLTTGGKT GELLVMPLFN GQPAATEAAQ LTVIAGEMSS ANSTLVADNK APTVKTTTKL TFTVKDAYGN LVTGLKPDAP QFSGAASTGT ERPSTGDWTE TSNGVYVATL TLGSAAGQLS VMPRVNGQNA VAQPLVLNVA GDASKAEIRD MTVKVDNQLA NGQSTNQVTL TVVDTYGNPL QGQNVTLTLP KGVTSKTGNT VTTDAAGKAD IELMSTVAGE HSITASVNNA QKTVTVKFKA DFSTGQASLE VDSAAPKVAN GKDAFTLTAT VEDKNGNPVP GSLVTFNLPR GVKPLTGDNV WVKANDEGKA ELQVVSVTAG TYEITASAGN SQPSDTQTIT FVADKATATV SGIEVIGNYA LADGKAKQTY KVTVTDANNN LVKDSDVTLT ASPASLNLEP NGTATTNEQG QAIFTATTTV AATYTLKAQV SQTNGQVSTK TAESKFVADD KNAVLTASSD MQSLVADGKS TAKLEVTLMS ANNPVGGNMW VDIQTPEGVT EKDYQFLPSK NDHFVSGKIT RKFSTSKPGV YTFTFNALTY GGYEMKPVTV TITAVDADTA KDEEAMK
|
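The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and the reading frame translated. It assumes the amino-acid assignments of translation table 11 are the same as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTCGCGTT ATAAAACAGA TCATAAACAA"
dna = clean_sequence(first_blocks)
print(translate(dna))  # MSRYKTDHKQ
```

The printed peptide matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length rows to be checked in the same way.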