Gene Information |
Locus tag | EcHS_A3572 |
Symbol | |
ID | 5594551 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3550377 |
End bp | 3551681 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922689 |
Product | hypothetical protein |
Protein accession | YP_001460170 |
Protein GI | 157162852 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTGT ATATTCAGAT TATCGTGGTG GCGTGCCTGA CGGGTATGAC ATCGCTTCTG GCGCATCGCT CGGCGGCTGT TTTTCATGAC GGCATCCGCC CGATCCTGCC GCAACTGATT GAAGGCTATA TGAACCGTCG CGAGGCGGGG AGTATCGCTT TTGGTCTGAG CATTGGTTTT GTGGCCTCGG TGGGGATCTC TTTTACCCTG AAAACCGGGC TGCTAAACGC ATGGTTACTC TTTCTTCCTA CCGATATCCT CGGCGTACTG GCGATAAACA GCCTGATGGC GTTTGGTCTT GGCGCTATCT GGGGCGTGTT GATCCTTACT TGCCTGTTGC CAGTAAACCA GCTGCTGACC GCGCTGCCGG TGGATGTATT AGGTAGCCTC GGGGAATTAA GCTCGCCGGT GGTTTCTGCT TTTGCACTCT TCCCGTTGGT GGCGATTTTC TACCAGTTTG GCTGGAAGCA AAGTCTGGTC GCCGCCGTGG TTGTTCTGAT GACCCGTGTG GTAGTCGTGC GCTATTTCCC ACATCTTAAC CCTGAATCCA TCGAAATCTT TATTGGCATG GTGATGCTGC TGGGAATCGC GATAACTCAC GACCTGCGTC ATCGTGATGA AAATGACATC GATGCCAGCG GGCTTTCGGT GTTTGAAGAG CGCACGTCGC GGATTATCAA AAACTTACCC TATATCGCCA TCGTGGGAGC ATTGATTGCC GCCGTTGCCA GTATGAAGAT TTTCGCCGGC AGTGAAGTGT CGATCTTCAC TCTGGAGAAA GCGTACTCCG CAGGCGTAAC GCCGGAACAA TCGCAAACGC TGATCAACCA GGCTGCTCTG GCGGAGTTTA TGCGCGGACT GGGTTTTGTG CCGTTGATTG CCACCACCGC GTTAGCCACC GGCGTATATG CAGTTGCGGG CTTTACCTTT GTTTATGCGG TGGGCTATCT CTCGCCGAAT CCGATGGTTG CGGCGGTATT AGGCGCAGTG GTTATTTCGG CAGAAGTCCT GTTACTTCGT TCGATCGGCA AATGGCTGGG GCGCTATCCG TCGGTGCGTA ATGCGTCGGA TAACATCCGT AACGCCATGA ATATGCTGAT GGAAGTGGCG CTGCTGGTCG GTTCGATTTT CGCAGCAATT AAAATGGCGG GTTATACCGG ATTCTCTATC GCGGTTGCCA TTTACTTCCT CAACGAATCC CTGGGCCGTC CGGTACAGAA AATGGCGGCA CCGGTCGTGG CCGTAATGAT CACCGGTATT CTGCTGAATG TTCTTTACTG GCTTGGCCTG TTCGTTCCGG CTTAA
|
Protein sequence | MDLYIQIIVV ACLTGMTSLL AHRSAAVFHD GIRPILPQLI EGYMNRREAG SIAFGLSIGF VASVGISFTL KTGLLNAWLL FLPTDILGVL AINSLMAFGL GAIWGVLILT CLLPVNQLLT ALPVDVLGSL GELSSPVVSA FALFPLVAIF YQFGWKQSLV AAVVVLMTRV VVVRYFPHLN PESIEIFIGM VMLLGIAITH DLRHRDENDI DASGLSVFEE RTSRIIKNLP YIAIVGALIA AVASMKIFAG SEVSIFTLEK AYSAGVTPEQ SQTLINQAAL AEFMRGLGFV PLIATTALAT GVYAVAGFTF VYAVGYLSPN PMVAAVLGAV VISAEVLLLR SIGKWLGRYP SVRNASDNIR NAMNMLMEVA LLVGSIFAAI KMAGYTGFSI AVAIYFLNES LGRPVQKMAA PVVAVMITGI LLNVLYWLGL FVPA
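
The coordinate and length fields in this record are related by simple arithmetic, which the minimal sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers written for illustration and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len, has_stop=True):
    """Residue count for an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if has_stop else codons


def gc_percent(seq):
    """Percent G+C over the A/C/G/T characters of seq.

    Spaces and other characters are ignored, so the 10-base grouping
    used in the Gene sequence field can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(3550377, 3551681)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the listed Gene Length (1305 bp) and Protein Length (434 aa). Passing the Gene sequence field to `gc_percent` gives a value that can be compared against the listed GC content.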