Gene Information |
Locus tag | EcHS_A0434 |
Symbol | |
ID | 5591840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 454195 |
End bp | 455721 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640919619 |
Product | hypothetical protein |
Protein accession | YP_001457204 |
Protein GI | 157159886 |
COG category | |
COG ID | |
TIGRFAM ID | |
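The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(454195, 455721)
print(length_bp)                  # 1527
print(protein_length(length_bp))  # 508
```

Under these assumptions the computed values agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields.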
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCACTCCT GGAAAAAGAA ACTTGTAGTA TCACAATTAG CATTGGCTTG CACTCTGGCT ATCACCTCTC AGGCTAATGC AACTACTTAT AATACATTCG GGTATCACGA TGACGCAGTC ACTCTATTCA ATTGGGGCGA CAATACTAAA ACTGATCACG ACTATTTGAC GTATGGTGGC TATGTATACG ACCATGCGGC TGATGGTTAT TTTGATACTG TATTTAGTGG CGATACCGTT AATGGTGTAA TTTCTACTTA TTATCTGAAT CACGACTATG GAACAGATAC CGCTAATACT CTGAATATTA CGAACTCAAA TATTCACGGT ATGATTACTT CCGATCAGAT CGGATACGGT GATTACGTCT GGACCAACGG TAGCGATTAT ACTGGTCATG ATTGGGTAGA TGGCGATATT TTCACTCTGA ACATCGCGAA CTCAACAATT GACGATGATT TTGATGCATT CTACTTCAAT GATACTTATC TGGATGCAGA CGGTAAAACG TCTAAAACAG ATTATGATCG TCTGGTAACA GCTGCTCTTG GCACCGCTGT AACTCTGGAT GTTGAAAGTA ACATCAATAT CAGCAACAAC TCCCATGTTG CAGGTATTAC TCTGGTTCAA AACGATTTAG GTAATGCGAC TTACAATACT GAAGGTCATC AGTGGGACAA CAATATCGTT GTTAATAATT CCACTGTAAC TTCAGGTTCG CTCTCAGAAG ACGAACAATC AGATCGTGGT CATTTTGGCA ACTCTGTTGA GCCAAGCGAC TATGGTAATG GTGCATCAGG TGCTGATGAT GTTGCATTAG CCTTCATAGA TGATGACACT TCTGATTATC GTATGGTCAA CAACGTTACA TTCAACAATT CTCAGTTGCT TGGCGACGTT GTATTTGACA GTACCTGGAA CGCTAACTTT GACGCGACGG GTCACCTGAT TGACAACTCT ACCACTGCTT ACACTCATGG CGGTTGGGCT ACTGACGATC AGAACGTCGA TCACCTGAAC CTGACTCTGA ACAACACCAA ATGGGTTGGT TCAGCGAATA TTGATTATGA CGTTGTTGTT GCTGACGAAG CCTTCTACGA CGTTGCGCCA AACAGCCTGA ACCCGTACGC TTCTTACTCT GAAGATGGCT GGAATCGTGT TGATAACGCT AACGCATTCC AGAGTGGCGT ATTCGATGTT GTTCTGAACA ATGGTTCTGA TTGGGAAACC ACGAAAGATT CTCTGATCGA TACCCTGGCT ATCAACAGCG GTTCTCAGGT TAACGTGAGT GCAGATTCTT CCCTGACTTC CGACACCATC ACTCTGAACG GTAGTTCTTC AATGGAAGTT AACGGTGAAG TTAACACTGA TCACCTGATC ATTGATACCT TCTCTACTGT TAACTTCGGT GAAGATACTG CTTCAGCTTG GACATCTGCA CCGCTGTACG CCAACACCAT TACCGTTACT AACGGTGGTG TGCTGGATGT AAACACCAAC ATGAATGAAT CGCCACGGAT AATCTAG
Protein sequence | MHSWKKKLVV SQLALACTLA ITSQANATTY NTFGYHDDAV TLFNWGDNTK TDHDYLTYGG YVYDHAADGY FDTVFSGDTV NGVISTYYLN HDYGTDTANT LNITNSNIHG MITSDQIGYG DYVWTNGSDY TGHDWVDGDI FTLNIANSTI DDDFDAFYFN DTYLDADGKT SKTDYDRLVT AALGTAVTLD VESNINISNN SHVAGITLVQ NDLGNATYNT EGHQWDNNIV VNNSTVTSGS LSEDEQSDRG HFGNSVEPSD YGNGASGADD VALAFIDDDT SDYRMVNNVT FNNSQLLGDV VFDSTWNANF DATGHLIDNS TTAYTHGGWA TDDQNVDHLN LTLNNTKWVG SANIDYDVVV ADEAFYDVAP NSLNPYASYS EDGWNRVDNA NAFQSGVFDV VLNNGSDWET TKDSLIDTLA INSGSQVNVS ADSSLTSDTI TLNGSSSMEV NGEVNTDHLI IDTFSTVNFG EDTASAWTSA PLYANTITVT NGGVLDVNTN MNESPRII
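The relationship between the gene sequence and the protein sequence can be illustrated with a small translation helper. This is a minimal sketch, not the database's own pipeline: it hard-codes the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in which codons may serve as starts), it ignores alternative start codons, and `translate`, `gc_percent`, and `clean` are hypothetical helper names.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares this mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGCACTCCT GGAAAAAGAA ACTTGTAGTA"))  # MHSWKKKLVV
```

Pasting the full Gene sequence value into `translate` gives a string to compare against the Protein sequence field, and `gc_percent` gives a figure to compare against the GC content field.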