Gene Information |
Locus tag | EcHS_A1329 |
Symbol | |
ID | 5593230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1322223 |
End bp | 1323617 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920486 |
Product | hypothetical protein |
Protein accession | YP_001458047 |
Protein GI | 157160729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 0.953339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCCGTT TCGTTCCTCG CATTATTTCG TTTTATTTAC TCTTGCTGGC GGCAGGCGGT ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT GATGGTCTGC CGGATTTAGG CATGGCACCT GAAAATCATG ATGGGGAAAA ACACTTTGCT GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG CAGGCAAAAG CTTTCGCATT GGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGATGTCAA AGTGGATAAC GAAGGTCATT TTACCGGCAG TCGTGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT TATCTCACCT GGAGCCAGCT TGGTCTTACT CAGCAGGATG ATGGGTTGGT GAGCAATGTG GGCGTTGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT TTGCGACTAT CGGCAAACTT TTATCAGCCA TTTGCTGCAT GGCATGAACA GACAGCCACG CAGGAACAAC GGATGGCGCG CGGGTACGAC CTGACAGCCC GGATGCGCAT GCCGTTCTAT CAACACCTCA ATACCAGTGT CAGCGTAGAA CAGTATTTTG GTGATCGTGT TGATTTGTTT AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT GTGCCATTAG TCACTGTGAC GGCCCAGCAT AAACAGGGTG AAAGTGGCGA GAATCAAAAT AACCTCGGGC TGAATCTTAA TTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG GGCGAGGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGATAATCC GCAGCGAAAT AATCTACCGA CTCTTGAGTA CCGACAGCGA AAAACGTTAA CGGTGTTTCT GGCGACACCG CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCAGGCGCA CAAGCCAACA GCGCGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAGAA CGGGGAAGGG GCGAGCAATC ACTGGCGATT GTCGGTGGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGACG CATTGTCAAA CGACGAACTG CGCTGGGAAC CGTAA
|
Protein sequence | MSRFVPRIIS FYLLLLAAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT QQDDGLVSNV GVGQRWARGN WLVGYNTFYD NLLDENLQRA GFGAEAWGEY LRLSANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY QHLNTSVSVE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTAQH KQGESGENQN NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSAEGWTL IMPDWQNGEG ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP
|
| |
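The length fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS of gene_len bp, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values copied from the Gene Information table above.
length_bp = gene_length(1322223, 1323617)
print(length_bp)                  # 1395, matching "Gene Length"
print(protein_length(length_bp))  # 464, matching "Protein Length"
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, depending on how the source rounds. The gene sequence starts with TTG rather than ATG; under translation table 11 this is an alternative start codon that is read as methionine, which is why the protein sequence begins with M.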