Gene Information |
Locus tag | EcHS_A2194 |
Symbol | |
ID | 5594350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2176501 |
End bp | 2178015 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921327 |
Product | hypothetical protein |
Protein accession | YP_001458866 |
Protein GI | 157161548 |
COG category | |
COG ID | |
TIGRFAM ID | |
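The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2176501
end_bp = 2178015
gene_length_bp = 1515
protein_length_aa = 504

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which does not code for a residue.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("codons minus stop match protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values: 1515 bp is 505 codons, one of which is the stop codon, leaving 504 residues.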
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.000160893 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGCGAG CGCTTTCTAT CTTGGTATTC CCCTCATTCT CATTGAGATA CGACAGCGGG GCTGAAAATG GATCTTGTAC AATGATAAAA ATTGCACGCA TTGCCGTGAC ATTGGGCTTG CTTTCCTCAC TGGGAGCCCA GGCTTACGCG GCCGGGTTAG TGGTAAATGA TAACGATCTG CGAAACGACC TTGCCTGGCT TTCCGATCGC GGGGTCATCC ATCTGAGCCT GTCGACCTGG CCGCTGAGCC AGGAAGAGAT CGCCCGGGCG CTAAAAAAGG CCAAACCGTC CTATTCTTCT GAGCAAGTGG TGCTGGCCCG TATCAATCAG CGACTGTCTG CTTTAAAAGC CGATTTTCGG GTCACCGGTT ATACCTCAAC CGATCAGCCG GGCACTCCGC AGGGGTTTGG TCAGACACAG CCGGCAGATA ATTCGTTAGG CCTGGCGTTT AACAACAGCG GACAATGGTG GGATGTCCAC CTCCAGGGTA ACGTCGAAGG GGGGGAGCGG ATCAGCAACG GATCGCGTTT CAACGCCAAC GGTGCCTACG GTGCGGTGAA ATTCTGGAAC CAGTGGCTCT CTTTTGGCCA GGTACCACAG TGGTGGGGAC CGGGCTATGA AGGAAGCCTG ATCCGCGGGG ATGCGATGCG GCCGATGACC GGCTTCCTGA TGCAGCGGGC AGAGCAGGCG GCGCCAGAGA CCTGGTGGTT ACGCTGGGTG GGTCCATGGC AGTACCAGAT CTCCGCCAGC CAGATGAATC AATATACCGC TGTGCCTCAT GCCAAAATTA TCGGCGGACG TTTTACCTTC ACGCCGTTCC AGTCATTAGA ATTAGGTGCG TCGCGTATTA TGCAGTGGGG CGGGGAAGGG CGACCGCAAT CCTTCAGCAG TTTCTGGGAT GGATTTACAG GGAAAGACAA TACCGGAACG GATAACGAGC CGGGGAACCA ACTGGCCGGA TTTGACTTTA AGTTTAAACT GGAGCCAACC CTCGGCTGGC CAGTGAGTTT CTACGGTCAA ATGATTGGTG AGGATGAATC TGGTTATCTA CCATCGGCAA ACATGTTCCT GGGAGGTGTC GAAGGCCACC ACGGCTGGGG TAAAGATGCG ATAAACTGGT ATCTTGAAGC ACATGATACG CGCACTAATA TGAGTCGAAC CAATTACAGC TACCGTCACC ATATTTATAA AGATGGATAT TATCAGCAAG GCTATCCGCT CGGCGATGCG ATGGGCGGGG ATGGTCAACT CATCGCCGGT AAGATTGGGC TTATTACAGA AGATAATCAG CGCTGGAGCA CGCGTTTGGT ATACGCCAAA GTTAACCCGG AGAATCAGTC GATCAATAAA GCATTCCCTC ATTCTGACAC CTTGAAGGGT GTACAGCTGG GATGGAGCGG AGATGTTTAT CAGTCGGTCC GTTTGAATAC TTCACTGTGG TACACCAACG CTAACAACAG CGACAGCGAT GACGTTGGGG CCAGCGCAGG GATAGAAATA CCGTTTAGTT TATAA
Protein sequence | MPRALSILVF PSFSLRYDSG AENGSCTMIK IARIAVTLGL LSSLGAQAYA AGLVVNDNDL RNDLAWLSDR GVIHLSLSTW PLSQEEIARA LKKAKPSYSS EQVVLARINQ RLSALKADFR VTGYTSTDQP GTPQGFGQTQ PADNSLGLAF NNSGQWWDVH LQGNVEGGER ISNGSRFNAN GAYGAVKFWN QWLSFGQVPQ WWGPGYEGSL IRGDAMRPMT GFLMQRAEQA APETWWLRWV GPWQYQISAS QMNQYTAVPH AKIIGGRFTF TPFQSLELGA SRIMQWGGEG RPQSFSSFWD GFTGKDNTGT DNEPGNQLAG FDFKFKLEPT LGWPVSFYGQ MIGEDESGYL PSANMFLGGV EGHHGWGKDA INWYLEAHDT RTNMSRTNYS YRHHIYKDGY YQQGYPLGDA MGGDGQLIAG KIGLITEDNQ RWSTRLVYAK VNPENQSINK AFPHSDTLKG VQLGWSGDVY QSVRLNTSLW YTNANNSDSD DVGASAGIEI PFSL