Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2029 |
Symbol | |
ID | 5593822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2027281 |
End bp | 2028486 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640921173 |
Product | putative inner membrane protein |
Protein accession | YP_001458718 |
Protein GI | 157161400 |
COG category | [R] General function prediction only |
COG ID | [COG2391] Predicted transporter component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATGGC AGCAATTCAA ACACGCCTGG TTGATTAAAT TCTGGGCGCC CATCCCCGCG GTCATCGCGG CGGGTATTCT CTCCACTTAC TATTTTGGCA TCACTGGCAC CTTTTGGGCT GTCACAGGTG AATTTACCCG TTGGGGCGGT CAGCTCCTGC AGTTGTTCGG CGTCCATGCT GAAGAGTGGG GTTATTTTAA AATTATCCAT CTGGAAGGAT CGCCATTAAC CCGCATCGAC GGAATGATGA TCCTCGGTAT GTTTGGCGGC TGTTTTGCCG CAGCGCTGTG GGCCAACAAT GTCAAACTGC GCATGCCGCG CAGCCGTATC CGCATTATGC AGGCCATCAT TGGCGGTATT ATCGCCGGTT TTGGCGCACG TCTGGCAATG GGCTGTAACC TGGCGGCGTT CTTTACCGGA ATTCCACAGT TCTCGCTGCA CGCCTGGTTC TTTGCCATCG CCACTGCCAT TGGTTCATGG TTTGGCGCGC GCTTTACCCT TCTGCCCATC TTCCGTATTC CCGTGAAAAT GCAGAAAGTT TCTGCTGCCT CACCGTTGAC GCAAAAACCG GATCAGGCGC GGCGTCGTTT TCGTCTCGGG ATGCTGGTCT TTTTCGGCCT GCTGGGCTGG GCGCTGCTGA CGGCGATGAA CCAGCCCAAA CTGGGGCTGG CAATGCTGTT TGGCGTCGGC TTTGGTTTAC TGATTGAACG TGCGCAAATC TGCTTTACTT CGGCGTTCCG CGACATGTGG ATCACCGGAC GTACCCATAT GGCGAAAGCA ATCATTATCG GTATGGCGGT AAGTGCCATC GGGATCTTCA GTTACGTACA GTTAGGCGTT GAACCCAAAA TCATGTGGGC GGGACCAAAC GCGGTAATTG GTGGTTTACT GTTTGGTTTT GGCATCGTGC TGGCAGGCGG CTGCGAAACC GGCTGGATGT ACCGCGCGGT AGAAGGCCAG GTGCACTACT GGTGGGTCGG TCTGGGCAAC GTGATCGGCT CAACGATTCT GGCGTACTAC TGGGATGATT TCGCTCCGGC GCTGGCCACC GACTGGGACA AAATCAACCT GCTGAAAACC TTTGGCCCGA TGGGCGGCCT GCTGGTGACA TATTTGCTGT TGTTTGCTGC TCTAATGTTG ATTATTGGCT GGGAAAAACG CTTCTTCCGC CGTGCGGCAC CGCAGACTGC TAAGGAGATC GCATGA
|
Protein sequence | MSWQQFKHAW LIKFWAPIPA VIAAGILSTY YFGITGTFWA VTGEFTRWGG QLLQLFGVHA EEWGYFKIIH LEGSPLTRID GMMILGMFGG CFAAALWANN VKLRMPRSRI RIMQAIIGGI IAGFGARLAM GCNLAAFFTG IPQFSLHAWF FAIATAIGSW FGARFTLLPI FRIPVKMQKV SAASPLTQKP DQARRRFRLG MLVFFGLLGW ALLTAMNQPK LGLAMLFGVG FGLLIERAQI CFTSAFRDMW ITGRTHMAKA IIIGMAVSAI GIFSYVQLGV EPKIMWAGPN AVIGGLLFGF GIVLAGGCET GWMYRAVEGQ VHYWWVGLGN VIGSTILAYY WDDFAPALAT DWDKINLLKT FGPMGGLLVT YLLLFAALML IIGWEKRFFR RAAPQTAKEI A
|
| |