Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2526 |
Symbol | |
ID | 5594398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2541032 |
End bp | 2542288 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921647 |
Product | hypothetical protein |
Protein accession | YP_001459180 |
Protein GI | 157161862 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0038] Chloride channel protein EriC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCATC CGCGAGCCAG AACCATGTTG TTATTATCGC TCCCCGCCGT GGCAATTGGG ATTGCGTCCA GTCTTATTCT GATTGTGGTG ATGAAAATCG CCTCGGTATT ACAGAATTTG CTCTGGCAAC GACTGCCGGG AACTCTGGGG ATAGCCCAGG ATTCACCCCT CTGGATCATC GGTGTATTAA CGCTAACGGG TATTGCGGTG GGGTTGGTTA TCCGTTTCAG CCAGGGTCAT GCCGGACCAG ACCCCGCCTG TGAACCGCTG ATCGGCGCAC CGGTTCCGCC CTCTGCGCTA CCTGGACTTA TCGTAGCATT AATTCTCGGT CTTGCTGGCG GCGTCAGCCT GGGGCCGGAA CATCCGATCA TGACCGTCAA TATCGCCCTT GCGGTTGCCA TTGGCGCTCG TCTGTTACCG CGCGTCAACC GAATGGAGTG GACTATTTTA GCCTCTGCCG GAACCATCGG CGCGCTGTTT GGTACACCTG TTGCGGCGGC GTTGATATTT TCGCAAACCT TAAATGGCAG TAGTGAAGTT CCGCTATGGG ATCGTCTTTT TGCGCCGTTA ATGGCGGCAG CAGCTGGTGC ACTTACTACC GGATTATTTT TCCATCCTCA TTTTTCACTG CCCATTGCTC ATTACGGACA GATGGAAATG ACCGATATTC TCAGCGGCGC AATTGTCGCG GCGATTGCCA TCGCCGCAGG GATGGTTGCC GTATGGTGCT TACCACGGTT GCACGCGATG ATGCATCAAA TGAAAAATCC GGTGCTCGTG CTGGGTATTG GCGGATTTAT TCTCGGTATT CTGGGGGTTA TTGGTGGACC AGTTTCGCTG TTTAAAGGGC TGGATGAGAT GCAGCAGATG GTGGCAAATC AGGCTTTCAG CACCAGCGAT TACTTTTTGC TGGCGGTAAT TAAACTTGCC GCCCTGGTCG TTGCTGCCGC CAGTGGCTTT CGCGGTGGGC GAATTTTCCC GGCAGTGTTT GTCGGCGTGG CATTAGGGTT GATGCTGCAT GAGCACGTTC CCGCCGTACC AGCGGCAATA ACCGTTTCTT GCGCTATTCT CGGCATCGTG CTGGTGGTAA CACGCGATGG CTGGTTAAGT CTTTTTATGG CGGCAGTCGT TGTACCCAAT ACCACATTGC TACCGCTGCT CTGTATCGTC ATGCTTCCGG CATGGCTGTT ATTAGCAGGT AAGCCGATGA TGATGGTCAA TCGTTCGAAG CAACAGCCCC CCCACGATAA CGTTTAG
|
Protein sequence | MLHPRARTML LLSLPAVAIG IASSLILIVV MKIASVLQNL LWQRLPGTLG IAQDSPLWII GVLTLTGIAV GLVIRFSQGH AGPDPACEPL IGAPVPPSAL PGLIVALILG LAGGVSLGPE HPIMTVNIAL AVAIGARLLP RVNRMEWTIL ASAGTIGALF GTPVAAALIF SQTLNGSSEV PLWDRLFAPL MAAAAGALTT GLFFHPHFSL PIAHYGQMEM TDILSGAIVA AIAIAAGMVA VWCLPRLHAM MHQMKNPVLV LGIGGFILGI LGVIGGPVSL FKGLDEMQQM VANQAFSTSD YFLLAVIKLA ALVVAAASGF RGGRIFPAVF VGVALGLMLH EHVPAVPAAI TVSCAILGIV LVVTRDGWLS LFMAAVVVPN TTLLPLLCIV MLPAWLLLAG KPMMMVNRSK QQPPHDNV
|
| |