Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1248 |
Symbol | |
ID | 5593354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1249538 |
End bp | 1250659 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640920408 |
Product | cupin family protein |
Protein accession | YP_001457970 |
Protein GI | 157160652 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATACC AACTCACTCT TAACTGGCCC GATTTTCTTG AACGTCACTG GCAGAAACGC CCGGTGGTGT TAAAACGCGG CTTTAATAAT TTTATTGACC CGATCTCTCC AGACGAGTTG GCGGGTCTGG CGATGGAAAG CGAAGTCGAC AGTCGACTGG TCAGTCACCA GGATGGCAAA TGGCAGGTCA GCCACGGTCC GTTCGAAAGC TACGATCATC TCGGTGAAAC TAACTGGTCA TTGTTAGTGC AAGCAGTAAA TCACTGGCAT GAGCCGACCG CCGCGCTGAT GCGACCGTTC CGTGAACTAC CGGACTGGCG TATTGATGAT CTGATGATCT CTTTTTCTGT ACCTGGCGGC GGCGTCGGCC CGCATCTCGA TCAGTACGAC GTGTTTATCA TTCAGGGTAC CGGACGTCGT CGCTGGCGAG TGGGCGAAAA GCTGCAAATG AAACAGCACT GCCCACACCC GGATCTGTTA CAGGTCGATC CGTTCGAAGC CATCATCGAT GAAGAGCTGG AGCCTGGCGA TATTCTTTAT ATTCCGCCAG GATTCCCGCA TGAAGGCTAC GCGCTGGAAA ATGCGATGAA CTATTCCGTG GGTTTTCGCG CGCCAAATAC GCGGGAATTA ATTAGCGGAT TTGCCGATTA TGTGCTGCAA CGTGAACTGG GCGGCAACTA CTACAGCGAT CCTGATGTTC CACCTCGCGC TCATCCTGCG GACGTTCTGC CGCAAGAGAT GGATAAACTG CGTGAGATGA TGCTCGAATT GATCAACCAG CCGGAACACT TTAAGCAATG GTTTGGCGAG TTTATATCCC AGTCACGTCA TGAACTGGAT ATCGCGCCGC CAGAGCCGCC TTATCAGCCG GATGAAATCT ACGATGCGCT GAAACAAGGT GATGTGCTGG TGCGCCTGGG TGGTCTGCGC GTATTGCGCA TTGGCGACGA CGTGTATGCC AATGGTGAGA AGATCGATTC CCCGCACCGT CCGGCACTGG ATGCACTCGC CAGCAACATT GCGCTGACCG CGGAGAATTT TGGCGATGCG CTGGAAGATC CGTCATTCCT CGCGATGCTC GCGGCGCTGG TCAATAGCGG ATACTGGTTC TTCGAAGGGT AA
|
Protein sequence | MEYQLTLNWP DFLERHWQKR PVVLKRGFNN FIDPISPDEL AGLAMESEVD SRLVSHQDGK WQVSHGPFES YDHLGETNWS LLVQAVNHWH EPTAALMRPF RELPDWRIDD LMISFSVPGG GVGPHLDQYD VFIIQGTGRR RWRVGEKLQM KQHCPHPDLL QVDPFEAIID EELEPGDILY IPPGFPHEGY ALENAMNYSV GFRAPNTREL ISGFADYVLQ RELGGNYYSD PDVPPRAHPA DVLPQEMDKL REMMLELINQ PEHFKQWFGE FISQSRHELD IAPPEPPYQP DEIYDALKQG DVLVRLGGLR VLRIGDDVYA NGEKIDSPHR PALDALASNI ALTAENFGDA LEDPSFLAML AALVNSGYWF FEG
|
| |