Gene Information |
Locus tag | EcolC_4275 |
Symbol | |
ID | 6068079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4729409 |
End bp | 4730578 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641603712 |
Product | glycoside hydrolase family 13 protein |
Protein accession | YP_001727198 |
Protein GI | 170022244 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2382] Enterochelin esterase and related enzymes |
TIGRFAM ID | |
| |
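The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4729409
end_bp = 4730578

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1170 bp) and Protein Length (389 aa).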
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATAA AAATTGCTGC TTTAACGCTG GCTATTGCCA GCGGTATTTC TGCTCAGTGG GCCATAGCAG CGGATATGCC AGCCAGCCCG GCACCCACTA TTCCGGTTAA ACAGTATGTG ACTCAGGTCA ATGCCGATAA CAGCGTGACC TTTCGCTACT TTGCCCCTGG GGCAAAAAAC GTCTCCGTAG TGGTGGGTGT TCCGGTTCCG GACAATATTC ACCCGATGAC CAAAGACGAA GCGGGAGTCT GGTCGTGGCG CACACCTGTC CTGAAAGGCA ATCTGTACGA ATATTTTTTC AATGTTGATG GTGTACGCAG CATTGATACT GGCACCGCAA TGACTAAGCC TCAGCGCCAG GTTAACTCCA GTATGATTCT GGTGCCAGGC AGTTATCTGG ATACGCGTTC TGTTGCGCAT GGTGATTTGA TCGCCATAAC TTACCACTCC AACGCATTGC AATCTGAACG TCAGATGTAT GTCTGGACCC CGCCAGGATA CACCGGCATG GGCGAGCCTT TGCCAGTGCT CTATTTCTAT CACGGCTTTG GTGATACTGG ACGTTCCGCT ATCGATCAGG GGCGTATCCC GCAAATCATG GATAACCTGC TGGCTGAAGG GAAAATTAAA CCGATGCTGG TGGTGATCCC AGATACCGAA ACCGATGCGA AGGGCATTAT TCCCGAAGAT TTCGTGCCTC AGGAAAGACG TAAAGTCTTT TATCCGCTGA ATGCTAAAGC GGCAGATCGC GAACTGATGA ACGATATTAT CCCGCTGATT AGCAAGCGTT TTAACGTCCG TAAAGATGCC GATGGTCGCG CACTGGCAGG GCTTTCACAA GGTGGGTATC AGGCGCTGGT ATCGGGAATG AATCATCTGG AAAGCTTTGG CTGGCTGGCC ACATTCAGTG GTGTTACCAC GACAACCGTA CCGGATGAAG GTGTCGCGGC CCGGCTGAAC GATCCGGCAG CTATTAACCA GCAACTACGT AATTTTACTG TGGTCGTTGG TGATAAAGAT GTCGTAACCG GCAAGGATAT CGCCGGGCTG AAAACTGAGC TTGAGCAGAA AAAAATTAAG TTTGATTACC AGGAATATCC CGGCCTGAAC CATGAAATGG ATGTCTGGCG GCCTGCCTAT GCAGCCTTCG TACAGAAATT ATTTAAATAA
|
Protein sequence | MNIKIAALTL AIASGISAQW AIAADMPASP APTIPVKQYV TQVNADNSVT FRYFAPGAKN VSVVVGVPVP DNIHPMTKDE AGVWSWRTPV LKGNLYEYFF NVDGVRSIDT GTAMTKPQRQ VNSSMILVPG SYLDTRSVAH GDLIAITYHS NALQSERQMY VWTPPGYTGM GEPLPVLYFY HGFGDTGRSA IDQGRIPQIM DNLLAEGKIK PMLVVIPDTE TDAKGIIPED FVPQERRKVF YPLNAKAADR ELMNDIIPLI SKRFNVRKDA DGRALAGLSQ GGYQALVSGM NHLESFGWLA TFSGVTTTTV PDEGVAARLN DPAAINQQLR NFTVVVGDKD VVTGKDIAGL KTELEQKKIK FDYQEYPGLN HEMDVWRPAY AAFVQKLFK
|
| |
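The gene and protein sequences above can be checked against each other with a short script. This is a minimal sketch, not the database's own method: `clean`, `gc_content`, and `translate` are hypothetical helper names, and it relies on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Spaces between the 10-character blocks are stripped before processing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAATATAA AAATTGCTGC TTTAACGCTG"
print(translate(first_blocks))  # MNIKIAALTL
```

Passing the full Gene sequence field to `translate` and `gc_content` allows a comparison with the listed Protein sequence and GC content.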