Gene Information |
Locus tag | EcDH1_4068 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4404647 |
End bp | 4405636 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | ACX41668 |
Protein GI | 260451246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
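The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Gene Length includes the stop codon while Protein Length does not; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4404647
END_BP = 4405636


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 990 329
```

Under these assumptions the computed values agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.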
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGT GGGGCGTAGG GTTAACATTT TTGCTGGCGG CAACCAGCGT TATGGCAAAG GATATTCAGC TTCTTAACGT TTCATATGAT CCAACGCGCG AATTGTACGA ACAGTACAAC AAGGCATTCA GCGCCCACTG GAAACAGCAA ACTGGTGATA ACGTGGTGAT TCGTCAGTCA CACGGTGGCT CAGGTAAACA AGCGACGTCG GTAATCAACG GTATTGAAGC TGATGTTGTC ACGCTGGCTC TGGCCTATGA CGTGGACGCA ATTGCGGAAC GCGGGCGGAT TGATAAAGAG TGGATCAAAC GTCTGCCGGA TAACTCCGCA CCGTACACTT CCACCATTGT TTTCCTGGTA CGTAAGGGAA ATCCGAAGCA GATCCATGAC TGGAACGATC TGATTAAACC GGGTGTTTCG GTGATCACGC CTAATCCGAA AAGCTCTGGT GGCGCGCGCT GGAACTACCT GGCAGCCTGG GGCTACGCGC TGCATCACAA CAACAACGAT CAGGCAAAAG CACAGGATTT TGTTCGGGCA CTGTATAAAA ACGTCGAAGT TCTGGATTCT GGCGCGCGTG GCTCCACTAA CACTTTTGTC GAGCGCGGAA TTGGCGATGT ACTGATTGCC TGGGAAAACG AAGCTCTGCT GGCAGCGAAT GAACTGGGGA AAGATAAATT CGAAATCGTC ACGCCGAGTG AGTCTATCCT CGCAGAGCCA ACCGTGTCGG TGGTCGATAA AGTGGTCGAG AAAAAAGGTA CTAAAGAGGT GGCGGAAGCC TACCTGAAAT ATCTCTACTC GCCAGAAGGT CAGGAAATTG CCGCGAAAAA CTACTACCGT CCGCGCGACG CTGAGGTGGC GAAAAAGTAC GAAAATGCGT TTCCAAAGCT GAAGTTATTC ACCATTGATG AAGAGTTCGG CGGCTGGACG AAAGCGCAAA AAGAGCATTT TGCTAACGGC GGTACGTTCG ATCAGATCAG CAAACGCTGA
|
Protein sequence | MNKWGVGLTF LLAATSVMAK DIQLLNVSYD PTRELYEQYN KAFSAHWKQQ TGDNVVIRQS HGGSGKQATS VINGIEADVV TLALAYDVDA IAERGRIDKE WIKRLPDNSA PYTSTIVFLV RKGNPKQIHD WNDLIKPGVS VITPNPKSSG GARWNYLAAW GYALHHNNND QAKAQDFVRA LYKNVEVLDS GARGSTNTFV ERGIGDVLIA WENEALLAAN ELGKDKFEIV TPSESILAEP TVSVVDKVVE KKGTKEVAEA YLKYLYSPEG QEIAAKNYYR PRDAEVAKKY ENAFPKLKLF TIDEEFGGWT KAQKEHFANG GTFDQISKR
|
| |
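The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. The helper name `translate` is illustrative.

```python
# Codon table built in the conventional TCAG order. The amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here;
    this record starts with ATG, so that case does not arise.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAACAAGT GGGGCGTAGG GTTAACATTT"))  # MNKWGVGLTF
```

Passing the full Gene sequence field to `translate` should yield the Protein sequence field, if the record is internally consistent; only the first ten residues have been checked here.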