Gene Information |
Locus tag | EcDH1_2788 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2986223 |
End bp | 2987335 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX40421 |
Protein GI | 260449999 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCT TAAATAAAAA ATGGCTATCG GGTCTGGTTG CGGGTGCTCT GATGGCCGTC TCTGTCGGCA CGCTCGCGGC TGAACAAAAA ACACTCCACA TTTATAACTG GTCTGATTAT ATCGCCCCGG ACACGGTGGC CAATTTTGAA AAAGAAACCG GTATTAAAGT CGTCTACGAT GTTTTCGACT CTAACGAAGT ACTGGAAGGC AAATTAATGG CCGGGAGTAC CGGCTTTGAT CTGGTGGTTC CATCTGCCAG CTTTCTGGAG CGCCAGTTGA CTGCGGGAGT TTTCCAGCCG CTGGACAAAA GCAAATTGCC GGAGTGGAAG AATCTCGACC CGGAACTGCT GAAGCTGGTC GCCAAACACG ATCCCGACAA TAAATTTGCT ATGCCCTATA TGTGGGCGAC GACCGGGATT GGCTATAACG TTGATAAAGT TAAAGCGGTG CTGGGCGAAA ACGCGCCCGT CGATAGCTGG GACTTGATCC TCAAACCTGA AAATCTGGAA AAACTGAAAA GCTGCGGTGT CTCTTTCCTG GATGCGCCAG AAGAAGTTTT TGCTACCGTG TTGAATTATC TCGGCAAAGA TCCCAACAGC ACTAAAGCGG ATGATTACAC CGGACCGGCA ACAGATCTGC TGTTAAAGCT GCGCCCGAAC ATTCGTTATT TCCATTCATC TCAATACATT AACGACCTGG CAAACGGCGA TATTTGCGTC GCTATCGGCT GGGCAGGTGA TGTCTGGCAG GCGTCAAACC GCGCGAAGGA AGCGAAGAAT GGCGTGAATG TCTCGTTCTC GATTCCAAAA GAAGGGGCGA TGGCGTTCTT TGATGTATTC GCCATGCCTG CGGATGCCAA AAACAAAGAC GAAGCCTATC AGTTCCTGAA TTACCTGCTG CGCCCGGATG TAGTAGCGCA TATTTCCGAC CATGTGTTCT ATGCCAACGC CAATAAAGCA GCCACGCCGC TGGTGAGTGC GGAAGTCCGT GAGAACCCAG GTATTTATCC GCCTGCGGAT GTTCGTGCGA AGCTGTTCAC TCTGAAAGTG CAGGATCCGA AAATCGACCG TGTGCGCACC CGCGCGTGGA CCAAAGTGAA GAGCGGAAAA TAA
|
Protein sequence | MTALNKKWLS GLVAGALMAV SVGTLAAEQK TLHIYNWSDY IAPDTVANFE KETGIKVVYD VFDSNEVLEG KLMAGSTGFD LVVPSASFLE RQLTAGVFQP LDKSKLPEWK NLDPELLKLV AKHDPDNKFA MPYMWATTGI GYNVDKVKAV LGENAPVDSW DLILKPENLE KLKSCGVSFL DAPEEVFATV LNYLGKDPNS TKADDYTGPA TDLLLKLRPN IRYFHSSQYI NDLANGDICV AIGWAGDVWQ ASNRAKEAKN GVNVSFSIPK EGAMAFFDVF AMPADAKNKD EAYQFLNYLL RPDVVAHISD HVFYANANKA ATPLVSAEVR ENPGIYPPAD VRAKLFTLKV QDPKIDRVRT RAWTKVKSGK
|
| |
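The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions agree with the values listed above. The function names are hypothetical and are not part of any database API. The amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs only in its permitted start codons).

```python
# Values copied from the record above.
START_BP = 2986223
END_BP = 2987335

# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the gene length includes one stop codon."""
    return gene_len_bp // 3 - 1


def translate(seq):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record can be
    pasted in directly. Alternative start codons are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(gene_length(START_BP, END_BP))                  # 1113
print(protein_length(1113))                           # 370
print(translate("ATGACCGCCT TAAATAAAAA ATGGCTATCG"))  # MTALNKKWLS
```

The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand, so no reverse complement is needed before translating.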