Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3626 |
Symbol | |
ID | 5594210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3609624 |
End bp | 3610739 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640922743 |
Product | hypothetical protein |
Protein accession | YP_001460224 |
Protein GI | 157162906 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGCGTTT TTTTAGTCTC ATTAGTGGGT GGTGAGTACT GGTGGATGGT CATTATTCCC GTTGGGGCGC ATATCAGTTT TTCGCTGGGA TACGCCTGGC CGACCAGATA TCCTTTATCC GGCACGTCCG GACTACGTTG CCGTAACTTA CTCCTGTTTC TACTTCTCTT ACTTGGTATT GTCGCCGGGT ATCAGGCCCA TTTATATAAG CAGCAAAATC CTGGTGTCGG TGTACGCGAA AATATTGATA TCTGGGCCTG GCGACCCGAT AAACTCAATA ATCGACTGAC GCCGCTGCGT GGCAAACCGC AAATTCAGTT TAGGCAAAAC TGGCCGCGAA TCGATGGCGC CACGGCTGCG TACCCAATTT ATGCTTCTGC ATTTTATGCA TTAAGTGTAA TACCAGAGGA TTTTCACGTT TGGGATTATC TGGAAAACTC TCGTACGCCA GAAGCATATA ACAAAATCGT TAAGGGCGAT GCTGATATTA TTTTCGTGGC GCAACCTTCC GGTGGGCAAA AAAAACGTGC AAAGGAATCT GGTGTCACTT TGCTGTACAC CCCCTTTGCC CGTGAAGCAT TCGTTTTCAT CGTCAATGCG GATAATCCGG TTAATTCCCT GACTGAACAA CAGGTGCGTG ACATTTTCAG TGGTGCAATT ACCAATTGGC GTGCTGTTGG CGGTAACGAT CAGGAGATCC AGACCTGGCA GCGCCCGGAA GACTCTGGCA GCCAGACAGT GATGCAATCA CAGGTCATGA AAAATGTCCG CATGATCTCG CCGCAGGAAA CGAAAGTGGC AAGCGTGATG GAGGGAATGA TTAAAGTCGT TGCCGAATAC CGTAATACAA ACAACGCAAT AGGCTATACC TTCCGCTATT ACGCGACGCA AATGAATGCT GATAAAAATA TAAAATTGCT AGCGATTAAC GGTATTACAC CGACGGCGGA AAACATTCGC AACGGCAAAT ATGCGTACAT CGTCGATGCA TTTATGGTGA CGAGAGAAAA TACAACGTCA GAAACACAAA AACTGGTCGA ATGGTTTTTA ACGCCGCAGG GGCAGAGTCT GGTAGAAGAT GTGGGATATG TGCCGCTGTA TCCAACAATG GAATAA
|
Protein sequence | MGVFLVSLVG GEYWWMVIIP VGAHISFSLG YAWPTRYPLS GTSGLRCRNL LLFLLLLLGI VAGYQAHLYK QQNPGVGVRE NIDIWAWRPD KLNNRLTPLR GKPQIQFRQN WPRIDGATAA YPIYASAFYA LSVIPEDFHV WDYLENSRTP EAYNKIVKGD ADIIFVAQPS GGQKKRAKES GVTLLYTPFA REAFVFIVNA DNPVNSLTEQ QVRDIFSGAI TNWRAVGGND QEIQTWQRPE DSGSQTVMQS QVMKNVRMIS PQETKVASVM EGMIKVVAEY RNTNNAIGYT FRYYATQMNA DKNIKLLAIN GITPTAENIR NGKYAYIVDA FMVTRENTTS ETQKLVEWFL TPQGQSLVED VGYVPLYPTM E
|
| |