Gene Information |
Locus tag | EcHS_A1568 |
Symbol | |
ID | 5591953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1573924 |
End bp | 1574850 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640920721 |
Product | putative ABC transport system ATP-binding protein |
Protein accession | YP_001458277 |
Protein GI | 157160959 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG1124] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | |
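The start and end coordinates, gene length, and protein length in the record above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcHS_A1568.
length_bp = gene_length(1573924, 1574850)
print(length_bp)                  # 927
print(protein_length(length_bp))  # 308
```

Both results agree with the listed Gene Length (927 bp) and Protein Length (308 aa).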
| |
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGACA CGTTATTAAC GTTACGCGAC GTCCATATCA ATTTCCCGGC CCGTAAAAAC TGGCTTGGTA AAACTACGGA ACATGTTCAT GCGATTAATG GTATTGATTT ACAGATCCGC CGTGGTGAAA CCTTAGGGAT CGTCGGCGAG TCAGGCTGCG GCAAAAGCAC CCTCGCACAG CTTTTAATGG GTATGCTGCA ACCGAGCCAC GGGCAGTACA TCCGTTCTGG CTCACAACGC ATTATGCAGA TGGTGTTTCA GGACCCGCTC TCTTCGCTCA ATCCGCGCTT ACCGGTGTGG CGCATCATCA CAGAACCGCT CTGGATAGCT AAGCGTAGTA GTGAACAACA GCGGCGAGCG TTGGCAGAGG AGCTGGCTGT GCAGGTGGGT ATTCGTCCGG AGTATCTCGA CCGCCTGCCT CATGCGTTCT CCGGCGGGCA GCGGCAACGT ATCGCCATTG CCAGAGCACT CTCTTCGCAG CCTGACGTGA TTGTGCTTGA TGAGCCAACC TCTGCGCTGG ATATCTCCGT GCAGGCGCAG ATCCTCAATT TACTGGTAAC GCTACAGGAA AATCACGGGC TGACCTATGT GCTGATTTCA CACAATGTCT CGGTGATACG TCATATGAGC GATCGGGTGG CGGTGATGTA TCTCGGGCAG ATTGTAGAAC TGGGCGACGC GCAGCAGGTG CTGACGGCAC CTGCACATCC ATACACCCGA TTATTGCTGG ATTCCCTCCC CGCCATTGAT AAACCGCTGG AGGAAGAATG GGCATTACGT AAAACGGATC TGCCGGGAAA CCGCACGTTG CCGCAAGGCT GTTTTTTCTA CGAACGTTGC CCGCTGGCAA CCCACGGATG TGAAGTCCGG CAATCATTAG CAATAAGGGA GGACGGACGT GAGCTCCGGT GCTGGCGGGC GCTGTAG
|
Protein sequence | MSDTLLTLRD VHINFPARKN WLGKTTEHVH AINGIDLQIR RGETLGIVGE SGCGKSTLAQ LLMGMLQPSH GQYIRSGSQR IMQMVFQDPL SSLNPRLPVW RIITEPLWIA KRSSEQQRRA LAEELAVQVG IRPEYLDRLP HAFSGGQRQR IAIARALSSQ PDVIVLDEPT SALDISVQAQ ILNLLVTLQE NHGLTYVLIS HNVSVIRHMS DRVAVMYLGQ IVELGDAQQV LTAPAHPYTR LLLDSLPAID KPLEEEWALR KTDLPGNRTL PQGCFFYERC PLATHGCEVR QSLAIREDGR ELRCWRAL
|
| |
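The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is the standard amino acid assignment shared by NCBI tables 1 and 11, and the set of table 11 start codons is an assumption based on the NCBI genetic code listing. The gene sequence in this record is shown 5' to 3' in coding orientation even though the gene lies on the minus strand, so no reverse complement is applied.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS; a recognized start codon is read as M, stop ends it."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
prefix = "GTGTCTGACA CGTTATTAAC GTTACGCGAC"
print(translate(prefix))  # MSDTLLTLRD
```

The output matches the first ten residues of the listed protein sequence, which shows why the GTG start codon appears as M in the protein. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked the same way.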