Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3872 |
Symbol | |
ID | 5592341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3867831 |
End bp | 3868754 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640922982 |
Product | carboxylate/amino acid/amine transporter |
Protein accession | YP_001460459 |
Protein GI | 157163141 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | [TIGR00950] Carboxylate/Amino Acid/Amine Transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCCA CCAGAAAGGG GATGCTGAAC GTTCTGATTG CCGCCGTGTT GTGGGGAAGT TCAGGGGTCT GCGCGCAATA CATCATGGAG CAAAGCCAGA TGTCGTCGCA GTTTTTGACT ATGACGCGTT TGATATTCGC CGGTTTGATT CTACTGACGC TGTCATTTGT TCATGGCGAT AAAATCTTTT CTATTATTAA CAATCATAAA GATGCCATTA GCCTGCTGAT TTTTTCCGTG GTTGGCGCGC TAACTGTACA GCTCACTTTT TTGCTAACCA TCGAAAAATC GAACGCAGCC ACGGCAACGG TGCTGCAATT CCTCTCACCG ACGATTATCG TCGCCTGGTT CTCACTGGTG CGTAAATCGC GCCCGGGCAT TCTGGTTTTC TGCGCTATTT TGACATCGCT GGTCGGGACT TTTTTATTGG TGACACACGG TAATCCGACG TCATTATCGA TCTCTCCTGC CGCGTTGTTC TGGGGCATTG CCTCGGCATT TGCTGCTGCA TTCTATACCA CCTATCCCTC AACGCTAATT GCCCGCTATG GCACGTTGTC TGTCGTCGGC TGGAGTATGC TGATCGGCGG TCTGATTCTG TTGCCTTTTT ATGCCAGACA AGGGACAAAC TTTGTCGTTA ACGGCAGTTT GATTTTGGCG TTTTTTTATT TGATCGTCAT TGGTACGTCC CTGACATTTA GTCTGTACCT GAAAGGAGCA CAATTAATTG GCGGTCCAAA AGCCAGCATT TTGAGCTGTG CAGAACCATT AAGTAGCGCG CTACTCTCTT TGCTGTTGCT GGGGATCACG TTCACATTAC CGGACTGGCT GGGAACGCTG CTGATTCTGT CATCGGTAAT TTTGATTTCA ATGGATTCCC GTCGCCGCGC CAGAAAAATA AATCGTCCGG CGCGGGATGA GTGA
|
Protein sequence | MGSTRKGMLN VLIAAVLWGS SGVCAQYIME QSQMSSQFLT MTRLIFAGLI LLTLSFVHGD KIFSIINNHK DAISLLIFSV VGALTVQLTF LLTIEKSNAA TATVLQFLSP TIIVAWFSLV RKSRPGILVF CAILTSLVGT FLLVTHGNPT SLSISPAALF WGIASAFAAA FYTTYPSTLI ARYGTLSVVG WSMLIGGLIL LPFYARQGTN FVVNGSLILA FFYLIVIGTS LTFSLYLKGA QLIGGPKASI LSCAEPLSSA LLSLLLLGIT FTLPDWLGTL LILSSVILIS MDSRRRARKI NRPARDE
|
| |