Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0565 |
Symbol | |
ID | 5593681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 578742 |
End bp | 580034 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640919750 |
Product | amino acid permease family protein |
Protein accession | YP_001457333 |
Protein GI | 157160015 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAACA CGGAAGGTAA TAACGGTAAC AAACCTCTCG GTCTATGGAA CGTCGTTTCC ATCGGCATTG GGGCAATGGT GGGGGCGGGG ATCTTCGCGC TGCTGGGGCA GGCTGCATTG CTAATGGAAG CCTCGACCTG GGTCGCCTTT GCTTTTGGCG GTATTGTGGC GATGTTTTCC GGTTATGCCT ATGCGCGTCT GGGGGCGAGC TATCCCAGCA ATGGCGGCAT TATCGACTTC TTTCGTCGCG GATTAGGCAA CGGCGTCTTT TCGCTGGCGC TCTCGTTACT GTACCTGTTG ACGCTGGCGG TGAGCATCGC CATGGTCGCC CGTGCTTTTG GCGCTTATGC CGTGCAGTTT TTGCATGAAG GCAGCCAGGA GGAGCACCTT ATTTTGCTCT ACGCGTTGGG GATCATTGCG GTGATGACGC TTTTCAACTC CTTAAGCAAC CATGCGGTAG GGCGGCTGGA AGTGATCCTC GTCGGCATTA AAATGATGAT CCTGTTATTG CTGATTATTG CCGGTGTCTG GTCGCTGCAA CCGGCGCATA TTTCCGTCTC TGCGCCCCCC AGCTCCGGTG CGTTCTTCTC CTGTATTGGG ATAACTTTCC TTGCCTATGC GGGCTTTGGC ATGATGGCGA ACGCGGCGGA TAAAGTGAAA GATCCGCAGG TCATTATGCC ACGGGCGTTT CTGGTGGCGA TTGGCGTTAC CACGTTGCTT TATATCTCGC TGGCACTGGT TTTGCTTAGC GATGTATCGG CATTAGAGTT AGAAAAATAT GCCGATACCG CCGTAGCGCA GGCTGCTTCT CCGCTGCTCG GGCATGTGGG TTATGTGATC GTCGTCATCG GCGCTTTACT GGCGACGGCT TCAGCCATTA ACGCGAACCT GTTCGCCGTG TTTAACATCA TGGACAACAT GGGCAGCGAA CGCGAACTGC CGAAGCTAAT GAATAAATCC CTGTGGCGGC AGAGTACCTG GGGCAACATT ATTGTCGTGG TGTTGATTAT GCTGATGACG GCGGCACTGA ATTTAGGCTC ACTCGCCAGC GTTGCCAGCG CCACCTTTTT GATTTGCTAC CTGGCGGTGT TTGTGGTGGC GATCCGCCTG CGTCATGATA TTCACGCCTC GTTGCCGATT CTTATCGTTG GTACGTTGGT GATGTTGTTG GTGATCGTTG GCTTTATCTA CAGTCTGTGG TCCCAGGGTA GCCGTGCGTT GATATGGATT ATTGGCTCAC TCTTACTCAG CCTTATTGTG GCAATGGTCA TGAAGCGCAA TAAAACCGTA TAA
|
Protein sequence | MMNTEGNNGN KPLGLWNVVS IGIGAMVGAG IFALLGQAAL LMEASTWVAF AFGGIVAMFS GYAYARLGAS YPSNGGIIDF FRRGLGNGVF SLALSLLYLL TLAVSIAMVA RAFGAYAVQF LHEGSQEEHL ILLYALGIIA VMTLFNSLSN HAVGRLEVIL VGIKMMILLL LIIAGVWSLQ PAHISVSAPP SSGAFFSCIG ITFLAYAGFG MMANAADKVK DPQVIMPRAF LVAIGVTTLL YISLALVLLS DVSALELEKY ADTAVAQAAS PLLGHVGYVI VVIGALLATA SAINANLFAV FNIMDNMGSE RELPKLMNKS LWRQSTWGNI IVVVLIMLMT AALNLGSLAS VASATFLICY LAVFVVAIRL RHDIHASLPI LIVGTLVMLL VIVGFIYSLW SQGSRALIWI IGSLLLSLIV AMVMKRNKTV
|
| |