Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0529 |
Symbol | |
ID | 6143733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 540292 |
End bp | 541584 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615423 |
Product | amino acid permease family protein |
Protein accession | YP_001742630 |
Protein GI | 170681675 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA CGGAAGGTAA TAACGGTAAC AAACCCCTCG GTCTATGGAA CGTCGTTTCC ATCGGCATTG GGGCAATGGT GGGGGCGGGG ATCTTCGCGC TGCTGGGGCA GGCTGCGTTG CTAATGGAAG CCTCGACCTG GGTCGCCTTT GCTTTTGGCG GTATTGTGGC GATGTTTTCC GGTTATGCCT ATGCGCGCCT GGGGGCGAGC TATCCCAGCA ATGGCGGTAT TATCGACTTC TTTCGTCGCG GATTAGGCAA CGGCGTCTTT TCGCTGGCAC TCTCGTTATT GTACCTGTTG ACGCTGGCGG TGAGCATCGC CATGGTCGCC CGTGCTTTTG GCGCTTATGC CGTGCAGTTT TTGCATGAAG GCAGTCAGGA GGAGCACCTT ATATTGCTCT ACGCGTTGGG GATCATTGCG GTGATGACGC TTTTCAACTC CTTAAGCAAC CATGCGGTAG GGCGGCTGGA AGTGATCCTC GTCGGCATTA AAATGATGAT CCTGTTATTG CTGATCATTG CAGGTGTCTG GTCGCTGCAA CCGGCACATA TTTCCGTCTC TGCGCCCCCC AGCTCCGGTG CGTTCTTCTC CTGTATTGGG ATAACCTTCC TTGCCTATGC GGGCTTTGGC ATGATGGCGA ACGCGGCGGA TAAAGTGAAA GATCCGCAGA TCATTATGCC ACGGGCGTTT CTGGTGGCGA TTGGCGTTAC CACGTTGCTT TATATCTCGC TGGCGCTGGT TTTGCTTAGC GATGTATCGG CATTAGAGTT AGAAAAATAT GCCGATACCG CCGTAGCGCA GGCTGCTTCT CCGCTGCTCG GGCATGTGGG TTATGTGATC GTCGTCATCG GCGCTTTACT GGCGACGGCT TCAGCCATCA ACGCGAACCT GTTCGCCGTG TTTAACATCA TGGACAACAT GGGCAGCGAA CGCGAACTGC CGAAGCTAAT GAATAAATCC CTGTGGCGGC AGAGTACCTG GGGCAACATC ATTGTCGTGG TGCTGATTAT GCTGATGACG GCGGCGCTGA ATTTAGGCTC ACTCGCCAGC GTTGCCAGCG CCACCTTTTT GATCTGCTAT CTGGCGGTGT TTGTGGTGGC GATCCGCCTG CGTCATGATA TTCACGCCTC GTTGCCGATT CTTATCGTTG GTACGTCGGT GATGTTGTTG GTGATCGTTG GCTTTATCTA CAGCCTGTGG TCCCAGGGCA GCCGTGCGTT GATATGGATT ATTGGCGCAA TCTTACTCAG CCTTATTGTG GCAATGGTCA TGAAGCGCAA TAAAACCGTA TAA
|
Protein sequence | MMNTEGNNGN KPLGLWNVVS IGIGAMVGAG IFALLGQAAL LMEASTWVAF AFGGIVAMFS GYAYARLGAS YPSNGGIIDF FRRGLGNGVF SLALSLLYLL TLAVSIAMVA RAFGAYAVQF LHEGSQEEHL ILLYALGIIA VMTLFNSLSN HAVGRLEVIL VGIKMMILLL LIIAGVWSLQ PAHISVSAPP SSGAFFSCIG ITFLAYAGFG MMANAADKVK DPQIIMPRAF LVAIGVTTLL YISLALVLLS DVSALELEKY ADTAVAQAAS PLLGHVGYVI VVIGALLATA SAINANLFAV FNIMDNMGSE RELPKLMNKS LWRQSTWGNI IVVVLIMLMT AALNLGSLAS VASATFLICY LAVFVVAIRL RHDIHASLPI LIVGTSVMLL VIVGFIYSLW SQGSRALIWI IGAILLSLIV AMVMKRNKTV
|
| |