Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2220 |
Symbol | |
ID | 6145436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2238564 |
End bp | 2239994 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617096 |
Product | amino acid permease family protein |
Protein accession | YP_001744270 |
Protein GI | 170680872 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.52526 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGGA ATGTTCAGGA AAAACAGCTG CGATGGTACA ACATTGCGCT GATGTCTTTT ATCACTGTCT GGGGTTTTGG CAACGTTGTT AATAACTACG CCAACCAGGG GCTGGTGGTT GTTTTTTCAT GGGTGTTTAT CTTTGCGCTC TATTTCACAC CTTATGCGCT AATTGTTGGT CAGTTAGGCT CGACCTTCAA AGATGGGAAG GGCGGGGTCA GTACCTGGAT TAAACACACG ATGGGACCCG GACTGGCTTA CCTCGCCGCG TGGACCTACT GGGTAGTGCA TATTCCCTAT CTGGCACAAA AACCCCAGGC AATACTGATT GCGCTCGGTT GGGCGATGAA AGGCGACGGT TCGTTAATCA AAGAATATTC AGTCGTAGCG TTACAGGGGT TAACGCTGGT GCTGTTTATC TTCTTTATGT GGGTTGCTTC ACGCGGTATG AAATCGCTGA AAATCGTCGG TTCTGTGGCA GGGATTGCGA TGTTTGTTAT GTCGCTCCTG TATGTGGCGA TGGCGGTAAC CGCGCCTGCG ATTACTGAAG TGCATATTGC GACCACAAAC ATTACCTGGG AAACGTTCAT TCCTCATATC GACTTTACCT ACATCACCAC TATTTCAATG CTGGTTTTCG CGGTTGGCGG AGCAGAGAAG ATTTCTCCTT ACGTTAATCA AACGCGCAAC CCAGGAAAAG AATTTCCAAA AGGGATGTTA TGCCTGGCGG TGATGGTTGC GGTTTGTGCC ATTCTGGGCT CGCTGGCGAT GGGGATGATG TTTGATTCGC GTAATATCCC GGATGACTTA ATGACTAACG GTCAGTATTA CGCCTTTCAG AAACTGGGCG AGTATTACAA CATGGGTAAT ACTTTAATGG TGATTTACGC CATTGCGAAT ACCCTGGGAC AAGTAGCTGC GCTGGTATTC TCGATTGATG CGCCGCTTAA AGTACTATTA GGCGATGCTG ATAGCAAATA TATTCCAGCC AGTTTATGTC GTACCAACGC TTCTGGTACG CCCGTTAATG GCTATTTTCT GACCCTGGTG CTGGTGGCGA TCCTGATTAT GCTGCCGACG CTGGGTATCG GCGATATGAA TAATCTCTAC AAGTGGTTGC TGAATCTTAA CTCGGTGGTG ATGCCACTAC GTTATCTGTG GGTATTTGTT GCATTTATTG CAGTCGTTCG TCTGGCGCAG AAATATAAAC CAGAGTATGT CTTTATTCGT AACAGGCCGC TGGCGATGAC TGTCGGGATC TGGTGTTTTA CCTTTACCGC TTTTGCCTGC CTGACAGGGA TCTTCCCGAA AATGGAAGCC TTCACTGCAG AGTGGACCTT CCAGTTAGCG CTGAATGTTG CAACGCCGTT TGTCCTGGTT GGACTGGGGC TGATATTCCC GCTGCTGGCG CGTAAAGCGA ATAGTAAATA A
|
Protein sequence | MAGNVQEKQL RWYNIALMSF ITVWGFGNVV NNYANQGLVV VFSWVFIFAL YFTPYALIVG QLGSTFKDGK GGVSTWIKHT MGPGLAYLAA WTYWVVHIPY LAQKPQAILI ALGWAMKGDG SLIKEYSVVA LQGLTLVLFI FFMWVASRGM KSLKIVGSVA GIAMFVMSLL YVAMAVTAPA ITEVHIATTN ITWETFIPHI DFTYITTISM LVFAVGGAEK ISPYVNQTRN PGKEFPKGML CLAVMVAVCA ILGSLAMGMM FDSRNIPDDL MTNGQYYAFQ KLGEYYNMGN TLMVIYAIAN TLGQVAALVF SIDAPLKVLL GDADSKYIPA SLCRTNASGT PVNGYFLTLV LVAILIMLPT LGIGDMNNLY KWLLNLNSVV MPLRYLWVFV AFIAVVRLAQ KYKPEYVFIR NRPLAMTVGI WCFTFTAFAC LTGIFPKMEA FTAEWTFQLA LNVATPFVLV GLGLIFPLLA RKANSK
|
| |