Gene EcSMS35_2220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2220 
Symbol 
ID6145436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2238564 
End bp2239994 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content48% 
IMG OID641617096 
Productamino acid permease family protein 
Protein accessionYP_001744270 
Protein GI170680872 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.52526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGGA ATGTTCAGGA AAAACAGCTG CGATGGTACA ACATTGCGCT GATGTCTTTT 
ATCACTGTCT GGGGTTTTGG CAACGTTGTT AATAACTACG CCAACCAGGG GCTGGTGGTT
GTTTTTTCAT GGGTGTTTAT CTTTGCGCTC TATTTCACAC CTTATGCGCT AATTGTTGGT
CAGTTAGGCT CGACCTTCAA AGATGGGAAG GGCGGGGTCA GTACCTGGAT TAAACACACG
ATGGGACCCG GACTGGCTTA CCTCGCCGCG TGGACCTACT GGGTAGTGCA TATTCCCTAT
CTGGCACAAA AACCCCAGGC AATACTGATT GCGCTCGGTT GGGCGATGAA AGGCGACGGT
TCGTTAATCA AAGAATATTC AGTCGTAGCG TTACAGGGGT TAACGCTGGT GCTGTTTATC
TTCTTTATGT GGGTTGCTTC ACGCGGTATG AAATCGCTGA AAATCGTCGG TTCTGTGGCA
GGGATTGCGA TGTTTGTTAT GTCGCTCCTG TATGTGGCGA TGGCGGTAAC CGCGCCTGCG
ATTACTGAAG TGCATATTGC GACCACAAAC ATTACCTGGG AAACGTTCAT TCCTCATATC
GACTTTACCT ACATCACCAC TATTTCAATG CTGGTTTTCG CGGTTGGCGG AGCAGAGAAG
ATTTCTCCTT ACGTTAATCA AACGCGCAAC CCAGGAAAAG AATTTCCAAA AGGGATGTTA
TGCCTGGCGG TGATGGTTGC GGTTTGTGCC ATTCTGGGCT CGCTGGCGAT GGGGATGATG
TTTGATTCGC GTAATATCCC GGATGACTTA ATGACTAACG GTCAGTATTA CGCCTTTCAG
AAACTGGGCG AGTATTACAA CATGGGTAAT ACTTTAATGG TGATTTACGC CATTGCGAAT
ACCCTGGGAC AAGTAGCTGC GCTGGTATTC TCGATTGATG CGCCGCTTAA AGTACTATTA
GGCGATGCTG ATAGCAAATA TATTCCAGCC AGTTTATGTC GTACCAACGC TTCTGGTACG
CCCGTTAATG GCTATTTTCT GACCCTGGTG CTGGTGGCGA TCCTGATTAT GCTGCCGACG
CTGGGTATCG GCGATATGAA TAATCTCTAC AAGTGGTTGC TGAATCTTAA CTCGGTGGTG
ATGCCACTAC GTTATCTGTG GGTATTTGTT GCATTTATTG CAGTCGTTCG TCTGGCGCAG
AAATATAAAC CAGAGTATGT CTTTATTCGT AACAGGCCGC TGGCGATGAC TGTCGGGATC
TGGTGTTTTA CCTTTACCGC TTTTGCCTGC CTGACAGGGA TCTTCCCGAA AATGGAAGCC
TTCACTGCAG AGTGGACCTT CCAGTTAGCG CTGAATGTTG CAACGCCGTT TGTCCTGGTT
GGACTGGGGC TGATATTCCC GCTGCTGGCG CGTAAAGCGA ATAGTAAATA A
 
Protein sequence
MAGNVQEKQL RWYNIALMSF ITVWGFGNVV NNYANQGLVV VFSWVFIFAL YFTPYALIVG 
QLGSTFKDGK GGVSTWIKHT MGPGLAYLAA WTYWVVHIPY LAQKPQAILI ALGWAMKGDG
SLIKEYSVVA LQGLTLVLFI FFMWVASRGM KSLKIVGSVA GIAMFVMSLL YVAMAVTAPA
ITEVHIATTN ITWETFIPHI DFTYITTISM LVFAVGGAEK ISPYVNQTRN PGKEFPKGML
CLAVMVAVCA ILGSLAMGMM FDSRNIPDDL MTNGQYYAFQ KLGEYYNMGN TLMVIYAIAN
TLGQVAALVF SIDAPLKVLL GDADSKYIPA SLCRTNASGT PVNGYFLTLV LVAILIMLPT
LGIGDMNNLY KWLLNLNSVV MPLRYLWVFV AFIAVVRLAQ KYKPEYVFIR NRPLAMTVGI
WCFTFTAFAC LTGIFPKMEA FTAEWTFQLA LNVATPFVLV GLGLIFPLLA RKANSK