Gene EcE24377A_1501 details


Gene Information

Locus tag: EcE24377A_1501
Symbol: sapA
ID: 5590636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1487194
End bp: 1488837
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 54%
IMG OID: 640925193
Product: peptide ABC transporter, periplasmic peptide-binding protein
Protein accession: YP_001462598
Protein GI: 157158938
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
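
The coordinates and lengths listed above can be checked against each other with a little arithmetic. The sketch below is only an illustration and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1487194
end_bp = 1488837
gene_length_bp = 1644
protein_length_aa = 547

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1644 0 547
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.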


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCAGG TATTATCGTC TCTTTTGGTG ATTGCTGGAC TTGTGAGTGG TCAGGCAATC 
GCCGCGCCTG AATCCCCCCC GCATGCTGAT ATCCGCGACA GCGGTTTTGT CTATTGCGTC
AGCGGGCAAG TCAACACCTT TAACCCATCC AAAGCGAGCA GTGGGTTAAT TGTCGATACC
CTTGCCGCCC AGTTTTATGA TCGACTGCTG GATGTCGATC CCTATACCTA TCGCCTGATG
CCGGAACTTG CCGAAAGCTG GGAAGTACTC GACAACGGCG CGACCTATCG CTTCCACCTG
CGTCGCGATG TTCCGTTTCA AAAAACCGAC TGGTTTACTC CCACTCGTAA AATGAATGCC
GACGATGTGG TGTTTACCTT CCAGCGAATT TTTGACCGCA ACAACCCGTG GCATAACGTC
AACGGCAGCA ACTTCCCCTA CTTCGACAGC CTGCAATTTG CCGATAACGT GAAAAGCGTC
CGCAAACTGG ATAATCATAC CGTTGAGTTC CGACTCGCCC AGCCGGATGC TTCTTTTTTG
TGGCACCTCG CAACCCATTA TGCTTCGGTC ATGTCGGCAG AATATGCCCG GAAGTTAGAG
AAAGAAGATC GCCAGGAGCA ACTCGACCGT CAACCGGTCG GCACCGGTCC GTATCAGTTG
TCGGAATACC GCGCCGGGCA ATTTATTCGC CTACAACGTC ATGATGACTT CTGGCGCGGT
AAACCGTTAA TGCCGCAGGT AGTGGTGGAT TTAGGCTCCG GCGGCACCGG ACGTCTGTCG
AAACTCCTGA CCGGGGAATG CGACGTTCTG GCCTGGCCTG CTGCCAGCCA GCTATCCATT
TTGCGTGACG ATCCGCGCTT GCGTTTAACG CTGCGTCCTG GGATGAACGT CGCCTATCTG
GCATTTAACA CCGCCAAACC GCCGCTAAAT AATCCCGCTG TCCGCCATGC ACTGGCACTG
GCGATTAATA ACCAGCGCCT GATGCAATCC ATCTATTATG GTACGGCTGA AACGGCGGCC
TCTATTTTAC CGCGCGCCTC GTGGGCCTAT GACAACGAGG CTAAAATTAC TGAATACAAT
CCGGCGAAAT CGCGCGAACA GTTGAAGGCG TTGGGGCTGG AAAACTTAAC GCTGAAACTG
TGGGTGCCCA CACGTTCGCA GGCGTGGAAC CCCAGTCCAC TGAAAACTGC CGAACTGATT
CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTGATTG TGCCGGTAGA AGGTCGCTTT
CAGGAGGCGC GGTTGATGGA TATGAGCCAT GATCTGACGT TATCCGGTTG GGCGACGGAC
AGTAACGACC CGGACAGTTT CTTCCGTCCG TTACTGAGCT GCGCGGCAAT TCATTCACAG
ACCAACCTCG CCCACTGGTG CGATCCGAAA TTCGACAGCG TGTTGCGTAA GGCGCTCTCC
TCGCAGCAGC TGGCAGCGCG TATTGAAGCC TATGACGAAG CGCAGAGTAT TCTGGCGCAG
GAATTACCTA TTCTGCCGCT GGCGTCGTCA TTGCGTTTGC AGGCCTATCG GTACGATATC
AAAGGTCTGG TACTTAGCCC GTTTGGTAAC GCCTCCTTTG CTGGGGTGTA TCGCGAGAAA
CAGGATGAGG TGAAAAAACC ATGA
 
Protein sequence
MRQVLSSLLV IAGLVSGQAI AAPESPPHAD IRDSGFVYCV SGQVNTFNPS KASSGLIVDT 
LAAQFYDRLL DVDPYTYRLM PELAESWEVL DNGATYRFHL RRDVPFQKTD WFTPTRKMNA
DDVVFTFQRI FDRNNPWHNV NGSNFPYFDS LQFADNVKSV RKLDNHTVEF RLAQPDASFL
WHLATHYASV MSAEYARKLE KEDRQEQLDR QPVGTGPYQL SEYRAGQFIR LQRHDDFWRG
KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLSI LRDDPRLRLT LRPGMNVAYL
AFNTAKPPLN NPAVRHALAL AINNQRLMQS IYYGTAETAA SILPRASWAY DNEAKITEYN
PAKSREQLKA LGLENLTLKL WVPTRSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF
QEARLMDMSH DLTLSGWATD SNDPDSFFRP LLSCAAIHSQ TNLAHWCDPK FDSVLRKALS
SQQLAARIEA YDEAQSILAQ ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVYREK
QDEVKKP
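
The gene and protein sequences above are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code for elongation. The following is a minimal sketch of that relationship, not the method used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code ignores alternative start codons, which does not matter here because the gene starts with ATG. Only the first and last blocks of the gene sequence are used as inputs.

```python
from itertools import product

# Codon table in the NCBI layout: bases ordered T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing used in the display above and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases; assumes a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last three blocks of the gene sequence shown above.
first_block = "ATGCGCCAGG TATTATCGTC TCTTTTGGTG"
last_block = "CAGGATGAGG TGAAAAAACC ATGA"

print(translate(first_block))  # MRQVLSSLLV
print(translate(last_block))   # QDEVKKP
```

The two outputs correspond to the first ten and last seven residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.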