Gene EcHS_A1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1408 
SymbolsapA 
ID5592605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1402921 
End bp1404564 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content54% 
IMG OID640920563 
Productpeptide ABC transporter, periplasmic peptide-binding protein 
Protein accessionYP_001458122 
Protein GI157160804 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCAGG TATTATCGTC TCTTTTGGTG ATTGCTGGAC TTGTGAGTGG TCAGGCAATC 
GCCGCGCCTG AATCTCCCCC GCATGCTGAT ATCCGCGACA GCGGTTTTGT CTATTGCGTC
AGCGGGCAAG TCAACACCTT TAACCCATCC AAAGCGAGCA GTGGGTTAAT TGTCGATACC
CTTGCCGCCC AGTTTTATGA TCGACTGCTG GATGTCGATC CCTATACCTA TCGCCTGATG
CCGGAACTTG CCGAAAGCTG GGAAGTACTC GACAACGGCG CGACCTATCG CTTCCACCTG
CGTCGCGATG TTCCGTTTCA AAAAACCGAC TGGTTTACTC CCACTCGTAA AATGAATGCC
GACGATGTGG TGTTTACCTT CCAGCGAATT TTTGACCGCA ACAACCCGTG GCATAACGTC
AACGGCAGCA ACTTCCCCTA CTTCGACAGC CTGCAATTTG CCGATAACGT GAAAAGCGTC
CGCAAACTGG ATAATCATAC CGTTGAGTTC CGACTCGCCC AGCCGGATGC TTCTTTTTTG
TGGCACCTCG CAACCCATTA TGCTTCGGTC ATGTCGGCAG AATATGCCCG GAAGTTAGAG
AAAGAAGATC GCCAGGAGCA ACTCGACCGT CAACCGGTCG GCACCGGTCC GTATCAGTTG
TCGGAATACC GCGCCGGGCA ATTTATTCGC CTACAACGTC ATGATGACTT CTGGCGCGGT
AAACCGTTAA TGCCGCAGGT AGTGGTGGAT TTAGGCTCCG GCGGCACCGG ACGTCTGTCG
AAACTCCTGA CCGGGGAATG CGACGTTCTG GCCTGGCCTG CTGCCAGCCA GCTATCCATT
TTGCGTGACG ACCCGCGCTT GCGTTTAACG CTGCGTCCTG GGATGAACGT CGCCTATCTG
GCATTTAACA CCGCCAAACC GCCGCTAAAT AATCCCGCTG TCCGCCATGC GCTGGCACTG
GCGATTAATA ACCAGCGCCT GATGCAATCC ATCTATTATG GTACGGCTGA AACGGCGGCC
TCTATTTTAC CGCGCGCCTC GTGGGCCTAT GACAACGAGG CTAAAATTAC TGAATACAAT
CCGGCGAAAT CGCGCGAACA GTTGAAGTCG TTGGGGCTGG AAAATTTAAC GCTGAAACTG
TGGGTGCCCA CACGTTCGCA GGCGTGGAAC CCCAGTCCAC TGAAAACTGC CGAACTGATT
CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTGATTG TGCCGGTAGA AGGTCGCTTT
CAGGAGGCGC GGTTGATGGA TATGAGCCAT GATCTGACGT TATCCGGTTG GGCGACGGAC
AGTAACGACC CGGACAGTTT CTTCCGTCCT TTACTGAGCT GCGCGGCAAT TCATTCACAG
ACCAACCTCG CCCACTGGTG CGATCCGAAA TTCGACAGCG TGTTGCGTAA GGCGCTCTCC
TCGCAGCAGC TGGCGGCGCG TATTGAAGCC TATGACGAAG CGCAGAGTAT TCTGGCGCAG
GAATTGCCCA TTTTGCCGCT GGCGTCGTCA TTGCGTTTGC AGGCCTATCG GTACGATATC
AAAGGTCTGG TACTTAGCCC GTTTGGTAAC GCCTCCTTTG CTGGGGTGTA TCGCGAGAAA
CAGGATGAGG TGAAAAAACC ATGA
 
Protein sequence
MRQVLSSLLV IAGLVSGQAI AAPESPPHAD IRDSGFVYCV SGQVNTFNPS KASSGLIVDT 
LAAQFYDRLL DVDPYTYRLM PELAESWEVL DNGATYRFHL RRDVPFQKTD WFTPTRKMNA
DDVVFTFQRI FDRNNPWHNV NGSNFPYFDS LQFADNVKSV RKLDNHTVEF RLAQPDASFL
WHLATHYASV MSAEYARKLE KEDRQEQLDR QPVGTGPYQL SEYRAGQFIR LQRHDDFWRG
KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLSI LRDDPRLRLT LRPGMNVAYL
AFNTAKPPLN NPAVRHALAL AINNQRLMQS IYYGTAETAA SILPRASWAY DNEAKITEYN
PAKSREQLKS LGLENLTLKL WVPTRSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF
QEARLMDMSH DLTLSGWATD SNDPDSFFRP LLSCAAIHSQ TNLAHWCDPK FDSVLRKALS
SQQLAARIEA YDEAQSILAQ ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVYREK
QDEVKKP