Gene EcolC_2331 details


Gene Information

Locus tag: EcolC_2331
Symbol:
ID: 6066752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2569521
End bp: 2571164
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 54%
IMG OID: 641601734
Product: extracellular solute-binding protein
Protein accession: YP_001725293
Protein GI: 170020339
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
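
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2569521
end_bp = 2571164
reported_gene_length = 1644    # bp
reported_protein_length = 547  # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not
# translated, so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions, both computed values agree with the reported Gene Length and Protein Length fields.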


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.815677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCAGG TATTATCGTC TCTTTTGGTG ATTGCTGGAC TTGTGAGTGG TCAGGCAATC 
GCCGCGCCTG AATCCCCCCC GCATGCTGAT ATCCGCGACA GCGGTTTTGT CTATTGCGTC
AGCGGGCAAG TCAACACCTT TAACCCATCC AAAGCGAGCA GTGGGTTAAT TGTCGATACC
CTTGCCGCCC AGTTTTATGA TCGACTGCTG GATGTCGATC CCTATACCTA TCGCCTGATG
CCGGAACTTG CCGAAAGCTG GGAAGTACTC GACAACGGCG CGACCTATCG CTTCCACCTA
CGTCGCGATG TTCCGTTTCA AAAAACCGAC TGGTTTACTC CCACTCGTAA AATGAATGCC
GACGATGTGG TGTTTACCTT CCAGCGAATT TTTGACCGCA ACAACCCGTG GCATAACGTC
AACGGCAGCA ACTTCCCCTA CTTCGACAGC CTGCAATTTG CCGATAACGT GAAAAGCGTC
CGTAAACTGG ATAATCATAC CGTTGAGTTC CGTCTGGCTC AGCCGGATGC TTCTTTTTTG
TGGCACCTCG CAACCCATTA TGCTTCGGTC ATGTCGGCAG AATATGCCCG GAAGTTAGAG
AAAGAAGATC GCCAGGAGCA ACTCGACCGT CAACCGGTCG GCACTGGGCC ATATCAGTTG
TCGGAATACC GCGCCGGGCA ATTTATTCGC CTACAACGTC ATGATGACTT CTGGCGCGGT
AAACCGTTAA TGCCGCAGGT GGTGGTGGAT TTAGGCTCCG GCGGCACCGG ACGTCTGTCG
AAACTCCTGA CCGGGGAATG CGACGTTCTG GCCTGGCCTG CTGCCAGCCA GCTATCCATT
TTGCGTGACG ACCCGCGCTT GCGTTTAACG CTGCGTCCTG GGATGAACGT CGCCTATCTG
GCATTTAACA CCGCCAAACC GCCGCTAAAT AATCCCGCTG TCCGCCATGC GCTGGCACTG
GCGATTAATA ACCAGCGCCT GATGCAATCC ATCTATTATG GTACGGCTGA AACGGCGGCC
TCTATTTTAC CGCGCGCCTC GTGGGCCTAT GACAACGAGG CTAAAATTAC TGAATACAAT
CCGGCGAAAT CGCGCGAACA GTTGAAGGCG TTGGGGCTGG AAAATTTAAC GCTGAAACTG
TGGGTGCCCA CACGTTCGCA GGCGTGGAAC CCCAGTCCAC TGAAAACTGC CGAACTGATT
CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTGATTG TGCCGGTAGA AGGTCGCTTT
CAGGAGGCGC GGTTGATGGA TATGAGCCAT GATCTGACGT TATCCGGTTG GGCGACGGAC
AGTAACGACC CGGACAGTTT CTTCCGTCCT TTACTGAGCT GCGCGGCAAT TCATTCACAG
ACCAACCTCG CCCACTGGTG CGATCCGAAA TTCGACAGCG TGTTGCGTAA GGCGCTCTCC
TCGCAGCAGC TGGCGGCGCG TATTGAAGCC TATGACGAAG CGCAGAGTAT TCTGGCGCAG
GAATTGCCCA TTTTGCCGCT GGCGTCGTCA TTGCGTTTGC AGGCCTATCG GTACGATATC
AAAGGTCTGG TACTTAGCCC GTTTGGTAAC GCCTCCTTTG CTGGGGTGTA TCGCGAGAAA
CAGGATGAGG TGAAAAAACC ATGA
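
The GC content field in the record can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the same 10-base grouped layout; `gc_percent` is a hypothetical helper name, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks used for grouping) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene: 5 of 8 bases are G or C.
print(gc_percent("ATGC GCCA"))
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the reported GC content.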
 
Protein sequence
MRQVLSSLLV IAGLVSGQAI AAPESPPHAD IRDSGFVYCV SGQVNTFNPS KASSGLIVDT 
LAAQFYDRLL DVDPYTYRLM PELAESWEVL DNGATYRFHL RRDVPFQKTD WFTPTRKMNA
DDVVFTFQRI FDRNNPWHNV NGSNFPYFDS LQFADNVKSV RKLDNHTVEF RLAQPDASFL
WHLATHYASV MSAEYARKLE KEDRQEQLDR QPVGTGPYQL SEYRAGQFIR LQRHDDFWRG
KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLSI LRDDPRLRLT LRPGMNVAYL
AFNTAKPPLN NPAVRHALAL AINNQRLMQS IYYGTAETAA SILPRASWAY DNEAKITEYN
PAKSREQLKA LGLENLTLKL WVPTRSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF
QEARLMDMSH DLTLSGWATD SNDPDSFFRP LLSCAAIHSQ TNLAHWCDPK FDSVLRKALS
SQQLAARIEA YDEAQSILAQ ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVYREK
QDEVKKP
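
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is an illustration, not the database's own method: it uses the codon assignments of NCBI translation table 11, which match the standard genetic code for amino acids and stop codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G at each position).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str, to_stop: bool = True) -> str:
    """Translate a nucleotide sequence in frame 1.

    Whitespace used for grouping is ignored. If to_stop is True,
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGCGCCAGG TATTATCGTC TCTTTTGGTG"))
print(translate("CAGGATGAGG TGAAAAAACC ATGA"))
```

The two fragments translate to the first ten and last seven residues of the protein sequence listed above (MRQVLSSLLV and QDEVKKP), and the final TGA codon is read as a stop.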