Gene EcolC_2219 details


Gene Information

Locus tag: EcolC_2219
Symbol:
ID: 6064924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2438542
End bp: 2439687
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 53%
IMG OID: 641601625
Product: extracellular solute-binding protein
Protein accession: YP_001725184
Protein GI: 170020230
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
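
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and only illustrate the calculation.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_gene(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(2438542, 2439687)
print(length_bp)                            # 1146
print(protein_length_from_gene(length_bp))  # 381
```

Both values match the Gene Length and Protein Length fields of this record.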


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA CATTTGCCCG CAGCAGCCTG TGTGCGCTCA GCATGACAAT AATGACCGCT 
CACGCCGCCG AACCGCCTAC CAATTTAGAT AAACCGGAAG GGCGACTGGA TATTATCGCC
TGGCCGGGAT ACATCGAACG CGGACAAACT GATAAACAAT ACGACTGGGT AACGCAGTTC
GAAAAAGAGA CAGGCTGCGC GGTGAATGTG AAAACCGCCG CGACTTCCGA TGAAATGGTC
AGTCTGATGA CCAAAGGGGG TTACGATCTG GTTACGGCAT CCGGCGATGC CTCGCTGCGT
TTGATTATGG GTAAACGCGT GCAGCCGATT AATACCGCAT TGATTCCCAA CTGGAAAACG
CTCGATCCGC GCGTGGTTAA AGGCGACTGG TTTAATGTTG GCGGCAAAGT TTACGGCACA
CCTTACCAAT GGGGGCCGAA CCTGCTGATG TACAACACTA AAACCTTCCC GACGCCGCCG
GATAGCTGGC AAGTGGTTTT TGTTGAGCAA AATCTGCCGG ACGGCAAGAG CAATAAAGGC
CGCGTTCAGG CTTATGATGG CCCTATCTAC ATTGCGGACG CTGCGTTGTT CGTTAAAGCC
ACTCAGCCGC AGTTGGGCAT CAGCGATCCG TATCAACTCA CCGAAGAACA GTACCAGGCG
GTGCTGAAAG TGCTGCGCGC TCAACACAGT TTGATCCATC GCTACTGGCA TGACACTACC
GTGCAAATGA GCGATTTCAA AAACGAGGGT GTGGTTGCTT CCAGTGCCTG GCCCTATCAG
GCCAACGCCC TGAAAGCCGA AGGCCAGCCT GTTGCTACCG TTTTCCCGAA GGAGGGTGTT
ACCGGTTGGG CTGATACCAC CATGCTGCAT AGCGAAACGA AACATCCGGT TTGCGCCTAC
AAATGGATGA ACTGGTCATT AACGCCAAAA GTGCAGGGCG ATGTGGCGGC CTGGTTTGGC
TCGTTACCGG TAGTGCCGGA AGGGTGTAAA GCCAGTCCGT TATTAGGCGA AAAAGGTTGT
GAAACCAACG GTTTTAACTA TTTCGACAAA ATCGCCTTCT GGAAAACGCC TATAGCAGAA
GGGGGCAAGT TTGTTCCCTA CAGTCGCTGG ACGCAGGATT ACATTGCCAT TATGGGCGGT
CGCTAA
 
Protein sequence
MSKTFARSSL CALSMTIMTA HAAEPPTNLD KPEGRLDIIA WPGYIERGQT DKQYDWVTQF 
EKETGCAVNV KTAATSDEMV SLMTKGGYDL VTASGDASLR LIMGKRVQPI NTALIPNWKT
LDPRVVKGDW FNVGGKVYGT PYQWGPNLLM YNTKTFPTPP DSWQVVFVEQ NLPDGKSNKG
RVQAYDGPIY IADAALFVKA TQPQLGISDP YQLTEEQYQA VLKVLRAQHS LIHRYWHDTT
VQMSDFKNEG VVASSAWPYQ ANALKAEGQP VATVFPKEGV TGWADTTMLH SETKHPVCAY
KWMNWSLTPK VQGDVAAWFG SLPVVPEGCK ASPLLGEKGC ETNGFNYFDK IAFWKTPIAE
GGKFVPYSRW TQDYIAIMGG R
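
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch rather than the database's own method: it strips the whitespace used to group the sequence into blocks of ten, computes the GC fraction, and translates codon by codon up to the first stop. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). All function names are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCAAGA CATTTGCCCG CAGCAGCCTG"
print(translate(first_blocks))  # MSKTFARSSL
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.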