Gene EcolC_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1469 
Symbol 
ID6067239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1619901 
End bp1620995 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content51% 
IMG OID641600889 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001724459 
Protein GI170019505 
COG category[R] General function prediction only 
COG ID[COG4174] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000191994 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0382129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCTT ACCTGATTCG CCGTCTGTTG CTGGTGATCC CAACATTATG GGCGATTATC 
ACCATCAACT TTTTCATCGT GCAAATTGCG CCTGGCGGTC CGGTCGATCA GGCCATCGCC
GCCATTGAGT TTGGTAATGC CGGAGTATTA CCCGGCGCAG GCGGTGAAGG TGTTCGTGCC
AGCCATGCGC AAACGGGTGT CGGCAATATC AGCGACAGTA ATTACCGTGG TGGACGCGGA
TTAGATCCAG AAGTGATCGC TGAGATCACT CATCGCTACG GTTTCGATAA GCCGATCCAC
GAACGTTACT TCAAAATGCT CTGGGACTAC ATCCGCTTTG ATTTTGGTGA TAGCCTGTTT
CGCAGCGCCT CGGTGCTGAC GCTGATTAAA GACAGTCTGC CGGTTTCCAT CACCCTCGGA
TTGTGGAGCA CGCTGATTAT CTATCTGGTG TCGATTCCGT TAGGCATTCG CAAAGCTGTT
TATAATGGGA GCCGCTTTGA CGTCTGGAGT AGCGCATTTA TCATCATCGG CTACGCCATT
CCGGCCTTTT TGTTTGCCAT CCTGCTGATT GTCTTCTTCG CGGGCGGCAG CTATTTCGAC
CTGTTCCCTC TACGCGGCCT GGTTTCCGCT AACTTTGATT CGCTGCCGTG GTATCAGAAA
ATCACCGATT ATCTGTGGCA TATCACGCTG CCGGTGCTGG CGACAGTGAT TGGTGGCTTT
GCGGCGCTGA CCATGCTGAC AAAAAACTCA TTCCTTGATG AAGTGCGTAA GCAATACGTG
GTGACCGCCC GAGCGAAAGG GGTAAGTGAA AAAAATATTC TCTGGAAACA TGTGTTCCGC
AACGCCATGC TGCTGGTGAT TGCCGGTTTT CCGGCGACGT TTATCAGCAT GTTTTTTACC
GGCTCGCTGC TGATTGAGGT GATGTTTTCA CTCAATGGTC TTGGCTTACT GGGCTACGAA
ACGACCGTCT CGCGCGATTA TCCTGTAATG TTTGGTACCT TGTATATTTT CACCCTGATT
GGCCTGCTGC TGAATATTGT CAGTGATATC AGCTATACGC TGGTTGATCC GCGTATAGAT
TTTGAGGGAC GTTAA
 
Protein sequence
MGAYLIRRLL LVIPTLWAII TINFFIVQIA PGGPVDQAIA AIEFGNAGVL PGAGGEGVRA 
SHAQTGVGNI SDSNYRGGRG LDPEVIAEIT HRYGFDKPIH ERYFKMLWDY IRFDFGDSLF
RSASVLTLIK DSLPVSITLG LWSTLIIYLV SIPLGIRKAV YNGSRFDVWS SAFIIIGYAI
PAFLFAILLI VFFAGGSYFD LFPLRGLVSA NFDSLPWYQK ITDYLWHITL PVLATVIGGF
AALTMLTKNS FLDEVRKQYV VTARAKGVSE KNILWKHVFR NAMLLVIAGF PATFISMFFT
GSLLIEVMFS LNGLGLLGYE TTVSRDYPVM FGTLYIFTLI GLLLNIVSDI SYTLVDPRID
FEGR