Gene EcolC_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1517 
Symbol 
ID6066963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1676437 
End bp1677594 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID641600936 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001724506 
Protein GI170019552 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.89368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTATC TACGTATTAA TCCTGTTCTG GCGCTGCTGC TGTTGCTGAC GGCAATCGCA 
GCGGCGCTGC CGTTTATCAG TTACGCGCCT AATCGTTTAG TCTCGGGTGA AGGGCGTCAT
CTCTGGCAAC TGTGGCCGCA AACGATCTGG ATGCTGGTGG GCGTTGGTTG CGCCTGGCTG
ACAGCCTGTT TTATTCCCGC TAAAAAAGGC AGCATTTTTG CACTCATTCT GGCGCAATTC
GTCTTCGTAT TGCTGGTGTG GGGAGCAGGA AAGGCGGCGA CCCAACTGGC GCAAAATGGC
AGTGCGCTGG CGCGTACCAG CCTCGGCAGT GGTTTCTGGC TGGCTGCGGC GCTGACATTG
CTGGCCTGTA GCGATGCCAT CCGCCGAATC TCCACGCATC CGCTGTGGCG CTGGTTGTTG
CATATGCAGA TTGCCATTAT TCCGCTGTGG TTGCTGTACT CCGGCACGCT TAACGATCTC
TCACTAATGA AAGAATACGC CAACCGTCAG GATGTGTTTG ACGACGCGCT GGCACAGCAT
TTGACGTTGC TGTTTGGTGC GGTGCTGCCT GCGTTAGTGA TTGGTGTGCC GTTGGGCATC
TGGTGCTACT TTTCCACTGC GCGGCAGGGG GCAATTTTTT CTCTGCTCAA TGTCATTCAG
ACCGTGCCTT CGGTGGCGCT CTTTGGCCTG TTGATTGCGC CGCTTGCCGC GCTGGTTACG
GCCTTTCCGT GGCTGGGGAA GCTCGGCATA GCAGGAACCG GAATGACACC CGCACTGATT
GCGCTGGTGC TCTATGCCTT GCTGCCGCTG GTGCGCGGCG TGGTAGTCGG TTTGAACCAG
ATCCCGCGCG ATGTGCTGGA GAGCGCCAGA GCGATGGGGA TGAGCGGGGC GCAGCGATTC
CTGCATGTTC AGTTACCACT GGCGTTACCG GTATTTCTGC GCAGCCTGCG GGTGGTGATG
GTGCAAACTG TAGGTATGGC GGTGATTGCG GCGTTAATCG GCGCAGGCGG TTTTGGTGCG
CTGGTTTTCC AGGGGCTGCT AAGCAGCGCC ATTGATTTAG TGTTGCTGGG GGTGATCCCG
GTAATTGTTC TGGCGGTGCT TACCGACGCG CTGTTCGATT TGCTTATCGC ACTGCTGAAG
GTGAAACGTA ATGATTGA
 
Protein sequence
MTYLRINPVL ALLLLLTAIA AALPFISYAP NRLVSGEGRH LWQLWPQTIW MLVGVGCAWL 
TACFIPAKKG SIFALILAQF VFVLLVWGAG KAATQLAQNG SALARTSLGS GFWLAAALTL
LACSDAIRRI STHPLWRWLL HMQIAIIPLW LLYSGTLNDL SLMKEYANRQ DVFDDALAQH
LTLLFGAVLP ALVIGVPLGI WCYFSTARQG AIFSLLNVIQ TVPSVALFGL LIAPLAALVT
AFPWLGKLGI AGTGMTPALI ALVLYALLPL VRGVVVGLNQ IPRDVLESAR AMGMSGAQRF
LHVQLPLALP VFLRSLRVVM VQTVGMAVIA ALIGAGGFGA LVFQGLLSSA IDLVLLGVIP
VIVLAVLTDA LFDLLIALLK VKRND