Gene EcolC_1516 details


Gene Information

Locus tag: EcolC_1516
Symbol:
ID: 6066985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1675513
End bp: 1676430
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 53%
IMG OID: 641600935
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001724505
Protein GI: 170019551
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein)
TIGRFAM ID:
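
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1675513
end_bp = 1676430
gene_length_bp = 918
protein_length_aa = 305

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, so the number of
# encoded amino acids is the codon count minus one.
codons = gene_length_bp // 3
encoded_aa = codons - 1

print(span_bp == gene_length_bp)        # coordinate span vs. reported gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(encoded_aa == protein_length_aa)  # codons minus stop vs. protein length
```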


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTCT CAAAGGTCTG GGCAGGTTCA CTGGTTTTGT TGGCAGCCGT GAGCCTGCCG 
CTGCACGCGG CTTCCCCCGT TAAAGTCGGT TCAAAAATCG ATACCGAAGG CGCGCTGCTC
GGCAATATCA TTTTGCAAGT ACTCGAAAGC CACGGCGTAC CAACGGTCAA TAAAGTGCAA
CTTGGAACGA CTCCTGTGGT GCGCGGGGCG ATTACTTCCG GTGAACTGGA TATCTATCCG
GAATATACCG GCAATGGCGC TTTCTTCTTT AAAGATGAAA ACGATGCGGC ATGGAAAAAC
GCGCAGCAAG GTTACGAGAA AGTCAAAAAA CTCGATGCAG AGCAAAACAA GTTAATCTGG
CTGACGCCCG CACCAGCGAA TAACACCTGG ACCATCGCCG TGCGTCAGGA TGTGGCAGAG
AAAAACAAAC TCACTTCGCT TGCTGACCTG AGTCGTTATC TGAAAGAGGG CGGCACCTTC
AAACTGGCAG CCTCGGCAGA GTTTATCGAA CGCGCCGATG CGTTACCCGC GTTTGAAAAA
GCCTACGACT TTAAACTCGA TCAGGATCAG TTACTGTCAC TGGCTGGCGG CGACACGGCG
GTAACGATTA AAGCCGCTGC CCAGCAAACT TCTGGCGTTA ATGCCGCAAT GGCTTACGGC
ACTGACGGTC CGGTCGCGGC GCTGGGGCTG CAAACCTTAA GCGATCCGCA AGGCGTGCAA
CCTATCTACG CGCCTGCACC AGTGGTGCGT GAGTCGGTGC TGAAAGAGTA TCCGCAAATG
GCACAGTGGC TACAGCCAGT CTTCGCCAGC CTCGATGCAA AAACATTGCA GCAACTGAAT
GCCAGCATTG CAGTGGAAGG ACTGGATGCC AAAAAAGTGG CTGCCGACTA CCTGAAACAA
AAAGGGTGGA CGAAGTAA
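
The reported gene length and GC content could be recomputed from the sequence text. This is a hedged sketch rather than the method used by the database: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as laid out in the record.
first_line = "ATGCCACTCT CAAAGGTCTG GGCAGGTTCA CTGGTTTTGT TGGCAGCCGT GAGCCTGCCG"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases on this line
print(round(100 * gc_fraction(seq)))   # GC percentage for this line only
```

Passing the full gene sequence instead of `first_line` would give values to compare against the Gene Length and GC content fields.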
 
Protein sequence
MPLSKVWAGS LVLLAAVSLP LHAASPVKVG SKIDTEGALL GNIILQVLES HGVPTVNKVQ 
LGTTPVVRGA ITSGELDIYP EYTGNGAFFF KDENDAAWKN AQQGYEKVKK LDAEQNKLIW
LTPAPANNTW TIAVRQDVAE KNKLTSLADL SRYLKEGGTF KLAASAEFIE RADALPAFEK
AYDFKLDQDQ LLSLAGGDTA VTIKAAAQQT SGVNAAMAYG TDGPVAALGL QTLSDPQGVQ
PIYAPAPVVR ESVLKEYPQM AQWLQPVFAS LDAKTLQQLN ASIAVEGLDA KKVAADYLKQ
KGWTK
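
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as starts). The function name `translate` is hypothetical, and alternative start codons such as GTG or TTG are not handled; this gene begins with ATG, so that does not matter here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate DNA text (spaces and newlines allowed) up to the first stop."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCACTCT CAAAGGTCTG G"))  # first seven codons -> MPLSKVW
print(translate("AAAGGGTGGA CGAAGTAA"))      # last line of the gene -> KGWTK
```

The two fragments reproduce the first seven and last five residues of the protein sequence listed in this record.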