Gene EcolC_1832 details

Gene Information

Locus tag: EcolC_1832
Symbol:
ID: 6065852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2032055
End bp: 2033500
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 51%
IMG OID: 641601246
Product: putative transporter
Protein accession: YP_001724808
Protein GI: 170019854
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
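
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2032055, 2033500

length = gene_length(start_bp, end_bp)
print(length)                  # 1446, matching "Gene Length"
print(protein_length(length))  # 481, matching "Protein Length"
```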


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.301874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTTTTGC TGGCAATGGG ACTGGTGATT TATTTAGCCA CCAGTAAATA CGGCAATATT 
CGTCTTGGCG AAGGAAAACC GGAATACAGC ACGCTCTCCT GGCTGTTTAT GTTTATTTGT
GCCGGTTTAG GTTCTTCTAC GCTTTATTGG GGGGTTGCTG AATGGGCCTA TTATTATCAA
ACGCCTGGAT TAAATATCGC ACCGCGTTCA CAACAGGCAC TCGAATTTAG CGTTCCCTAC
TCTTTCTTCC ACTGGGGCAT CAGCGCCTGG GCAACTTATA CGCTGGCCTC ATTAATCATG
GCTTATCACT TTCATGTGCG GAAAAACAAA GGTCTGAGCC TTTCCGGCAT TATTGCTGCT
ATTACCGGCG TTCGCCCGCA AGGCCCATGG GGAAAACTGG TCGATTTGAT GTTCCTGATC
GCCACTGTCG GCGCACTGAC CATTTCCCTT GTTGTTACCG CAGCAACCTT TACTCGTGGA
CTTTCCGCGC TGACCGGTTT ACCCGATAAC TTCACCGTGC AGGCATTTGT GATCCTGCTT
TCCGGCGGCA TTTTTTGCCT AAGCTCGTGG ATTGGTATCA ACAACGGTTT GCAACGTCTG
AGCAAAATGG TTGGCTGGGG CGCGTTCCTG CTGCCATTAC TGGTGCTGAT TGTCGGCCCA
ACCGAATTTA TTACCAACAG CATCATCAAT GCCATCGGCC TGACCACGCA AAACTTCCTG
CAAATGAGCT TATTCACCGA TCCGCTTGGC GATGGTTCAT TTACCCGCAA CTGGACCGTT
TTCTACTGGC TGTGGTGGAT CTCATACACC CCTGGCGTAG CAATGTTTGT CACCCGCGTT
TCCCGCGGTC GTAAGATTAA AGAAGTTATC TGGGGACTGA TCCTCGGCAG CACCGTCGGT
TGCTGGTTCT TCTTTGGCGT AATGGAAAGC TATGCCATTC ATCAGTTTAT CAATGGCGTA
ATCAACGTCC CACAGGTGCT GGAAACACTG GGCGGCGAAA CAGCTGTGCA GCAAGTTCTG
ATGTCGTTGC CAGCCGGTAA ATTGTTCCTC GCCGCATACC TGGGCGTGAT GATTATTTTC
CTTGCCTCGC ATATGGATGC GGTGGCCTAC ACCATGGCTG CGACCAGTAC GCGTAATCTC
CAGGAAGGTG ACGATCCTGA CCGTGGGCTG CGTCTTTTCT GGTGCGTGGT GATCACTCTG
ATCCCGCTTT CCATCTTGTT TACCGGTGCT TCGCTGGAAA CGATGAAAAC CACCGTCGTG
CTCACAGCCC TTCCCTTCCT CGTCATTTTA CTGGTGAAAG TCGGCGGATT TATTCGCTGG
CTGAAACAGG ATTACGCCGA CATTCCGGCT CATCAAGTTG AACATTATCT CCCGCAGACA
CCGGTTGAAG CCCTGGAAAA AACGCCAGTG CTCCCTGCGG GAACCGTATT CAAAGGCGAC
AACTGA
 
Protein sequence
MVLLAMGLVI YLATSKYGNI RLGEGKPEYS TLSWLFMFIC AGLGSSTLYW GVAEWAYYYQ 
TPGLNIAPRS QQALEFSVPY SFFHWGISAW ATYTLASLIM AYHFHVRKNK GLSLSGIIAA
ITGVRPQGPW GKLVDLMFLI ATVGALTISL VVTAATFTRG LSALTGLPDN FTVQAFVILL
SGGIFCLSSW IGINNGLQRL SKMVGWGAFL LPLLVLIVGP TEFITNSIIN AIGLTTQNFL
QMSLFTDPLG DGSFTRNWTV FYWLWWISYT PGVAMFVTRV SRGRKIKEVI WGLILGSTVG
CWFFFGVMES YAIHQFINGV INVPQVLETL GGETAVQQVL MSLPAGKLFL AAYLGVMIIF
LASHMDAVAY TMAATSTRNL QEGDDPDRGL RLFWCVVITL IPLSILFTGA SLETMKTTVV
LTALPFLVIL LVKVGGFIRW LKQDYADIPA HQVEHYLPQT PVEALEKTPV LPAGTVFKGD
N
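
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of doing that and computing a GC fraction, the quantity reported as "GC content" in the record. The helper names clean_sequence and gc_fraction are hypothetical, and only the first row of the gene sequence is used here for brevity, so the printed value describes that row rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block gaps, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "TTGGTTTTGC TGGCAATGGG ACTGGTGATT TATTTAGCCA CCAGTAAATA CGGCAATATT"

seq = clean_sequence(first_row)
print(len(seq))                    # number of bases in this row
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

To reproduce the record's overall figure, paste every row of the gene sequence into a single string and pass it through the same two functions.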