Gene EcolC_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1842 
Symbol 
ID6066494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2039834 
End bp2041015 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID641601256 
Productcyanate transporter 
Protein accessionYP_001724818 
Protein GI170019864 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2807] Cyanate permease 
TIGRFAM ID[TIGR00896] cyanate transporter 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.233357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGTT CAACTTCATT AAGCGGCAAA AACAGGATTG TCCTTATCGC TGGCATTCTG 
ATGATTGCCA CAACATTACG CGTCACCTTT ACCGGCGCAG CACCGTTACT GGATACGATT
CGTTCCGCTT ACTCGCTGAC GACAGCGCAA ACCGGCTTAT TGACCACCCT GCCATTATTG
GCCTTTGCGC TAATCTCACC TTTGGCTGCC CCGGTGGCGC GACGTTTTGG TATGGAACGT
AGCCTGTTTG CCGCGTTACT TTTGATCTGT GCAGGTATCG CAATTCGCTC TCTCCCTTCG
CCTTACTTAT TATTTGGCGG TACAGCGGTC ATTGGCGGTG GGATTGCATT AGGCAATGTC
TTACTGCCAG GATTAATTAA ACGCGATTTC CCTCATTCCG TCGCCAGACT TACCGGCGCA
TATTCCCTGA CAATGGGAGC TGCAGCGGCA CTGGGATCGG CTATGGTCGT GCCGCTGGCT
TTGAACGGTT TTGGCTGGCA AGGCGCGTTG CTCATGCTGA TGTGTTTTCC TCTGCTGGCT
CTTTTTTTAT GGCTGCCACA GTGGCGAAGT CAACAACATG CAAATTTGAG TACCTCGCGC
GCCTTACATA CTCGGGGTAT CTGGCGTTCG CCGCTTGCCT GGCAGGTCAC ATTGTTTCTT
GGGATCAACT CACTGGTCTA TTACGTGATT ATTGGCTGGC TTCCGGCGAT CCTCATCAGT
CACGGCTATA GCGAAGCACA GGCGGGTTCA CTGCATGGTT TGCTGCAACT AGCCACAGCA
GCACCCGGTT TGCTGATCCC ACTTTTCTTA CATCATGTGA AAGATCAGCG TGGTATTGCA
GCGTTCGTTG CCTTGATGTG CGCAGTGGGC GCGGTTGGGC TCTGCTTTAT GCCAGCGCAC
GCGATCACCT GGACTCTGCT TTTCGGTTTT GGTTCCGGCG CAACAATGAT ACTGGGGTTG
ACGTTCATTG GTCTGCGGGC TAGTTCTGCG CATCAGGCGG CGGCACTCTC GGGGATGGCA
CAATCCGTCG GGTATTTGTT GGCAGCCTGT GGGCCGCCGC TGATGGGTAA AATACACGAT
GCTAACGGTA ACTGGTCTGT ACCACTTATG GGTGTTGCCA TACTTTCACT ACTGATGGCG
ATTTTCGGAC TTTGCGCCGG GAGAGACAAA GAAATTCGCT AA
 
Protein sequence
MTCSTSLSGK NRIVLIAGIL MIATTLRVTF TGAAPLLDTI RSAYSLTTAQ TGLLTTLPLL 
AFALISPLAA PVARRFGMER SLFAALLLIC AGIAIRSLPS PYLLFGGTAV IGGGIALGNV
LLPGLIKRDF PHSVARLTGA YSLTMGAAAA LGSAMVVPLA LNGFGWQGAL LMLMCFPLLA
LFLWLPQWRS QQHANLSTSR ALHTRGIWRS PLAWQVTLFL GINSLVYYVI IGWLPAILIS
HGYSEAQAGS LHGLLQLATA APGLLIPLFL HHVKDQRGIA AFVALMCAVG AVGLCFMPAH
AITWTLLFGF GSGATMILGL TFIGLRASSA HQAAALSGMA QSVGYLLAAC GPPLMGKIHD
ANGNWSVPLM GVAILSLLMA IFGLCAGRDK EIR