Gene EcolC_1868 details


Gene Information

Locus tag: EcolC_1868
Symbol:
ID: 6064451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2068682
End bp: 2069725
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 55%
IMG OID: 641601281
Product: selenophosphate synthetase
Protein accession: YP_001724843
Protein GI: 170019889
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase
TIGRFAM ID: [TIGR00476] selenium donor protein
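
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2068682
END_BP = 2069725


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1044 347
```

Both results match the Gene Length and Protein Length fields of the record.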


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000994537
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000520996
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGAGA ACTCGATTCG TTTGACCCAA TACAGCCACG GAGCTGGTTG CGGCTGTAAA 
ATTTCCCCAA AAGTGTTGGA AACCATCCTG CACAGTGAGC AGGCGAAGTT TGTTGATCCG
AATTTGCTTG TGGGTAATGA AACCCGCGAC GATGCGGCGG TGTACGATCT GGGCAATGGC
ACCAGCGTTA TCAGTACCAC CGACTTCTTT ATGCCGATCG TTGATAATCC TTTCGATTTT
GGCCGCATTG CGGCGACTAA CGCCATCAGC GATATCTTCG CGATGGGGGG CAAACCGATT
ATGGCGATTG CGATCCTCGG CTGGCCGATT AACAAACTTT CCCCGGAAAT TGCCCGCGAA
GTGACCGAAG GTGGACGCTA TGCATGTCGT CAGGCGGGTA TTGCGCTGGC TGGCGGTCAC
TCCATCGATG CGCCGGAGCC GATTTTTGGT CTGGCGGTAA CGGGGAGCGT ACCGACCGAG
CGGGTGAAGA AAAACAGTAC CGCACAAGCC GGATGCAAAC TGTTCCTGAC GAAACCGCTG
GGGATCGGCG TTCTTACCAC GGCTGAGAAA AAATCACTGT TGAAACCAGA ACATCAGGGA
CTGGCGACGG AAGTGATGTG CCGGATGAAC ATCGCAGGCG CGTCCTTTGC CAACATCGAA
GGCGTAAAAG CGATGACCGA CGTTACGGGC TTTGGTCTGC TGGGCCACTT GAGCGAAATG
TGTCAGGGGG CTGGTGTGCA GGCACGCGTC GACTATGAAG CGATCCCGAA ACTCCCCGGT
GTTGAAGAGT ACATTAAGTT GGGCGCAGTA CCTGGCGGCA CTGAACGTAA CTTTGCCAGC
TACGGTCATC TGATGGGTGA AATGCCGCGT GAAGTGCGCG ATCTGCTGTG CGATCCGCAA
ACATCTGGCG GTTTGCTGCT GGCGGTCATG CCGGAAGCAG AAAATGAGGT CAAAGCTACA
GCCGCCGAGT TTGGCATTGA ACTGACGGCA ATTGGCGAAC TGGTGCCAGC GCGCGGCGGT
CGTGCCATGG TTGAGATTCG TTAA
 
Protein sequence
MSENSIRLTQ YSHGAGCGCK ISPKVLETIL HSEQAKFVDP NLLVGNETRD DAAVYDLGNG 
TSVISTTDFF MPIVDNPFDF GRIAATNAIS DIFAMGGKPI MAIAILGWPI NKLSPEIARE
VTEGGRYACR QAGIALAGGH SIDAPEPIFG LAVTGSVPTE RVKKNSTAQA GCKLFLTKPL
GIGVLTTAEK KSLLKPEHQG LATEVMCRMN IAGASFANIE GVKAMTDVTG FGLLGHLSEM
CQGAGVQARV DYEAIPKLPG VEEYIKLGAV PGGTERNFAS YGHLMGEMPR EVRDLLCDPQ
TSGGLLLAVM PEAENEVKAT AAEFGIELTA IGELVPARGG RAMVEIR
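
The sequences above are printed in blocks of ten letters, so they need to be cleaned before any computation such as GC content. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage describes that line rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGAGCGAGA ACTCGATTCG TTTGACCCAA TACAGCCACG GAGCTGGTTG CGGCTGTAAA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` in the same way gives a value that can be compared with the GC content field of the record.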