Gene EcHS_A1848 details


Gene Information

Locus tag: EcHS_A1848
Symbol: selD
ID: 5591032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1865047
End bp: 1866090
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 55%
IMG OID: 640920992
Product: selenophosphate synthetase
Protein accession: YP_001458544
Protein GI: 157161226
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase
TIGRFAM ID: [TIGR00476] selenium donor protein
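
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1865047
end_bp = 1866090

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1044 bp
print(protein_length, "aa")  # 347 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.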


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.000400465
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGA ACTCGATTCG TTTGACCCAA TACAGCCACG GAGCTGGTTG CGGCTGTAAA 
ATTTCCCCAA AAGTGTTGGA AACCATCCTG CACAGTGAGC AGGCGAAGTT TGTTGATCCG
AATTTGCTTG TGGGTAATGA AACCCGCGAC GATGCGGCGG TGTACGATCT GGGCAATGGC
ACCAGCGTTA TCAGTACCAC CGACTTCTTT ATGCCGATCG TTGATAATCC TTTCGATTTT
GGCCGCATTG CGGCGACTAA CGCCATCAGC GATATCTTCG CGATGGGGGG CAAACCGATT
ATGGCGATTG CGATCCTCGG CTGGCCGATT AACAAACTTT CCCCGGAAAT TGCCCGCGAA
GTGACCGAAG GTGGACGCTA TGCATGTCGT CAGGCGGGTA TTGCGCTGGC TGGCGGTCAC
TCCATCGATG CGCCGGAGCC GATTTTTGGT CTGGCGGTAA CGGGGATCGT ACCGACCGAG
CGGGTGAAGA AAAACAGTAC CGCACAAGCC GGATGCAAAC TGTTCCTGAC GAAACCGCTG
GGGATCGGCG TTCTTACCAC GGCTGAGAAA AAATCACTGT TGAAACCAGA ACATCAGGGA
CTGGCGACGG AAGTGATGTG CCGGATGAAC ATCGCAGGCG CGTCCTTTGC CAACATCGAA
GGCGTAAAAG CGATGACCGA CGTTACGGGC TTTGGTCTGC TGGGCCACTT GAGCGAAATG
TGTCAGGGGG CTGGTGTGCA GGCACGCGTC GACTATGAAG CGATCCCGAA ACTCCCCGGT
GTTGAAGAGT ACATTAAGTT GGGCGCAGTA CCTGGCGGCA CTGAACGTAA CTTTGCCAGC
TACGGTCATC TGATGGGTGA AATGCCGCGT GAAGTGCGCG ATCTGCTGTG CGATCCGCAA
ACTTCTGGCG GTTTGCTGCT GGCGGTCATG CCGGAAGCAG AAAATGAGGT CAAAGCTACA
GCCGCCGAGT TTGGCATTGA ACTGACGGCA ATTGGCGAAC TGGTGCCAGC GCGCGGCGGT
CGTGCCATGG TTGAGATTCG TTAA
 
Protein sequence
MSENSIRLTQ YSHGAGCGCK ISPKVLETIL HSEQAKFVDP NLLVGNETRD DAAVYDLGNG 
TSVISTTDFF MPIVDNPFDF GRIAATNAIS DIFAMGGKPI MAIAILGWPI NKLSPEIARE
VTEGGRYACR QAGIALAGGH SIDAPEPIFG LAVTGIVPTE RVKKNSTAQA GCKLFLTKPL
GIGVLTTAEK KSLLKPEHQG LATEVMCRMN IAGASFANIE GVKAMTDVTG FGLLGHLSEM
CQGAGVQARV DYEAIPKLPG VEEYIKLGAV PGGTERNFAS YGHLMGEMPR EVRDLLCDPQ
TSGGLLLAVM PEAENEVKAT AAEFGIELTA IGELVPARGG RAMVEIR
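
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a short script. What follows is a sketch rather than the method used by the database: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares with the standard code for internal codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 differs from the standard code only in its
# permitted start codons, so these assignments apply here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, pasted as displayed.
first_line = "ATGAGCGAGA ACTCGATTCG TTTGACCCAA TACAGCCACG GAGCTGGTTG CGGCTGTAAA"
print(translate(first_line))  # MSENSIRLTQYSHGAGCGCK
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` should allow the same comparison against the complete protein sequence and the GC content field.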