Gene EcE24377A_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2202 
Symbol 
ID5586868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2168944 
End bp2169948 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content53% 
IMG OID640925871 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_001463271 
Protein GI157157814 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000714656 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AGCAATTTTT AAAAGAATCA GATGTTACGG CCGAGTCGGT ATTCTTTATG 
AAGCGTCGAC AGGTGTTAAA AGCACTGGGC ATCAGCGCAG CTGCACTTTC TTTGCCTCAC
GCTGCGCATG CCGATCTGCT TAGCTGGTTT AAAGGGAACG ATCGCCCGCC CGCCCCCGCC
GGAAAACCGC TGGAGTTCAG CAAGCCTGCC GCCTGGCAAA ATAACCTGCC ACTGACGCCA
GTAGATAAAG TCTCCGGTTA TAACAACTTC TATGAATTCG GGCTGGATAA AGCCGATCCC
GCCGCTAATG CTGGTAGCCT GAAAACCGAT CCATGGACAC TGAAAATCAG CGGCGAAGTG
GCAAAACCAT TGACCCTCGA TCACGATGAT TTAACCCGTC GCTTCCCGCT GGAAGAGCGT
ATTTATCGTA TGCGCTGCGT GGAGGCATGG TCGATGGTGG TGCCGTGGAT TGGTTTTCCG
CTGCACAAAT TGCTGGCGCT TGCCGAACCC ACCAGCAATG CGAAGTATGT CGCTTTCGAA
ACAATTTATG CACCGGAACA AATGCCAGGC CAGCAGGACC GCTTTATCGG CGGCGGGCTG
AAATATCCTT ATGTCGAAGG ATTGCGTCTC GACGAAGCAA TGCATCCGCT CACACTGATG
ACCGTGGGTG TTTATGGCAA GGCGTTACCG CCACAAAATG GCGCGCCGGT ACGACTGATT
GTGCCGTGGA AATATGGCTT TAAAGGGATT AAATCGATAG TCAGTATTAA GCTGACCCGC
GAGCGTCCGC CAACCACCTG GAATCTGGCA GCGCCTGACG AATACGGTTT TTACGCCAAC
GTTAATCCGC ATGTTGATCA CCCGCGCTGG TCACAGGCTA CCGAACGATT TATTGGTTCA
GGCGGCATCC TCGATGTTCA GCGCCAGCCA ACGCTACTGT TTAATGGTTA CGCCGACCAG
GTGGCATCGC TGTATCGTGG CCTGGATTTG CGGGAGAATT TCTGA
 
Protein sequence
MKKKQFLKES DVTAESVFFM KRRQVLKALG ISAAALSLPH AAHADLLSWF KGNDRPPAPA 
GKPLEFSKPA AWQNNLPLTP VDKVSGYNNF YEFGLDKADP AANAGSLKTD PWTLKISGEV
AKPLTLDHDD LTRRFPLEER IYRMRCVEAW SMVVPWIGFP LHKLLALAEP TSNAKYVAFE
TIYAPEQMPG QQDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPVRLI
VPWKYGFKGI KSIVSIKLTR ERPPTTWNLA APDEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYADQ VASLYRGLDL RENF