Gene EcHS_A0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0049 
SymbolfixC 
ID5591073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp47440 
End bp48726 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content55% 
IMG OID640919237 
Productputative oxidoreductase FixC 
Protein accessionYP_001456832 
Protein GI157159514 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAG ATATCTTTGA CGCCATCATC GTCGGTGCTG GGCTTGCCGG TTCGGTTGCC 
GCACTGGTGC TCGCCCGCGA AGGGGCGCAA GTGTTAGTTA TCGAGCGTGG CAATTCCGCA
GGTGCCAAGA ACGTCACCGG CGGGCGTCTC TATGCCCACA GCCTGGAACA CATTATTCCT
GGTTTCGCCG ACTCCGCCCC CGTAGAACGC CTGATCACCC ATGAAAAACT CGCGTTTATG
ACGGAAAAGT CAGCGATGAC TATGGACTAC TGCAATGGTG ACGAAACCTC GCCATCCCAG
CGTTCTTACT CCGTTTTGCG CAGTAAATTT GATGCCTGGC TGATGGAGCA GGCCGAAGAA
GCGGGCGCGC AGTTAATTAC CGGGATCCGC GTCGATAACC TCGTACAGCG CGATGGCAAA
GTCGTCGGTG TAGAAGCCGA TGGCGATGTG ATTGAAGCGA AAACGGTGAT CCTTGCTGAT
GGAGTGAACT CCATCCTTGC CGAAAAGCTG GGGATGGCAA AACGCGTTAA ACCGACGGAT
GTGGCGGTTG GCGTGAAGGA ACTGATCGAG TTACCGAAGT CGGTAATCGA AGACCGTTTT
CAGTTGCAGG GTAATCAGGG CGCGGCTTGT CTGTTTGCGG GATCACCCAC CGATGGCCTG
ATGGGCGGCG GCTTCCTTTA TACCAATGAA AATACCCTGT CGCTGGGGCT GGTTTGTGGT
CTGCATCATC TGCATGACGC AAAAAAATCG GTGCCGCAAA TGCTGGAAGA TTTCAAACAA
CATCCGGCCG TTGCACCGCT GATCGCGGGT GGCAAGCTGG TGGAATATTC CGCTCACGTA
GTGCCGGAAG CAGGCATCAA CATGCTGCCG GAGTTGGTTG GTGACGGCGT ATTGATTGCC
GGTGATGCCG CCGGAATGTG TATGAACCTC GGTTTTACCA TTCGCGGTAT GGATCTGGCG
ATTGCCGCCG GGGAAGCCGC AGCAAAAACC GTGCTTTCAG CGATGAAAAG CGACGATTTC
AGTAAGCAAA AACTGGCGGA ATATCGTCAG CATCTTGAGA GTGGCCCGCT GCGCGATATG
CGTATGTACC AGAAACTACC GGCCTTCCTT GATAACCCAC GCATGTTTAG CGGCTACCCG
GAACTGGCGG TGGGCGTGGC GCGTGACCTG TTCACCATTG ACGGCAGTGC GCCGGAACTG
ATGCGCAAGA AAATCCTCCG CCACGGCAAG AAAGTGGGCT TCATCAATCT GATCAAGGAT
GGCATGAAAG GAGTGACCGT TTTATGA
 
Protein sequence
MSEDIFDAII VGAGLAGSVA ALVLAREGAQ VLVIERGNSA GAKNVTGGRL YAHSLEHIIP 
GFADSAPVER LITHEKLAFM TEKSAMTMDY CNGDETSPSQ RSYSVLRSKF DAWLMEQAEE
AGAQLITGIR VDNLVQRDGK VVGVEADGDV IEAKTVILAD GVNSILAEKL GMAKRVKPTD
VAVGVKELIE LPKSVIEDRF QLQGNQGAAC LFAGSPTDGL MGGGFLYTNE NTLSLGLVCG
LHHLHDAKKS VPQMLEDFKQ HPAVAPLIAG GKLVEYSAHV VPEAGINMLP ELVGDGVLIA
GDAAGMCMNL GFTIRGMDLA IAAGEAAAKT VLSAMKSDDF SKQKLAEYRQ HLESGPLRDM
RMYQKLPAFL DNPRMFSGYP ELAVGVARDL FTIDGSAPEL MRKKILRHGK KVGFINLIKD
GMKGVTVL