Gene EcHS_A1430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1430 
Symbol 
ID5591894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1424282 
End bp1425337 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content55% 
IMG OID640920585 
Productgfo/idh/mocA family protein 
Protein accessionYP_001458144 
Protein GI157160826 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAGTG CAATGACAAG CTCTCCGCTG CGGGTCGCGA TAATAGGCGC AGGCCAGGTG 
GCGGATAAGG TTCATGCTTC GTACTACTGC ACCCGCAACG ATCTGGAACT GGTGGCTGTC
TGTGACAGCC GCCTTTCCCA GGCGCAGGCG CTGGCAGAAA AATACGGGAA TGCATCCGTG
TGGGACGATC CGCAGGCCAT GCTGCTGGCG GTGAAACCTG ATGTGGTTAG CGTCTGCTCA
CCTAACCGTT TTCATTACGA ACATACCCTG ATGGCACTGG AAGCGGGCTG CCATGTGATG
TGCGAAAAAC CGCCCGCCAT GACGCCAGAA CAGGCGCGGG AAATGTGCGA TACCGCGCGC
AAACTGGGCA AGGTGCTGGC CTACGACTTT CACCATCGTT TTGCGCTCGA TACGCAACAG
CTGCGTGAAC AGGTGACCAA CGGCGTTTTG GGAGAGATTT ACGTTACCAC CGCCCGCGCC
CTGCGTCGCT GCGGCGTTCC CGGCTGGGGT GTCTTTACCA ATAAAGAACT GCAGGGTGGT
GGCCCGCTGA TCGACATCGG CATTCATATG CTGGATGCTG CGATGTATGT GCTGGGCTTT
CCGGCGGTGA AAAGCGTGAA TGCGCATAGC TTTCAAAAGA TCGGCACGCA AAAGAGCTGT
GGTCAATTTG GTGAGTGGGA TCCGGCAACT TACAGCGTCG AAGATTCGCT GTTTGGCACC
ATTGAATTTC ATAACGGCGG CATTCTGTGG CTGGAAACGT CATTTGCACT CAACATCCGC
GAACAGTCGA TTATGAACGT CAGCTTTTGT GGTGATAAAG CTGGTGCGAC GCTGTTTCCA
GCACATATCT ACACCGATAA CAACGGTGAA TTAATGACGC TGATGCAACG GGAAATGGCA
GACGACAACC GCCATTTGCG CAGCATGGAA GCCTTTATCA ATCACGTACA GGGCAAGCCC
GTGATGATAG CCGACGCCGA GCAGGGGTAC ATCATCCAGC AACTGGTGGC GGCGTTGTAT
CAATCCGCAG AAACAGGGAC GCGTGTGGAA TTATGA
 
Protein sequence
MKSAMTSSPL RVAIIGAGQV ADKVHASYYC TRNDLELVAV CDSRLSQAQA LAEKYGNASV 
WDDPQAMLLA VKPDVVSVCS PNRFHYEHTL MALEAGCHVM CEKPPAMTPE QAREMCDTAR
KLGKVLAYDF HHRFALDTQQ LREQVTNGVL GEIYVTTARA LRRCGVPGWG VFTNKELQGG
GPLIDIGIHM LDAAMYVLGF PAVKSVNAHS FQKIGTQKSC GQFGEWDPAT YSVEDSLFGT
IEFHNGGILW LETSFALNIR EQSIMNVSFC GDKAGATLFP AHIYTDNNGE LMTLMQREMA
DDNRHLRSME AFINHVQGKP VMIADAEQGY IIQQLVAALY QSAETGTRVE L