Gene EcHS_A3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3194 
Symbol 
ID5593235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3203758 
End bp3205977 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content55% 
IMG OID640922312 
Producthypothetical protein 
Protein accessionYP_001459810 
Protein GI157162492 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCTA TCTCCCTGAT CCAACCGGAT CGCGACCTGT TCTCCTGGCC GCAGTACTGG 
GCCGCCTGTT TTGGACCGGC ACCGTTTTTG CCGATGTCTC GTGAAGAGAT GGATCAACTT
GGCTGGGATA GCTGCGACAT CATTTTGGTT ACTGGCGACG CGTATGTCGA TCACCCAAGC
TTCGGGATGG CGATTTGCGG TCGTATGCTG GAAGCACAGG GCTTTCGCGT CGGGATCATC
GCCCAGCCAG ACTGGAGCAG CAAAGACGAT TTCATGCGTC TGGGTAAACC GAATCTGTTT
TTCGGTGTTA CTGCTGGCAA CATGGATTCG ATGATCAACC GTTATACCGC CGATCGCCGT
TTACGTCATG ACGATGCCTA CACGCCGGAT AACGTCGCGG GTAAGCGCCC GGATCGCGCC
ACACTGGTTT ATACCCAGCG TTGTAAAGAG GCGTGGAAAG ATGTACCGGT GATCCTCGGC
GGTATTGAGG CTAGTCTGCG CCGTACCGCA CATTATGATT ACTGGTCCGA TACCGTGCGC
CGTTCCGTGC TGGTGGATTC GAAAGCCGAC ATGCTGATGT TTGGTAACGG TGAGCGTCCG
CTGGTGGAGG TGGCGCACCG TCTGGCGATG GGCGAGCCGA TTAGTGAAAT CCGCGATGTG
CGTAATACCG CGATTATCGT GAAAGAGGCG CTGCCTGGCT GGAGCGGCGT GGATTCCACC
CGTCTTGATA CCCCTGGAAA AATCGACCCA ATCCCGCATC CGTATGGTGA AGATTTGCCG
TGCGCGGATA ACAAACCGGT GGCACCGAAA AAGCAGGAAG CCAAAGCCGT AACCGTGCAG
CCACCGCGCC CGAAACCGTG GGAAAAAACC TACGTGTTGC TGCCTTCTTT CGAGAAAGTG
AAGGGCGATA AAGTGCTGTA CGCCCATGCT TCGCGTATTC TGCACCACGA AACCAACCCA
GGCTGTGCCC GCGCATTGAT GCAAAAACAC GGCGACCGCT ATGTGTGGAT CAACCCGCCT
GCTATTCCGC TTTCTACCGA AGAGATGGAC AGCGTTTTTG CGCTGCCATA CAAGCGCGTG
CCACATCCGG CCTATGGCAA TGCCCGTATT CCGGCTTACG AAATGATCCG TTTTTCGGTC
AACATTATGC GTGGCTGCTT TGGCGGCTGC TCTTTCTGTT CTATCACCGA GCACGAAGGG
CGCATTATTC AGAGCCGTTC CGAAGATTCG ATCATTAATG AGATCGAAGC GATCCGCGAC
ACCGTTCCAG GTTTTACGGG CGTGATTTCC GATCTTGGTG GGCCAACTGC CAACATGTAT
ATGTTGCGCT GCAAATCGCC ACGCGCTGAA CAAACTTGTC GCCGTTTGTC GTGCGTTTAT
CCGGATATTT GTCCGCACAT GGACACGAAC CACGAACCAA CGATCAACCT CTATCGCCGT
GCGCGTGATC TGAAAGGCAT TAAAAAGATC CTGATTGCCT CTGGTGTGCG TTATGACATA
GCCGTAGAAG ATCCGCGCTA TATCAAAGAA CTGGCGACCC ATCACGTCGG CGGTTATCTG
AAGATTGCCC CGGAACATAC CGAAGAAGGG CCGTTATCGA AGATGATGAA GCCGGGCATG
GGCAGCTATG ACCGCTTTAA AGAGCTGTTC GATACTTACT CAAAACAGGC AGGTAAAGAG
CAGTATCTGA TCCCCTATTT CATCTCCGCG CACCCCGGTA CGCGTGATGA AGATATGGTG
AATCTGGCGC TGTGGCTGAA AAAGCACCGC TTCCGCCTCG ACCAGGTGCA GAACTTCTAT
CCGTCGCCGC TGGCGAACTC AACCACCATG TATTACACCG GGAAAAACCC GCTGGCGAAG
ATTGGTTATA AGAGCGAAGA CGTCTTCGTA CCGAAGGGCG ACAAACAGCG TCGTTTGCAT
AAAGCGTTGT TGCGTTACCA CGATCCGGCA AACTGGCCGT TAATCCGCCA GGCGCTGGAA
GCGATGGGCA AAAAGCATCT GATTGGCAGC CGTCGCGATT GCTTAGTGCC TGCGCCAACC
ATTGAAGAGA TGCGTGAAGC TCGTCGCCAG AACCGCAATA CCCGTCCGGC GTTGACGAAA
CATACGCCGA TGGCGACCCA GCGTCAGACG CCTGCTACGG CAAAAAAAGC GTCGTCTACG
CAATCTCGTC CGGTGAATGC TGGTGCGAAG AAACGGCCTA AAGCGGCGGT TGGACGTTAA
 
Protein sequence
MSSISLIQPD RDLFSWPQYW AACFGPAPFL PMSREEMDQL GWDSCDIILV TGDAYVDHPS 
FGMAICGRML EAQGFRVGII AQPDWSSKDD FMRLGKPNLF FGVTAGNMDS MINRYTADRR
LRHDDAYTPD NVAGKRPDRA TLVYTQRCKE AWKDVPVILG GIEASLRRTA HYDYWSDTVR
RSVLVDSKAD MLMFGNGERP LVEVAHRLAM GEPISEIRDV RNTAIIVKEA LPGWSGVDST
RLDTPGKIDP IPHPYGEDLP CADNKPVAPK KQEAKAVTVQ PPRPKPWEKT YVLLPSFEKV
KGDKVLYAHA SRILHHETNP GCARALMQKH GDRYVWINPP AIPLSTEEMD SVFALPYKRV
PHPAYGNARI PAYEMIRFSV NIMRGCFGGC SFCSITEHEG RIIQSRSEDS IINEIEAIRD
TVPGFTGVIS DLGGPTANMY MLRCKSPRAE QTCRRLSCVY PDICPHMDTN HEPTINLYRR
ARDLKGIKKI LIASGVRYDI AVEDPRYIKE LATHHVGGYL KIAPEHTEEG PLSKMMKPGM
GSYDRFKELF DTYSKQAGKE QYLIPYFISA HPGTRDEDMV NLALWLKKHR FRLDQVQNFY
PSPLANSTTM YYTGKNPLAK IGYKSEDVFV PKGDKQRRLH KALLRYHDPA NWPLIRQALE
AMGKKHLIGS RRDCLVPAPT IEEMREARRQ NRNTRPALTK HTPMATQRQT PATAKKASST
QSRPVNAGAK KRPKAAVGR