Gene EcHS_A3195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3195 
SymbolsufI 
ID5593236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3206088 
End bp3207500 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content56% 
IMG OID640922313 
Productrepressor protein for FtsI 
Protein accessionYP_001459811 
Protein GI157162493 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTCA GTCGGCGTCA GTTCATTCAG GCATCGGGGA TTGCACTTTG TGCAGGCGCT 
GTTCCCCTGA AGGCCAGCGC AGCCGGGCAA CAGCAACCGC TACCCGTTCC GCCGCTACTT
GAATCTCGCC GTGGGCAACC GCTGTTTATG ACTGTACAAC GTGCGCACTG GTCATTTACG
CCAGGGACAC GCGCGTCGGT CTGGGGAATC AATGGTCGTT ACCTGGGGCC GACTATCCGC
GTCTGGAAGG GCGACGATGT TAAGCTTATT TACAGCAACC GCCTGACAGA AAATGTCTCA
ATGACGGTGG CCGGGCTACA GGTACCAGGC CCGCTGATGG GCGGTCCGGC ACGGATGATG
TCGCCAAACG CTGACTGGGC ACCCGTACTG CCCATTCGCC AGAACGCAGC TACTCTGTGG
TATCACGCCA ATACTCCCAA CCGCACGGCT CAGCAGGTCT ATAACGGCCT TGCCGGAATG
TGGCTGGTGG AAGATGAAGT CAGCAAGTCG CTGCCTATCC CCAACCATTA TGGTGTGGAT
GATTTTCCGG TCATTATCCA GGATAAACGG CTGGATAACT TTGGTACGCC AGAATACAAC
GAACCGGGAA GCGGCGGCTT TGTTGGTGAT ACGCTGCTGG TTAACGGTGT ACAAAGCCCG
TACGTTGAAG TCTCGCGTGG CTGGGTGCGC TTGCGACTGC TGAACGCGTC GAACTCTCGT
CGCTATCAAC TACAGATGAA CGATGGTCGC CCGTTACATG TGATTTCTGG CGATCAGGGA
TTCCTGCCTG CTCCTGTATC GGTGAAGCAA CTTTCGCTGG CACCGGGCGA GCGCCGCGAG
ATTCTGGTGG ATATGAGCAA CGGCGATGAA GTGTCGATCA CCTGTGGCGA AGCGGCGAGC
ATTGTTGATC GTATTCGTGG CTTCTTTGAG CCATCCAGTA TTCTGGTTTC TACCCTGGTG
CTAACGCTGC GCCCAACCGG CCTTCTGCCG CTGGTCACAG ACAGTCTTCC GATGCGCTTG
CTGCCAACTG AAATCATGGC TGGTTCGCCA ATTCGCAGTC GCGATATCAG TCTGGGTGAT
GACCCGGGTA TTAATGGACA GCTGTGGGAC GTCAACCGTA TTGATGTCAC CGCGCAGCAA
GGAACGTGGG AACGCTGGAC GGTACGCGCG GACGAGCCGC AAGCGTTCCA TATTGAAGGC
GTAATGTTCC AGATCCGTAA CGTGAATGGC GCGATGCCGT TCCCGGAAGA CAGAGGCTGG
AAAGATACCG TTTGGGTTGA CGGACAAGTG GAGCTGCTTG TTTATTTCGG TCAGCCTTCC
TGGGCGCACT TCCCGTTCTA CTTCAACAGT CAGACGCTGG AAATGGCGGA CCGTGGCTCG
ATTGGGCAAC TGTTAGTCAA TCCGGTACCG TAA
 
Protein sequence
MSLSRRQFIQ ASGIALCAGA VPLKASAAGQ QQPLPVPPLL ESRRGQPLFM TVQRAHWSFT 
PGTRASVWGI NGRYLGPTIR VWKGDDVKLI YSNRLTENVS MTVAGLQVPG PLMGGPARMM
SPNADWAPVL PIRQNAATLW YHANTPNRTA QQVYNGLAGM WLVEDEVSKS LPIPNHYGVD
DFPVIIQDKR LDNFGTPEYN EPGSGGFVGD TLLVNGVQSP YVEVSRGWVR LRLLNASNSR
RYQLQMNDGR PLHVISGDQG FLPAPVSVKQ LSLAPGERRE ILVDMSNGDE VSITCGEAAS
IVDRIRGFFE PSSILVSTLV LTLRPTGLLP LVTDSLPMRL LPTEIMAGSP IRSRDISLGD
DPGINGQLWD VNRIDVTAQQ GTWERWTVRA DEPQAFHIEG VMFQIRNVNG AMPFPEDRGW
KDTVWVDGQV ELLVYFGQPS WAHFPFYFNS QTLEMADRGS IGQLLVNPVP