Gene EcolC_1190 details


Gene Information

Locus tag: EcolC_1190
Symbol:
ID: 6065804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1304245
End bp: 1305825
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 54%
IMG OID: 641600606
Product: hydrogenase 4 subunit F
Protein accession: YP_001724184
Protein GI: 170019230
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit
TIGRFAM ID:
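
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Both are assumptions about how this record was produced, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1304245, 1305825)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.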


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.647759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTATT CTGTGATGTT CGCTTTACTC CTGCTCACGC CGCTGCTTTT TTCGCTGCTC 
TGTTTTGCCT GCCGGAAACG GAGACTTTCT GCGACTCGCA CGGTGACCGT ATTACATAGC
TTAGGGATCA CACTGCTGCT GATTCTGGCA CTCTGGGTGG TCCAAACTGC CGCTGATGCA
GGAGAAATAT TCGCTGCGGG ACTGTGGCTT CATATTGATG GTCTGGGCGG TTTGTTCCTC
GCCATTCTTG GTGTGATTGG CTTTCTCACC GGTATTTACT CGATTGGCTA CATGCGTCAT
GAAGTGGCAC ACGGCGAGCT TTCACCCGTT ACGCTGTGCG ATTACTACGG TTTCTTCCAT
CTGTTTTTGT TCACCATGCT GCTGGTTGTT ACCAGCAATA ACCTGATTGT GATGTGGGCG
GCGATCGAAG CCACCACCTT AAGCTCGGCG TTTCTGGTAG GCATTTACGG TCAGCGTTCA
TCGCTGGAAG CTGCATGGAA GTACATCATT ATTTGTACTG TTGGTGTCGC TTTTGGTCTG
TTCGGTACCG TGCTGGTATA CGCCAACGCC GCCAGCGTTA TGCCGCAGGC AGAAATGGCG
ATATTCTGGA GCGAGGTTCT TAAGCAATCG TCCTTGCTTG ACCCAACACT AATGCTGTTG
GCCTTTGTGT TTTTGCTAAT TGGCTTTGGT ACCAAAACCG GGCTATTTCC CATGCACGCC
TGGCTGCCGG ATGCTCACAG TGAAGCGCCG AGTCCGGTCA GCGCCCTGCT CTCCGCCGTA
TTGCTGAACT GCGCGCTGTT GGTGCTGATT CGCTATTACA TCATTATTTG CCAAGCCATC
GGCAGCGATT TCCCCAACCG GTTGTTGCTC ATCTTCGGCA TGTTGTCGGT TGCCGTGGCG
GCATTTTTCA TTCTGGTACA GCGGGACATT AAGCGTCTGC TGGCGTACTC CAGCGTGGAG
AACATGGGGC TGGTCGCGGT GGCGCTAGGC ATTGGCGGGC CGCTGGGAAT TTTTGCCGCG
CTGCTGCACA TCTTAAACCA CAGTCTGGCA AAAACGCTGC TGTTCTGCGG TTCCGGCAAT
GTACTGCTCA AGTACGGCAC GCGCGATCTC AACGTCGTCT GTGGGATGCT CAAAATCATG
CCATTTACCG CCGTGCTGTT TGGCGGCGGT GCGCTGGCGC TGGCAGGGAT GCCGCCCTTC
AACATTTTTC TTAGCGAATT TATGACCATT ACCGCCGGAC TGGCACGTAA TCACCTGCTG
ATTATCGTCC TGCTGTTATT GCTGTTAACG CTGGTGCTGG CGGGCCTGGT ACGGATGGCT
GCGCGGGTGT TAATGGCGAA ACCGCCGCAG GCCGTTAACC GGGGTGATCT CGGCTGGTTG
ACCACCTCGC CAATGGTGAT TCTGCTGGTC ATGATGCTGG CGATGGGAAC GCATATTCCA
CAACCTGTCA TCAGGATCCT GGCGGGCGCT TCCACTATAG TCCTCTCAGG GACGCACGAT
CTGCCTGCAC AACGTAGCAC CTGGCATGAT TTTTTGCCTT CAGGCACCGC ATCTGTTTCG
GAGAAACACA GTGAACGTTA A
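
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the block layout could be stripped and a GC fraction computed. The helper names are hypothetical, and how the record rounds its reported GC content is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Last line of the gene sequence above, used only to show the cleaning step.
tail = clean_sequence("GAGAAACACA GTGAACGTTA A")
print(len(tail))  # 21
```

Applying `clean_sequence` to the full listing and passing the result to `gc_fraction` gives a value that can be compared with the GC content field.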
 
Protein sequence
MSYSVMFALL LLTPLLFSLL CFACRKRRLS ATRTVTVLHS LGITLLLILA LWVVQTAADA 
GEIFAAGLWL HIDGLGGLFL AILGVIGFLT GIYSIGYMRH EVAHGELSPV TLCDYYGFFH
LFLFTMLLVV TSNNLIVMWA AIEATTLSSA FLVGIYGQRS SLEAAWKYII ICTVGVAFGL
FGTVLVYANA ASVMPQAEMA IFWSEVLKQS SLLDPTLMLL AFVFLLIGFG TKTGLFPMHA
WLPDAHSEAP SPVSALLSAV LLNCALLVLI RYYIIICQAI GSDFPNRLLL IFGMLSVAVA
AFFILVQRDI KRLLAYSSVE NMGLVAVALG IGGPLGIFAA LLHILNHSLA KTLLFCGSGN
VLLKYGTRDL NVVCGMLKIM PFTAVLFGGG ALALAGMPPF NIFLSEFMTI TAGLARNHLL
IIVLLLLLLT LVLAGLVRMA ARVLMAKPPQ AVNRGDLGWL TTSPMVILLV MMLAMGTHIP
QPVIRILAGA STIVLSGTHD LPAQRSTWHD FLPSGTASVS EKHSER
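
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual compact 64-letter string. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, which does not matter here because this gene begins with ATG. The function names are illustrative, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block layout."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate(clean_sequence("ATGAGTTATT CTGTGATGTT CGCTTTACTC")))  # MSYSVMFALL
print(translate(clean_sequence("GAGAAACACA GTGAACGTTA A")))           # EKHSER
```

Both excerpts match the start and end of the protein sequence listed above. The whole listing can be checked the same way by passing the complete gene sequence through `clean_sequence` and `translate`.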