Gene EcHS_A2286 details

Gene Information

Locus tag: EcHS_A2286
Symbol:
ID: 5592659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2284037
End bp: 2285194
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 54%
IMG OID: 640921414
Product: hypothetical protein
Protein accession: YP_001458950
Protein GI: 157161632
COG category: [S] Function unknown
COG ID: [COG2311] Predicted membrane protein
TIGRFAM ID:
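
The coordinate and length fields in this record are related by simple arithmetic, which the short sketch below checks. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes one untranslated stop codon. Both assumptions agree with the listed values but are not stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2284037
end_bp = 2285194

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1158
print(protein_length)  # 385
```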


Plasmid Coverage information

Num covering plasmid clones: 70
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCGCA ACGTCACGCT CGATTTTGTT CGCGGCGTCG CCATTCTGGG GATCCTGCTA 
TTAAACATCA GCGCCTTTGG GCTACCAAAG GCGGCTTATC TTAATCCCGC CTGGTACGGC
GCTATTACGC CGCAGGATGC ATGGACCTGG GCATTTCTTG ATCTCGTCGG CCAGGTGAAA
TTCCTCACGC TTTTTGCGCT GCTGTTTGGT GCGGGCCTGC AAATGTTGCT GCCCCGTGGC
AGACGCTGGA TCCAGTCGCG GTTAACGCTG TTAGTCTTGT TGGGCTTTAT TCACGGTTTA
CTGTTCTGGG ACGGCGATAT TCTGCTGGCT TACGGGCTGG TGGGCTTAAT CTGCTGGCGG
CTGGTGCGCG ATGCGCCATC GGTAAAAAGC CTGTTTAATA CAGGCGTCAT GCTTTATCTG
GTGGGGCTTG GCGTTTTGCT GTTATTGGGG TTGATTTCCG ATAGCCAGAC CAGCCGCGCC
TGGACGCCGG ATGCATCGGC TATTTTGTAT GAAAAATACT GGAAGCTTCA CGGCGGCGTT
GATGCGATCA GTAATCGTGC CGATGGTGTT GGCAACAGTT TACTGGCACT GGGCGCACAG
TATGGCTGGC AACTGGCTGG GATGATGCTC ATTGGTGCCG CATTGATGCG CAGTGGCTGG
CTGAAAGGGC AGTTCAGCTT ACGTCACTAT CGTCGTACTG GTTTTGTGCT GGTGGCGATT
GGGGTGATCA TTAACCTTCC TGCCATCGCC CTGCAATGGC GGCTGGACTG GGCATATCGC
TGGTGCGCCT TCTTACTTCA GATGCCGCGG GAACTGAGTG CGCCGTTTCA GGCGATAGGC
TATGCGTCGC TGTTTTATGG TTTCTGGCCG CAATTGAGCC GCTTTAAGCT GGTGCTTGCG
ATCGCCTGCG TCGGACGGAT GGCGCTGACC AACTATCTAT TGCAAACGCT GATTTGTACC
ACGCTTTTTT ACCACCTCGG TTTGTTTATG CATTTTGACC GCCTGGAGCT GCTGGCGTTT
GTTATTCCGG TATGGCTGGC GAATATCCTC TTCTCTGTTA TCTGGCTGCG TTTCTTCCGC
CAGGGGCCGG TGGAATGGCT CTGGCGTCAG TTAACTTTGC GTGCTGCCGG ACCGGCAATA
TCTAAAACAT CAAGATAA
 
Protein sequence
MERNVTLDFV RGVAILGILL LNISAFGLPK AAYLNPAWYG AITPQDAWTW AFLDLVGQVK 
FLTLFALLFG AGLQMLLPRG RRWIQSRLTL LVLLGFIHGL LFWDGDILLA YGLVGLICWR
LVRDAPSVKS LFNTGVMLYL VGLGVLLLLG LISDSQTSRA WTPDASAILY EKYWKLHGGV
DAISNRADGV GNSLLALGAQ YGWQLAGMML IGAALMRSGW LKGQFSLRHY RRTGFVLVAI
GVIINLPAIA LQWRLDWAYR WCAFLLQMPR ELSAPFQAIG YASLFYGFWP QLSRFKLVLA
IACVGRMALT NYLLQTLICT TLFYHLGLFM HFDRLELLAF VIPVWLANIL FSVIWLRFFR
QGPVEWLWRQ LTLRAAGPAI SKTSR
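
The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch of how such blocks might be joined, summarized, and translated is given below. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: sufficient for table 11, which shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to lay out a sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 18 bases of the gene sequence in this record.
head = join_blocks("ATGGAGCGCA ACGTCACGCT C")
tail = join_blocks("TCTAAAACAT CAAGATAA")

print(translate(head))  # MERNVTL
print(translate(tail))  # SKTSR
```

Passing the full gene sequence through `join_blocks` and then `gc_fraction` or `translate` would allow the GC content and protein sequence fields of the record to be compared against the nucleotide data.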