Gene EcolC_1496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1496 
Symbol 
ID6067114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1652157 
End bp1653314 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content54% 
IMG OID641600915 
Producthypothetical protein 
Protein accessionYP_001724485 
Protein GI170019531 
COG category[S] Function unknown 
COG ID[COG2311] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCA ACGTCACGCT CGATTTTGTT CGCGGCGTCG CCATTCTGGG GATCCTGCTA 
TTAAACATCA GCGCCTTTGG GCTACCAAAG GCGGCTTATC TTAATCCCGC CTGGTACGGC
GCTATTACGC CGCAGGATGC ATGGACCTGG GCATTTCTTG ATCTCGTCGG CCAGGTGAAA
TTCCTCACGC TTTTTGCGCT GCTGTTTGGT GCGGGCCTGC AAATGTTGCT GCCCCGTGGC
AGACGCTGGA TCCAGTCGCG GTTAACGCTG TTAGTCTTGT TGGGCTTTAT TCACGGTTTA
CTGTTCTGGG ACGGCGATAT TCTGCTGGCT TACGGGCTGG TGGGCTTAAT CTGCTGGCGG
CTGGTGCGCG ATGCGCCATC GGTAAAAAGC CTGTTTAATA CAGGCGTCAT GCTTTATCTG
GTGGGGCTTG GCGTTTTGCT GTTATTGGGG TTGATTTCCG ATAGCCAGAC CAGCCGCGCC
TGGACGCCGG ATGCATCGGC TATTTTGTAT GAAAAATACT GGAAGCTTCA CGGCGGCGTT
GATGCGATCA GTAATCGTGC CGATGGTGTT GGCAACAGTT TACTGGCACT GGGCGCACAG
TATGGCTGGC AACTGGCTGG GATGATGCTC ATTGGTGCCG CATTGATGCG CAGTGGCTGG
CTGAAAGGGC AGTTCAGCTT ACGTCACTAT CGTCGTACTG GTTTTGTGCT GGTGGCGATT
GGGGTGATCA TTAACCTTCC TGCCATCGCC CTGCAATGGC GGCTGGACTG GGCATATCGC
TGGTGCGCCT TCTTACTTCA GATGCCGCGG GAACTGAGTG CGCCGTTTCA GGCGATAGGC
TATGCGTCGC TGTTTTATGG TTTCTGGCCG CAATTGAGCC GCTTTAAGCT GGTGCTTGCG
ATCGCCTGCG TCGGACGGAT GGCGCTGACC AACTATCTAT TGCAAACGCT GATTTGTACC
ACGCTTTTTT ACCACCTCGG TTTGTTTATG CATTTTGACC GCCTGGAGCT GCTGGCGTTT
GTTATTCCGG TATGGCTGGC GAATATCCTC TTCTCTGTTA TCTGGCTGCG TTTCTTCCGC
CAGGGGCCGG CGGAATGGCT CTGGCGTCAG TTAACTTTGC GTGCTGCCGG ACCGGCAATA
TCTAAAACAT CAAGATAA
 
Protein sequence
MERNVTLDFV RGVAILGILL LNISAFGLPK AAYLNPAWYG AITPQDAWTW AFLDLVGQVK 
FLTLFALLFG AGLQMLLPRG RRWIQSRLTL LVLLGFIHGL LFWDGDILLA YGLVGLICWR
LVRDAPSVKS LFNTGVMLYL VGLGVLLLLG LISDSQTSRA WTPDASAILY EKYWKLHGGV
DAISNRADGV GNSLLALGAQ YGWQLAGMML IGAALMRSGW LKGQFSLRHY RRTGFVLVAI
GVIINLPAIA LQWRLDWAYR WCAFLLQMPR ELSAPFQAIG YASLFYGFWP QLSRFKLVLA
IACVGRMALT NYLLQTLICT TLFYHLGLFM HFDRLELLAF VIPVWLANIL FSVIWLRFFR
QGPAEWLWRQ LTLRAAGPAI SKTSR