Gene ECH74115_2438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2438 
Symbol 
ID6971631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2309100 
End bp2310998 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content39% 
IMG OID643386308 
Producthypothetical protein 
Protein accessionYP_002270790 
Protein GI209398577 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0495046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA ACGACATTAT TATCAGAACT CATTATAAGT CTCCTCATAG AATGCACATC 
GATAGCGACA TACCAACGCC TTCATCAGAG CCTATTAATC AATTTGCGCC CCAACTCATC
ACCCTACTTG ATACCTCTGA CTTAAGTTCG ATGCTGTCAT ACTGTGTTAC TCAGGAATTT
ACCGCAAACT GTCGAAAAAT ATCACAAAAT TGTTATTCCA CTGCCCTTTT TACCATTAAC
TTTGCCACTT CACCCATCCA TGCAGAAAAT ATATTCATTA CATTACACTA TAAAAAAGAA
ATCATTTCCT TATTACTGGA AACCACGCCT ATTAAAGCTA ACCATTTGCG AAGCATACTG
GATTATATTG AACAGGAACA GTTAACTGCC GAAAATCGTA ACCATTGTAT GAAACTGTCT
AAAAAAATCC ATAGAGAAAA AACTATACAA CCAACAGTAA ATCTCAATGG TAGTGCATTT
TTTTCGCAAT CTCCTTCTGA CGCTATTTTT TGTCGCCATC TGTCATTGCA ATACGCACTT
GATTCATTGA GAAATGGAAA AGGCAAGGTC AACCTGATTA AACATTACTC CTCCGTTGAA
TCCATACAGC AGCATGTCCC CTTAGTCCGG GACGCGGAGT TCAGAGCATT ACTTCGCCAT
CCTCCTGCAG GGAGTCGCGT TATCGCGAGT AAGGATTTTG GCTTCGCTTT AGATATTTTC
TTCTGTCGAA TGATGGCAAA CAATGTCAGT CATATGTCTG CGATTTTATA TATAGACAAT
CATACTTTGT CAGTAAGGCT ACGAATAAAG CAGTCGGCGT ATGGGCAATT AAATTATGTT
GTGTCCGTTT ACGACCCGAA CGATACCAAC GTTGCCGTCA GAGGCACCCA CAGGACAGCA
CGGGGCTTTC TCTCGCTTGA TAAGTTCATC AGTTCAGGTC CCGATGCTCA GACCTGGGCT
GATAGGTATG TTCGCAACTG TGCAATTGCT ATTCTGCCCC TATTACCTGA GGGAGTTCCA
GGGGCTATTT TCACGGGTAT CGCGACACGA ATGCCATTTG CCCCTATACA TCCATCGGCA
ATGTTGTTAA TAATGGCCAC AGGCCAGACT CAACAGCTTA TTACATTATT CAGACAGTTG
CCCATACTCC CTGAAAAAGA AATCATTGAA ATAATAACTG CGCAGAATAG CATTGGTACA
CCTGCTTTAT TTTTGGCTAT GATGAACGGA CATACTGACA ACGTGAAAAT ATTTATGCAA
GAAATTCAGT CACTGGTAGA TAATCACATC ATTCATGAAG ATAATCTGGT TAAATTGTTG
CAAACTAAAA GTGCTAACGA AACACCTGGA CTTTATATCT CCATGTTGTA TGGATTCGAT
GAGATAATCG ATATCTTTCT GAATGCATTA ACCACTCCTA TAGCACAAGA GCTTTTAAAC
AAAAAACTGG TGATGAGTAT TTTAGCCATG AAAATACATG ATGGTGAGCC AGGATTATAC
GCCGCAATGG AAAATAATCA CCCTTTGTGT GTCACACGGT TCCTCTCTAA AATTAATGGC
ATCGCCTTTA AATACAAGTT GAGCAAAGCT AACATCATGG ATTTATTAAA AGGCGCTACA
GCACAGGGAA CCCCTGCATT ATACATCGCT ATGAGCAAGG GTAATGAAGA CGTCGTGTTA
TCTTATATAT CGACGCTGGG TGCTTTTGCA AAAAAACATT CTTTTAGTCA ACATCAGTTA
TTTACACTGT TGGCTGCTAA AAATCATGAC AACATGTCAG CTGTTCATAT AGCCATTCAT
CATAATCATT ATAAAACTGT AGAAACATAT TATGCTGCTA TAAATGTAAT CAGCCAAAGC
ATGAGTTTTA GTGCTGATGA ATTAAAGACG TATTTATAA
 
Protein sequence
MSQNDIIIRT HYKSPHRMHI DSDIPTPSSE PINQFAPQLI TLLDTSDLSS MLSYCVTQEF 
TANCRKISQN CYSTALFTIN FATSPIHAEN IFITLHYKKE IISLLLETTP IKANHLRSIL
DYIEQEQLTA ENRNHCMKLS KKIHREKTIQ PTVNLNGSAF FSQSPSDAIF CRHLSLQYAL
DSLRNGKGKV NLIKHYSSVE SIQQHVPLVR DAEFRALLRH PPAGSRVIAS KDFGFALDIF
FCRMMANNVS HMSAILYIDN HTLSVRLRIK QSAYGQLNYV VSVYDPNDTN VAVRGTHRTA
RGFLSLDKFI SSGPDAQTWA DRYVRNCAIA ILPLLPEGVP GAIFTGIATR MPFAPIHPSA
MLLIMATGQT QQLITLFRQL PILPEKEIIE IITAQNSIGT PALFLAMMNG HTDNVKIFMQ
EIQSLVDNHI IHEDNLVKLL QTKSANETPG LYISMLYGFD EIIDIFLNAL TTPIAQELLN
KKLVMSILAM KIHDGEPGLY AAMENNHPLC VTRFLSKING IAFKYKLSKA NIMDLLKGAT
AQGTPALYIA MSKGNEDVVL SYISTLGAFA KKHSFSQHQL FTLLAAKNHD NMSAVHIAIH
HNHYKTVETY YAAINVISQS MSFSADELKT YL