Gene ECH74115_1821 details

Gene Information

Locus tag: ECH74115_1821
Symbol:
ID: 6969159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1737316
End bp: 1738446
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 46%
IMG OID: 643385759
Product: site-specific recombinase, phage integrase family
Protein accession: YP_002270249
Protein GI: 209397356
COG category:
COG ID:
TIGRFAM ID:
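
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1737316, 1738446)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```

Both results agree with the Gene Length and Protein Length fields above.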


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0160439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000000638112
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGCGCC CGCGAAAATA TAAAACCGAT GTTCCGGGAT TATCTCCGTA TTTTGACAAA 
AGAAATAACA AAGTTTACTG GCGTTACAGG CATCCCATAA CAGGCAAAAA TCACGGTCTC
GGCAGTATTG ACCAGAAACT GGCAGAAACT ATTGCAGCAG AAGCGAACAG CCGTCTTGCC
CGGCAGCAAA TGGAACAAAT GCTCAGTCTG CAGGAGAAAA TTATTAGTGA TACCGGCGGT
TCATCAACCG TTACCATTTT TCTGAATAAT TACAGAAAAA TTCAACAGGA AAGATATGAA
AACGGCGAGA TCAAACTCAA CACGCTGAAA CAGAAAGCGG CCCCTCTCAG GGTATTTGAT
GAACGTTTTG GCACCAGACC GTTAGATGCC ATAACCGTAA AAGATGTGGT ATCAGTACTG
GAAGAGTACA AGGCCAGAGG ACATAACAGA ATGGGACAAA TTTTCAGGAA AGTACTGATC
GATGTTTTCC GGGAAGCTCA GCAAACGGGC GATGTCCCGC CAGGCTTTAA CCCTGCAGAA
TCTGCAAAAA AACCGCAGGT GCGGATATCA AGACAGCGAC TGACTTTTGA TGAGTGGATG
ATGATTTATA ACGCAGCGGA AAAGGATGGT TACTTTTTAC AGCGCGGTAT GCTGCTGGCA
CTGATGACAG GCCAGCGCCT TTCAGATATT TGCAAAATGC AATTTTCGGA TATCCGGGAT
GGTTATCTTC ATGTCGAACA GCAAAAAACA GGAACCCGGA TTGCCATCCC TCTGGCTCTG
CGTTGCGATA AATTAAATCT CACCCTGGAT GATGTGGTGT CATCCTGCCG CGATTGCGTT
CTTAGTCCGT GGCTATTGCA CCACCATCAC GCGAAAGGGA CAGCTAAGCG CGGCGGGATG
GTTAAGCCAG CAACATTAAC CGTTGCATTT AAAAAAGCCC GGGATTCTGT GGATTACAAC
TGGCGTGCTA ATGGCACCCC ACCCTCTTTC CATGAGCAGA GATCTTTATC AGAGCGATTG
TTCAGAGAGC AGGGGGTTGA TACCAAAATT TTGCTAGGCC ATTCGAATCA AAAAATGATC
GATATTTACA ACGACGCACG CGGTAAGGAA TGGAAAAAAC TGGTCATTTG A
 
Protein sequence
MARPRKYKTD VPGLSPYFDK RNNKVYWRYR HPITGKNHGL GSIDQKLAET IAAEANSRLA 
RQQMEQMLSL QEKIISDTGG SSTVTIFLNN YRKIQQERYE NGEIKLNTLK QKAAPLRVFD
ERFGTRPLDA ITVKDVVSVL EEYKARGHNR MGQIFRKVLI DVFREAQQTG DVPPGFNPAE
SAKKPQVRIS RQRLTFDEWM MIYNAAEKDG YFLQRGMLLA LMTGQRLSDI CKMQFSDIRD
GYLHVEQQKT GTRIAIPLAL RCDKLNLTLD DVVSSCRDCV LSPWLLHHHH AKGTAKRGGM
VKPATLTVAF KKARDSVDYN WRANGTPPSF HEQRSLSERL FREQGVDTKI LLGHSNQKMI
DIYNDARGKE WKKLVI
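
The relationship between the two sequences above can be sketched in code. The example below is an illustration only, not the database's own pipeline. It builds the codon table from the NCBI amino-acid string, which translation table 11 shares with the standard code (table 11 differs only in which codons may act as starts, and this gene starts with ATG). The helper names `clean`, `translate` and `gc_content` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G, with the first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGGCGCGCC CGCGAAAATA TAAAACCGAT GTTCCGGGAT TATCTCCGTA TTTTGACAAA"
print(translate(first_line))  # MARPRKYKTDVPGLSPYFDK
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field in the record.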