Gene ECH74115_2775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2775 
Symbol 
ID6966563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2593060 
End bp2594652 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content57% 
IMG OID643386630 
Productphage portal protein, lambda family 
Protein accessionYP_002271109 
Protein GI209397959 
COG category[R] General function prediction only 
COG ID[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAA CGCCTGTCCT GATTGATGTG AACGGCGTTC CGCTTCGTGA GAGTCTCAGC 
TACAACGGGG GCGGTGCAGG ATTTGGCGGG CAAATGGCGG AGTGGTTGCC ACCGGCGCAG
AGTGCCGATG CGGCCCTGCT GCCCGCGTTG CGTCTGGGGA ATGCCCGGGC AGATGATCTG
GTGCGCAATA ACGGAATAGC GGCCAATGCG GTGGCACTGC ATAAGGATCA CATTGTCGGG
CATATGTTTC TTATCAGCTA CCGTCCGAAC TGGCGCTGGC TGGGGATGCG GGAGACCGCA
GCAAAAAGCT TTGTCGATGA GGTGGAGGCG GCCTGGTCGG AATACGCCGA AGGGATGTCT
GGCGAGATCG ACGTGGAAGG AAAACGCACG TTCACGGAAT TTATCCGTGA AGGTGTGGGC
GTTCATGCGT TTAACGGCGA AATCTTTGTG CAGCCGGTCT GGGATACGGA AACCACGCAG
TTATTCCGTA CGCGTTTTAA AGCCGTGAGT CCGAAACGGG TGGACACGCC AGGACACGGT
ATGGGGAACC GTTTTCTGCG GGCCGGGGTG GAGGTCGATC GATATGGCCG TGCCGTTGCG
TACCATATCT GTGAGGATGA TTTTCCTCGC TCCGGGAGTG GACGATGGGA ACGGATCCCG
CGTGAACTTC CCACCGGGCG TCCGGCCATG CTGCATATTT TCGAGCCGGT GGAGGACGGG
CAGACCCGTG GGGCCAACCA GTTTTACAGC GTCATGGAAC GGCTGAAGAT GCTCGATTCC
CTGCAGGCAA CACAGCTTCA GTCGGCCATT GTGAAAGCCA TGTATGCAGC GACGATTGAA
AGTGACCTTG ATACCGAAAA GGCCTTTGAA TATATCGCCG GTGCGCCGCA GGGGCAGAAG
GATAATCCGC TTATTAATAT TCTGGAGAAG TTCTCCAGCT GGTATGACAC GAATAACGTG
ACGCTGGGTG GTGTCAAAAT TCCGCACCTT TTCCCCGGGG ATGATCTGAA ACTACAGACT
GCGCAGGATT CAGACAATGG ATTTTCGGCG CTTGAACAGG CGCTGCTGCG GTATATCGCC
GCCGGTCTTG GCGTTTCCTA CGAACAGTTG TCCCGTGATT ACTCGAAGGT CAGTTATTCA
AGTGCCAGGG CCTCTGCCAA TGAGTCGTGG CGCTATTTTA TGGGGCGGCG AAAATTTATT
GCGGCCCGGC TGGCCACGCA GATGTTTTCC TACTGGCTGG AAGAGGCACT TCTTCGGGGG
ATTATCCGTC CGCCACGGGC GCGTTTTGAT TTTTATCAGG CGCGATCAGC CTGGTCACGG
GCAGAGTGGA TTGGTGCCGG AAGAATGGCC ATTGACGGGC TCAAGGAGGT TCAGGAATCG
GTGATGCGCA TTGAGGCCGG ACTGAGCACG TATGAGAAAG AGCCGGCGCT GATGGGCGAG
GATTATCAGG ACATTTTCCG CCAGCAGGTC AGGGAATCTG CAGAGCGGCA AAAAGCCGGA
CTCTCACGTC CGGTGTGGAT AGCGCAGGCG TATCAGCAGC AGATAGCGGA GAGTCGCAGG
CCGGAAGAGG AGACAACACC CCGTGAGACG TAA
 
Protein sequence
MKRTPVLIDV NGVPLRESLS YNGGGAGFGG QMAEWLPPAQ SADAALLPAL RLGNARADDL 
VRNNGIAANA VALHKDHIVG HMFLISYRPN WRWLGMRETA AKSFVDEVEA AWSEYAEGMS
GEIDVEGKRT FTEFIREGVG VHAFNGEIFV QPVWDTETTQ LFRTRFKAVS PKRVDTPGHG
MGNRFLRAGV EVDRYGRAVA YHICEDDFPR SGSGRWERIP RELPTGRPAM LHIFEPVEDG
QTRGANQFYS VMERLKMLDS LQATQLQSAI VKAMYAATIE SDLDTEKAFE YIAGAPQGQK
DNPLINILEK FSSWYDTNNV TLGGVKIPHL FPGDDLKLQT AQDSDNGFSA LEQALLRYIA
AGLGVSYEQL SRDYSKVSYS SARASANESW RYFMGRRKFI AARLATQMFS YWLEEALLRG
IIRPPRARFD FYQARSAWSR AEWIGAGRMA IDGLKEVQES VMRIEAGLST YEKEPALMGE
DYQDIFRQQV RESAERQKAG LSRPVWIAQA YQQQIAESRR PEEETTPRET