Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3519 |
Symbol | |
ID | 6969320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3262435 |
End bp | 3264141 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387320 |
Product | large subunit terminase |
Protein accession | YP_002271783 |
Protein GI | 209397398 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.918493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00317443 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTCC GGAAGAATGA ACCGCGATGT GATGAGCCGT CAGAAATGAC CGAGGCTGAA CAACGTCTGT TCATCATGAC TAAACTGAGC AATCCCTGGT GGCGGCTCAA TCATCTCTAC AAAATACAGA ACGAAAAAGG TGAACTGGTC ACCTTCAGAA TGCGACCGGC GCAGCGCCAG TTGTTCCGGA GCATGCACAA TAAAAATATT ATCCTGAAAG CGCGCCAGCT GGGATTTTCC ACAGCCATTG ATATTTATCT TCTCGACCAG GCATTATTCA TTCCGCATCT CAAATGCGGG ATCGTCGCTC AGGATAAACA GGCTGCCAGT GAAATTTTCC GCACAAAAAT TGCTGTACCG TTTGATCATC TCCCTGACTG GCTGAGAGCC TCATTCACCA TCGTTGAACG TCGTAGCGGT GCCAGCGGTG GCTATATCCT GTTTGGTCAC GGCTCGAGTA TCCAGGTGGC AACCTCATTC CGTTCAGGTA CGGTGCAGCG CCTGCATATC TCAGAGCACG GCAAAATTTG CGCGAAATAT CCGGCTAAGG CGAAAGAACT GCGAACCGGT ACGCTTAATG CCGTCTCTGA TGAATGCATT ATTTTTGATG AGTCCACTGC TGAAGGCGTG GGTGGTGATT TTTACGAGAT GAGTAACCGA GCACAGGAGA TCACTGCATC AGGCTTATTG CTGACGGCAC AGGATTATAA ATTCCATTTT TACGCCTGGT GGCAGGATCC TAAATACAGC GCCAGAGTGC CGGAAAGCGG GCTGAAGCTG TCACGGGAAA AAATGACGTA TTTTTCTGCG GTTGAGAAGG CAATGAACAT CACGCTTACT GATGAACAGA AGCAGTGGTA CATCAATAAG GAAACTGAAC AGCGTGAGGA AATGAAGCAG GAGTTTCCCT CAACGCCACA GGAGGCGTTT CTGACGTCCG GACGACGTGT GTTCAGTGCC GAAAGTACGT TGCAGGCAGA ATCATTCTGT TCGCCACCGA TGATTGTTTA TGACATTGAA CCTGTTACAG GAGCGAAGAC TAAAGCTCAG TCTCTGCGTG AAGGAAATAA AAACGAGTTG CAGCGGACGC TGATGAATTA TCTGCTGGTA TGGGAACTGC CGGATCCGGA TGAAGAGTAT GTTTGTGGGG CAGATACTGC CGAAGGGCTG GAGCACGGAG ACCGCTCATC GCTGGATGTT GTCAAACGCA GTAATGGCGA GCAGGTGGCT CACTGGTTCG GGCATCTCGA TGCTGAACTT TTTGCTCATC TCATTTCGCA GGTCTGTCGT ATGTATAACA ACGCGTTTGT GGGGCCGGAG CGTAATAATC ACGGACATGC AGTTATCCTG AAACTCCGGG AACTCTATCC GACACGTTAT ATCTACAACG AACAGCATCT TGACCAGGCA TATGACGACG ATACGCCCCG CCTTGGCTGG CTGACAACCC GTCAGAGCAA ACCTGTTCTG ACCGAAGGAA TGAAAACGCT TCTGAATAAT GGAATATCAG GGATCCGCTG GTCAGGCACA TTATCGGAAA TGAACACCTA CGTTTATGAC GCGAAAGGCT CCATGAATGC ACAGGAAGGC TGCTTTGATG ATCAGCTCAT GAGCTACATG ATTGCCCAGG AGATGCGCGC CAGAATGCCG GTGAGGGTAA AACAGAAAAC GGATAAACGC AGAACCACAC ACTGGATGGC ACACTGA
|
Protein sequence | MTFRKNEPRC DEPSEMTEAE QRLFIMTKLS NPWWRLNHLY KIQNEKGELV TFRMRPAQRQ LFRSMHNKNI ILKARQLGFS TAIDIYLLDQ ALFIPHLKCG IVAQDKQAAS EIFRTKIAVP FDHLPDWLRA SFTIVERRSG ASGGYILFGH GSSIQVATSF RSGTVQRLHI SEHGKICAKY PAKAKELRTG TLNAVSDECI IFDESTAEGV GGDFYEMSNR AQEITASGLL LTAQDYKFHF YAWWQDPKYS ARVPESGLKL SREKMTYFSA VEKAMNITLT DEQKQWYINK ETEQREEMKQ EFPSTPQEAF LTSGRRVFSA ESTLQAESFC SPPMIVYDIE PVTGAKTKAQ SLREGNKNEL QRTLMNYLLV WELPDPDEEY VCGADTAEGL EHGDRSSLDV VKRSNGEQVA HWFGHLDAEL FAHLISQVCR MYNNAFVGPE RNNHGHAVIL KLRELYPTRY IYNEQHLDQA YDDDTPRLGW LTTRQSKPVL TEGMKTLLNN GISGIRWSGT LSEMNTYVYD AKGSMNAQEG CFDDQLMSYM IAQEMRARMP VRVKQKTDKR RTTHWMAH
|
| |