Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1186 |
Symbol | |
ID | 6146211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1192092 |
End bp | 1193849 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616064 |
Product | putative phage terminase, large subunit |
Protein accession | YP_001743247 |
Protein GI | 170684273 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.619537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000000060413 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAAG TGGCTGACGG GATCCGCTAC GCCGAACGTG TTGTTGCAGG AGAAATTGTT GCTGGCGAAT TTGTCCGCCT GGCCTGCCAG CGTTTTCTTG ATGATCTGAA GTACGGCGAA GAGCGGGGGA TTTATTTCAG TGAACCCCGT GCGCAGCACA TCCTGAATTT CTACAAATTT GTGCCTCATG TAAAAGGGGC GCTGGCAGGC CAGCCCATTG AGTTGATGGA CTGGCATGTA TTTATCCTCA TTAATATTTT TGGTTTTGTC ATTCCGCTGG TCAATGAAGA GACCGGGGAA GTTGTCATGC GCAGCGATGG CAGCGGACGT CCGGTGATGG TGCGCCGGTT CCGGACGGCG TACAACGAAG TCGCCCGTAA AAACGCAAAA TCAACTCTGT CATCGGGTAT CGGCCTGTAT ATGACGGGGG CAGATGGTGA AGGCGGAGCT GAGGTGTATT CAGCCGCAAC CACGCGTGAC CAGGCCAGAA TCGTGTTTGA AGACGCCAAA AATATGGTCA GAAAAGCCCG GTCGACACTC GGGCGGTTGT TTGATTTCAA CAAGCTGGCG ATTTACCAGG AGCAGAGCGC ATCAAAATTT GAACCGCTTT CTTCGGATGC AAACAACCTG GATGGTCTGA ACATCCACTG CGCCATTATT GATGAGCTGC ATGCACATAA AACTCGTGAC GTGTGGGACG TTCTGGAAAC GGCAACCGGT GCCCGTCTGC AGTCCCTTTT ATTTGGTATC ACCACGGCAG GGTTTAACAA GGAAGGGATT TGTTACGAGC AGCGTGATTA CGCCATCAAG GTATTGCGTG GCTATAACAG CGACGTGGAG GGCGCGGTAA AAGACGACTC CTACTTTGCG ATTATTTACA CCCTCGATGA GGGAGATGAT CCGTTTGATG AAACGGTCTG GCAGAAAGCG AATCCCGGCC TGGGCATCTG TAAACGCTGG GATGATCTGC GTCGCCTGGC GAAAAAAGCG AAAGAACAGG TCTCTGCGCG GGTGAATTTT TTTACCAAAC ACATGAATGT GTGGGTAACA GCAGAGTCTG CCTGGATGGA CATGATTAAG TGGGATAAGT GCGAATACAT TGCCCCACGA CATGAGCTGA AAACGTATCC CATGTGGGTC GGCGTTGACC TTGCTCATAA GATTGATATC TGTGCGGCGG CAAAACTCTG GCGAACGGAT AACGGGCATG TTCATGCCGA TTTTAAATTC TGGCTTCCGG AAGGACGGCT GGAACGATGC TCGCGGCAGC AGGCAGAACT TTACCGGAAG TGGGCGGAGA TGGATAAGCT GATTCTGACG GATGGTGATG TTATCGATCA TGCTCAGATA AAAAGTGACT TACTGGAATG GATTGGTGGT GAAAACCTCA GGGAACTGGG ATTTGACCCG TGGAGCGCGA TGCAGTTCAG CCTGGCACTG GCTGAAGAAG GGATACCGCT GGTGGAGGTT CCGCAGACGG TTCGCAATCT GTCAGAGGCC ATGAAGGAAA CGGAATCACT GGTCTATGCC GGGCGTTTCC ATCACAGCAA TCATCCGGTC ATGAACTGGA TGATGTCTAA CGTTACGGTA AAACCGGACA AAAACGACAA TATCTTCCCG AATAAATCCA CGCTGGAAGC CAAAATCGAC GGCCCTGTTG CGATGTTTAC AGCAATGAGC CGGATGCTGG TCAATGGTGG TGAACCGGAG CTGGATCTGT CTGAACATCT GGTCAGCGTG GGCATCCGCT CGCTTTAA
|
Protein sequence | MAKVADGIRY AERVVAGEIV AGEFVRLACQ RFLDDLKYGE ERGIYFSEPR AQHILNFYKF VPHVKGALAG QPIELMDWHV FILINIFGFV IPLVNEETGE VVMRSDGSGR PVMVRRFRTA YNEVARKNAK STLSSGIGLY MTGADGEGGA EVYSAATTRD QARIVFEDAK NMVRKARSTL GRLFDFNKLA IYQEQSASKF EPLSSDANNL DGLNIHCAII DELHAHKTRD VWDVLETATG ARLQSLLFGI TTAGFNKEGI CYEQRDYAIK VLRGYNSDVE GAVKDDSYFA IIYTLDEGDD PFDETVWQKA NPGLGICKRW DDLRRLAKKA KEQVSARVNF FTKHMNVWVT AESAWMDMIK WDKCEYIAPR HELKTYPMWV GVDLAHKIDI CAAAKLWRTD NGHVHADFKF WLPEGRLERC SRQQAELYRK WAEMDKLILT DGDVIDHAQI KSDLLEWIGG ENLRELGFDP WSAMQFSLAL AEEGIPLVEV PQTVRNLSEA MKETESLVYA GRFHHSNHPV MNWMMSNVTV KPDKNDNIFP NKSTLEAKID GPVAMFTAMS RMLVNGGEPE LDLSEHLVSV GIRSL
|
| |