Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4323 |
Symbol | |
ID | 6142866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4420121 |
End bp | 4421893 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641619144 |
Product | phage large terminase subunit GpP |
Protein accession | YP_001746268 |
Protein GI | 170683976 |
COG category | [S] Function unknown |
COG ID | [COG5484] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00512482 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCATCA CCACAGACAC CACTCTTTTA CACGACCCGC GTCGTCAGGC GGCGCTGCTG TACTGGCAGG GGTTTTCCGT GCCGCAGATT GCCGCCATGT TGCAGATGAA ACGCCCGACG GTGCAGAGCT GGAAACAGCG CGACGGCTGG GACAGCGTTG CCCCCATCAG CCGTGTCGAA ATGAGTCTGG AAGCGCGGCT GACCCAGCTC ATCATCAAAC CGCAGAAAAC CGGCGGTGAC TTCAAGGAAA TTGACCTGCT GGGACGCCAG ATTGAACGAC TGGCACGGGT CAACCGCTAC AGTCAGACCG GCAACGAGGC AGACCTTAAT CCGAACATCG CTAACCGCAA CAAAGGCGGG CGGCGCAAAC CGAAAAAGAA TTTTTTCAGC GACGAAGCCA TCGAAAAGCT GGAGCAGATT TTCTTTGAGC AGTCTTTCGA CTATCAGTTG CACTGGTATC GCGCCGGGCT TGAGCACCGC ATCCGCGATA TCCTGAAATC CCGCCAGATT GGCGCAACGT TTTATTTTTC CCGCGAGGCG CTGCTGCGTG CCCTGAAAAC CGGTCATAAC CAGATTTTTC TGTCGGCCAG TAAAACGCAG GCGTATGTGT TCCGCGAATA CATCATCGCC TTTGCCCGGC TGGTTGACGT TGACCTGACC GGTGACCCGA TTGTCCTGGG CAATAACGGC GCAAAACTGA TTTTTCTCGG CACCAACTCC AACACCGCGC AGAGCCATAA CGGCGACCTG TACGTCGACG AGATTTTCTG GATCCCGAAT TTTCAGGTAC TGCGTAAGGT GGCATCAGGT ATGGCCTCAC AGAGTCACCT GCGCTCGACC TATTTCTCCA CCCCGTCCAC GCTGGCGCAC GACGCCTACC CGTTCTGGTC GGGTGAACTG TTTAACCGGG GACGCGCCAG CGCCGCCGAA CGCGTGGAAA TCGACGTCAG TCATAACGCC CTTGCCGGTG GGCTTCTCTG TGCGGACGGC CAGTGGCGGC AGATTGTCAC CATTGAGGAC GCCCTGAAAG GCGGCTGCAC GCTGTTCGAC ATTGAGCAGC TCAAACGCGA AAACAGCGCC GACGATTTTA AAAACCTGTT CATGTGTGAA TTTGTTGACG ACAAGGCGTC GGTATTCCCG TTCGAGGAGC TGCAACGCTG CATGGTCGAC ACGCTGGAAG AATGGGAAGA CTATGCGCCG TTTGCCGCCA ATCCGTTCGG CTCCCGCCCG GTATGGATTG GTTACGACCC GTCACACCGT GGCGACAGCG CCGGATGCGT GGTGCTGGCA CCGCCGGTGG TGGCCGGAGG TAAATTCAGA ATACTTGAGC GTCACCAGTG GAAAGGCATG GACTTTGCCA CTCAGGCGGA ATCCATCCGC AAACTCACCG AAAAATACAA CGTCGAATAC ATCGGAATTG ATGCCACCGG CCTCGGTGTC GGCGTGTTCC AGCTCGTGCG CTCGTTCTAT CCCGCCGCGC GCGATATCCG CTACACGCCG GAAATGAAAA CCGCAATGGT GCTCAAGGCA AAAGACGTCA TCCGCCGTGG CTGTCTGGAA TATGACGTCA GCGCCACCGA CATCACCAGC TCGTTCATGG CTATCCGCAA GACCATGACC AGCAGCGGAC GCAGCGCCAC CTATGAGGCC AGCCGTAGCG AGGAAGCCAG CCACGCCGAC CTCGCCTGGG CGACCATGCA CGCCCTGTTA AATGAGCCAC TCACCGCCGG TATCAGCACC CCGCTGACAT CCACCATTCT GGAGTTTTAC TGA
|
Protein sequence | MTITTDTTLL HDPRRQAALL YWQGFSVPQI AAMLQMKRPT VQSWKQRDGW DSVAPISRVE MSLEARLTQL IIKPQKTGGD FKEIDLLGRQ IERLARVNRY SQTGNEADLN PNIANRNKGG RRKPKKNFFS DEAIEKLEQI FFEQSFDYQL HWYRAGLEHR IRDILKSRQI GATFYFSREA LLRALKTGHN QIFLSASKTQ AYVFREYIIA FARLVDVDLT GDPIVLGNNG AKLIFLGTNS NTAQSHNGDL YVDEIFWIPN FQVLRKVASG MASQSHLRST YFSTPSTLAH DAYPFWSGEL FNRGRASAAE RVEIDVSHNA LAGGLLCADG QWRQIVTIED ALKGGCTLFD IEQLKRENSA DDFKNLFMCE FVDDKASVFP FEELQRCMVD TLEEWEDYAP FAANPFGSRP VWIGYDPSHR GDSAGCVVLA PPVVAGGKFR ILERHQWKGM DFATQAESIR KLTEKYNVEY IGIDATGLGV GVFQLVRSFY PAARDIRYTP EMKTAMVLKA KDVIRRGCLE YDVSATDITS SFMAIRKTMT SSGRSATYEA SRSEEASHAD LAWATMHALL NEPLTAGIST PLTSTILEFY
|
| |