Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4339 |
Symbol | |
ID | 6146955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4430192 |
End bp | 4431100 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641619160 |
Product | baseplate assembly protein GpJ |
Protein accession | YP_001746284 |
Protein GI | 170682153 |
COG category | [R] General function prediction only |
COG ID | [COG3948] Phage-related baseplate assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0312894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATTA TCGACCTGAA CCAGCTACCC GCACCGGATG TGGTCGAGGA GCTGGACTTT GAAACCATTC TCGCCGAACG CAAGGCGACA CTGATTTCCC TTTACCCGGA AGACCAGCAG GAGGCGGTCG CCCGTACCCT GACACTGGAA TCTGAGCCTC TCGTCAAACT GCTGGAAGAA AATGCTTATC GTGAGCTTAT CTGGCGTCAG CGTGTGAATG AGGCCGCACG GGCGGTGATG CTGGCCTGTG CCGCCGGTAA TGACCTTGAT GTGATTGGTG CCAATTACAA CACCACGCGT CTGACTATCA CCCCGGCAGA TGATTCGACC ATCCCGCCGA CACCGGCAGT GATGGAATCT GATACCGATT ATCGTCTGCG TATTCAGCAG GCGTTTGAAG GTTTAAGCGT CGCCGGGTCG GTGGGTGCCT ATCAGTATCA TGGTCGCAGT GCTGACGGGC GTGTCGCGGA TATCTCTGTC ACCAGTCCGT CTCCGGCCTG CGTCACCATC TCTGTGCTGT CACGTGAGAA TAACGGTGTC GCATCCAAAG ACCTGCTGGC GGTGGTGCGT AACGCCCTGA ATGGCGAGGA CGTCAGACCG GTGGCCGACC GCGTGACCGT GCAGTCTGCC GCCATTGTTG AATACCAGAT AAACGCCACG CTTTACCTTT ACCCTGGTCC CGAAAGTGAA CCCATCCGCG CGGCCGCCGT GAAAAAACTG GAAGCATACA TCACGGCACA GCACCGGCTG GGGCGCGACA TCCGTCTGTC TGCCATTTAT GCCGCTTTGC ATGTGGAAGG CGTGCAGCGT GTCGAACTGG CTGCACCGCT GGCCGACATC GTGCTCAACA ATACGCAGGC GTCTTTCTGT ACCGAATACA GCGTCGTGAC CGGAGGCTCG GATGAGTGA
|
Protein sequence | MPIIDLNQLP APDVVEELDF ETILAERKAT LISLYPEDQQ EAVARTLTLE SEPLVKLLEE NAYRELIWRQ RVNEAARAVM LACAAGNDLD VIGANYNTTR LTITPADDST IPPTPAVMES DTDYRLRIQQ AFEGLSVAGS VGAYQYHGRS ADGRVADISV TSPSPACVTI SVLSRENNGV ASKDLLAVVR NALNGEDVRP VADRVTVQSA AIVEYQINAT LYLYPGPESE PIRAAAVKKL EAYITAQHRL GRDIRLSAIY AALHVEGVQR VELAAPLADI VLNNTQASFC TEYSVVTGGS DE
|
| |