Gene Information |
Locus tag | EcSMS35_3240 |
Symbol | gspK |
ID | 6142677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3312415 |
End bp | 3313392 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618070 |
Product | general secretion pathway protein GspK |
Protein accession | YP_001745220 |
Protein GI | 170684190 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3156] Type II secretory pathway, component PulK |
TIGRFAM ID | |
|
|
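The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3312415, 3313392)
print(length)                  # 978
print(protein_length(length))  # 325
```

Both values agree with the Gene Length (978 bp) and Protein Length (325 aa) fields listed in the record.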
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.656676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.626096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACCT TGCCACCAAA ACGCGGAATG GCACTGGTCG TGGTGCTGGT ATTGCTGGCG GTTATGATGC TGGTGACCAT CACGCTTTCC GGGCGGATGC AGCAACAACT TGGGCGAACG CGCAGCCAGC AGGAGTACCA GCAGGCGCTG TGGTACAGCG CCAGTGCAGA AAGCCTGGCG CTGAGCGCGC TCAGTCTGAG CCTGAAAAAT GAAAAGCGTG TGCATCTGGC ACAACCGTGG GCTTCTGGCC CGCGTTTTTT CCCACTGCCG CAGGGGCAAA TTGCCGTCAC TCTGCGTGAC GCACAGGCCT GCTTTAACCT GAATGCCCTC GCTCAGCCGA CGACGACGTC GCGTCCGCTC GCGGTACAAC AACTGATTGC CCTGATCTCG CGCCTCGATG TGCCTGCTTA TCGGGCCGAA CTGATAGCCG AAAGCCTGTG GGAGTTTATT GACGAAGACC GCAGTGTGCA GACGCGTCTG GGTCGTGAAG ACAGCGAGTA TCTCGCCCGC TCGGTGCCGT TCTACGCCGC TAATCAACCG CTGGCTGATA TCAGCGAGAT GCGCGTGGTG CAGGGAATGG ACGCTGGGCT TTATCAAAAA CTGAAACCGC TGGTCTGTGC GCTGCCGATG GCCCGCCAGC AAATCAACAT CAATACATTA GATGTCACGC AAAGTGTGAT TCTTGAGGCG CTGTTTGACC CGTGGTTAAG CCCTGTTCAG GCGCGGGCGT TATTACAACA ACGTCCGGCG AAGGGCTGGG AAGATGTCGA TCAGTTTCTT GCTCAGCCGC TACTTGCAGA CGTCGATGAG CGTACTAAAA AACAGCTAAA AACCATCCTG AGCGTGGACA GCAATTACTT CTGGCTGCGT TCAGATATCA CCGTGAATGA GATTGAACTG ACGATGAATT CGTTAATTGT CCGCATGGGC CCACAACACT TTTCTGTTCT CTGGCATCAG ACAGGAGAAA GTGAGTGA
|
Protein sequence | MITLPPKRGM ALVVVLVLLA VMMLVTITLS GRMQQQLGRT RSQQEYQQAL WYSASAESLA LSALSLSLKN EKRVHLAQPW ASGPRFFPLP QGQIAVTLRD AQACFNLNAL AQPTTTSRPL AVQQLIALIS RLDVPAYRAE LIAESLWEFI DEDRSVQTRL GREDSEYLAR SVPFYAANQP LADISEMRVV QGMDAGLYQK LKPLVCALPM ARQQININTL DVTQSVILEA LFDPWLSPVQ ARALLQQRPA KGWEDVDQFL AQPLLADVDE RTKKQLKTIL SVDSNYFWLR SDITVNEIEL TMNSLIVRMG PQHFSVLWHQ TGESE
|
| |
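The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be cleaned and checked against the protein, the Python below strips the spacing, computes a GC fraction, and translates codons. The helper names are hypothetical. The codon map is the standard assignment, which translation table 11 shares for amino acids (the tables differ in which codons may act as start codons). Only the first three blocks of the gene sequence are used, to keep the example short.

```python
# Standard codon-to-amino-acid assignment, built in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
prefix = clean("ATGATCACCT TGCCACCAAA ACGCGGAATG")
print(translate(prefix))  # MITLPPKRGM
```

The translated prefix matches the first block of the Protein sequence field. Running `gc_fraction` and `translate` on the full cleaned gene sequence is the way to compare against the GC content and Protein Length fields.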