Gene EcSMS35_3240 details


Gene Information

Locus tag: EcSMS35_3240
Symbol: gspK
ID: 6142677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3312415
End bp: 3313392
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 56%
IMG OID: 641618070
Product: general secretion pathway protein GspK
Protein accession: YP_001745220
Protein GI: 170684190
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3156] Type II secretory pathway, component PulK
TIGRFAM ID:
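
The length fields above are consistent with the coordinates: with 1-based, inclusive coordinates the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the terminal stop codon. A minimal sketch of that check in Python, assuming inclusive coordinates and an unspliced CDS that ends in a stop codon (the function names are illustrative and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3312415, 3313392)
print(length, protein_length(length))  # 978 325
```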


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.656676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.626096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCT TGCCACCAAA ACGCGGAATG GCACTGGTCG TGGTGCTGGT ATTGCTGGCG 
GTTATGATGC TGGTGACCAT CACGCTTTCC GGGCGGATGC AGCAACAACT TGGGCGAACG
CGCAGCCAGC AGGAGTACCA GCAGGCGCTG TGGTACAGCG CCAGTGCAGA AAGCCTGGCG
CTGAGCGCGC TCAGTCTGAG CCTGAAAAAT GAAAAGCGTG TGCATCTGGC ACAACCGTGG
GCTTCTGGCC CGCGTTTTTT CCCACTGCCG CAGGGGCAAA TTGCCGTCAC TCTGCGTGAC
GCACAGGCCT GCTTTAACCT GAATGCCCTC GCTCAGCCGA CGACGACGTC GCGTCCGCTC
GCGGTACAAC AACTGATTGC CCTGATCTCG CGCCTCGATG TGCCTGCTTA TCGGGCCGAA
CTGATAGCCG AAAGCCTGTG GGAGTTTATT GACGAAGACC GCAGTGTGCA GACGCGTCTG
GGTCGTGAAG ACAGCGAGTA TCTCGCCCGC TCGGTGCCGT TCTACGCCGC TAATCAACCG
CTGGCTGATA TCAGCGAGAT GCGCGTGGTG CAGGGAATGG ACGCTGGGCT TTATCAAAAA
CTGAAACCGC TGGTCTGTGC GCTGCCGATG GCCCGCCAGC AAATCAACAT CAATACATTA
GATGTCACGC AAAGTGTGAT TCTTGAGGCG CTGTTTGACC CGTGGTTAAG CCCTGTTCAG
GCGCGGGCGT TATTACAACA ACGTCCGGCG AAGGGCTGGG AAGATGTCGA TCAGTTTCTT
GCTCAGCCGC TACTTGCAGA CGTCGATGAG CGTACTAAAA AACAGCTAAA AACCATCCTG
AGCGTGGACA GCAATTACTT CTGGCTGCGT TCAGATATCA CCGTGAATGA GATTGAACTG
ACGATGAATT CGTTAATTGT CCGCATGGGC CCACAACACT TTTCTGTTCT CTGGCATCAG
ACAGGAGAAA GTGAGTGA
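
The GC content field can be recomputed from the gene sequence as the share of G and C bases. The sketch below is one plausible way to do that in Python. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and the helper name `gc_content` is illustrative. How the source database rounds its figure is not stated, so no expected value is given.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# First 60 bp of the gene sequence, copied from the record.
first_line = "ATGATCACCT TGCCACCAAA ACGCGGAATG GCACTGGTCG TGGTGCTGGT ATTGCTGGCG"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bp")
```

Passing the full 978 bp sequence instead of `first_line` gives the figure for the whole gene.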
 
Protein sequence
MITLPPKRGM ALVVVLVLLA VMMLVTITLS GRMQQQLGRT RSQQEYQQAL WYSASAESLA 
LSALSLSLKN EKRVHLAQPW ASGPRFFPLP QGQIAVTLRD AQACFNLNAL AQPTTTSRPL
AVQQLIALIS RLDVPAYRAE LIAESLWEFI DEDRSVQTRL GREDSEYLAR SVPFYAANQP
LADISEMRVV QGMDAGLYQK LKPLVCALPM ARQQININTL DVTQSVILEA LFDPWLSPVQ
ARALLQQRPA KGWEDVDQFL AQPLLADVDE RTKKQLKTIL SVDSNYFWLR SDITVNEIEL
TMNSLIVRMG PQHFSVLWHQ TGESE
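
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The sketch below is a minimal stdlib-only version. It assumes the sequence is given 5' to 3' on the coding strand and starts with ATG, it does not handle alternative start codons, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order mapped to one-letter amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bp of the gene sequence from the record.
print(translate("ATGATCACCT TGCCACCAAA ACGCGGAATG"))  # MITLPPKRGM
```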