Gene EcHS_A3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3131 
SymbolgspK 
ID5593643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3141222 
End bp3142199 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content56% 
IMG OID640922250 
Productgeneral secretion pathway protein K 
Protein accessionYP_001459749 
Protein GI157162431 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3156] Type II secretory pathway, component PulK 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones88 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACCT CACCACCAAA ACGCGGAATG GCACTGGTCG TGGTGCTGGT ATTGCTGGCA 
GTTATGATGC TGGTAACCAT CACGCTTTCC GGGCGGATGC AGCAGCAACT TGGGCGAACG
CGCAGCCAGC AGGAGTACCA GCAGGCGCTG TGGTACAGCG CCAGTGCAGA AAGCCTGGCG
CTGAGCGCGC TCAGTCTGAG CCTGAAAAAT GAAAAGCGCG TGCATCTGGA ACAGCCGTGG
GCTTCCGGCC CTCGTTTTTT CCCACTGCCG CAGGGGCAAA TCGCCGTCAC TCTGCGTGAC
GCACAGGCCT GCTTTAACCT GAATGCCCTC GCTCAGCCCA CAACGGCGTC GCGTCCGCTC
GCGGTACAAC AACTGATTGC CCTGATCACG CGCCTGGATG TGCCTGCTTA TCGGGCCGAA
CTGATAGCCG AAAGCCTGTG GGAGTTTATT GATGAAGACC GCAGCGTGCA GACGCGTCTG
GGCCGTGAAG ACAGCGAATA TCTCGCCCGC TCGGTGCCTT TCTACGCCGC CAATCAACCG
CTGGCCGATA TCAGCGAGAT GCGCGTGGTG CAGGGAATGG ACGCCGGGCT TTATCAAAAA
CTGAAACCGC TGGTCTGTGC GCTGCCGATG ACCCGCCAGC AAATCAACAT CAATACTTTA
GACGTCACGC AAAGTGTGAT TCTTGAGGCG CTGTTTGACC CGTGGTTAAG CCCTGTTCAG
GCGCGGGCGT TATTACAACA ACGTCCGGCG AAGGGCTGGG AAGATGTCGA TCAGTTTCTT
GCACAGCCGC TACTTGCTGA CGTCGATGAG CGTACTAAAA AACAGCTAAA AACCGTCCTG
AGCGTGGACA GCAATTACTT CTGGCTGCGT TCAGATATCA CCGTGAATGA GATTGAACTG
ACGATGAACT CGTTAATTGT CCGCATGGGC CCACAACACT TTTCGGTTCT CTGGCATCAG
ACAGGAGAAA GTGAGTGA
 
Protein sequence
MITSPPKRGM ALVVVLVLLA VMMLVTITLS GRMQQQLGRT RSQQEYQQAL WYSASAESLA 
LSALSLSLKN EKRVHLEQPW ASGPRFFPLP QGQIAVTLRD AQACFNLNAL AQPTTASRPL
AVQQLIALIT RLDVPAYRAE LIAESLWEFI DEDRSVQTRL GREDSEYLAR SVPFYAANQP
LADISEMRVV QGMDAGLYQK LKPLVCALPM TRQQININTL DVTQSVILEA LFDPWLSPVQ
ARALLQQRPA KGWEDVDQFL AQPLLADVDE RTKKQLKTVL SVDSNYFWLR SDITVNEIEL
TMNSLIVRMG PQHFSVLWHQ TGESE