Gene Sare_4844 details


Gene Information

Locus tag: Sare_4844
Symbol:
ID: 5707623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5500178
End bp: 5501806
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 74%
IMG OID: 641274240
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_001539585
Protein GI: 159040332
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
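
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in a single stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5500178, 5501806)
print(length, protein_length_aa(length))  # prints "1629 542"
```

Both values match the Gene Length and Protein Length fields of this record.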


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.487304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0043079
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGAGC TACTGCGTGG CATCGGCGTC AGCCCGGGGA GCGCAGCCGG CCCGGCGTAC 
CGGATGAGCC CACCACCACC ACCGCCGCCC GAGCCGGCCG CAGTGGTTGA TCCGGACGCC
GAGGTCGACC GGGCGGTGGC CGCGCTGAGT ACCGTGGCCG CGGACCTGAC CCGCCGTGCC
GAGGGTGCGG CAGCCCGGGC GGCGGCCGAT GTTCTGCGGG CACAGGCGAT GATGGCGCAG
GATCCGGAGC TGTCGGCCGC TGTGGTCGCG CAGGTGCGGG CCGGGGCCAG CGCACCGGTC
GCCGTCGACC GGGCACTGGC CGTGCACCGG GAGGCGTTCC TGGCCGCAGG GGGCTATCTC
GCCGAGCGCG TCACCGATCT GGACGACATT CGGGACCGGG TCGTCGCCGC CTGCCTGGGG
CTGCCGCCAC CCGGTATCCC CGATCCGGGC CACCCGTTTG TGCTGATCGC CCGTGACCTC
GCGCCGGCGG ACACCGCCGG CCTGGATCCG GAGCAGGTGC TGGCGCTGGT CACCGAGGAC
GGTGGGCCGA CCAGCCACAC CGCGATTCTG GCCCGAGCCG CTGGCCTACC GGCTGTGGTC
CGGTGCCCCG GTGCGATGGC CGTTGCCGAC GGGGTCGAGG TCACCGTCGA CGGCTCGACC
GGGCAGGTCG CGGTGGGTGT GGATCACGAC ACCGTCATCG CCACCCGGGT CTCCGAGCAG
CGGCGTCGGC GACGACTCGC CACGACGCGG GGGCCCGGCC GCACCGCCGA CGGCCACCCC
GTGGCCCTGT ACGGCAACAT CGGCTCGGCT GAGGATGTGG ACGGTGAGCT GGAAGGCGTC
GGCCTGTTCC GGACCGAACT GCTCTACCTG CATCGCACCG ATCCGCCCGG ACGCGACGAG
CAGGTAGCCG CCTACGCCGA GGTCTTCGCC GCGCTCCCCG GGCGCAGAGT CATCGTGCGG
ACCCTCGACG CCGGTGCCGA CAAGCCCCTG CCCTTCCTCG CGGCCGGTGA GGAACCGAAC
CCGGCGCTGG GCGTACGCGG CCTGCGGTTG GCCCGGCGGC GGGCGGATGT GCTCCATCTC
CAGCTCGAGG CGATCGCACA GGCGGCCCGG GACACGGCAG CCGAGGTGTG GGTGATGGCG
CCGATGGTGG CGACGGTCGC GGAGGCCGCC TGGTTCGCCG CCGCCTGCCG GGACGCCGGC
CTTCCCACAG CGGGAGCGAT GGTCGAGGTG CCGGCGGCAG CCCTGCGGGC CCGCTCGTTG
CTGTCGGTGG TGGATTTCCT CAGCATCGGC ACCAACGACC TGAGCCAGTA CACCTTCGCC
GCCGACCGGC AGTGTGGCGA CCTGGCCGAC CTGCTCGACC CGGCACAGCC CGCGTTGCTC
GAACTCATCT CCGGCTGTGC CGCCGCCGGC ATGGCCGCCG GCAAACCGGT CGGTGTCTGT
GGCGAGGCCG CGGCCGACCC GAGGATCGCG CCGGTGCTCG TCGGCCTTGG CGTGACCAGC
CTGTCCATGG CCCCACGGGC AGTGCCCGAC GTGCGGGAGG CGCTTGCCGC CCACACGCTC
GCCGACTGTC GGCAACTCGC CGCCGAGGCG TTGTCCGCGG CCGGCACCGC ACCGCTCACC
GTCCCCTGA
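
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any calculation. The following minimal sketch shows one way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input here.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as formatted in the record.
first_line = "ATGGCTGAGC TACTGCGTGG CATCGGCGTC AGCCCGGGGA GCGCAGCCGG CCCGGCGTAC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full sequence block to `clean_sequence` instead of the first line gives the whole-gene value.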
 
Protein sequence
MAELLRGIGV SPGSAAGPAY RMSPPPPPPP EPAAVVDPDA EVDRAVAALS TVAADLTRRA 
EGAAARAAAD VLRAQAMMAQ DPELSAAVVA QVRAGASAPV AVDRALAVHR EAFLAAGGYL
AERVTDLDDI RDRVVAACLG LPPPGIPDPG HPFVLIARDL APADTAGLDP EQVLALVTED
GGPTSHTAIL ARAAGLPAVV RCPGAMAVAD GVEVTVDGST GQVAVGVDHD TVIATRVSEQ
RRRRRLATTR GPGRTADGHP VALYGNIGSA EDVDGELEGV GLFRTELLYL HRTDPPGRDE
QVAAYAEVFA ALPGRRVIVR TLDAGADKPL PFLAAGEEPN PALGVRGLRL ARRRADVLHL
QLEAIAQAAR DTAAEVWVMA PMVATVAEAA WFAAACRDAG LPTAGAMVEV PAAALRARSL
LSVVDFLSIG TNDLSQYTFA ADRQCGDLAD LLDPAQPALL ELISGCAAAG MAAGKPVGVC
GEAAADPRIA PVLVGLGVTS LSMAPRAVPD VREALAAHTL ADCRQLAAEA LSAAGTAPLT
VP
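
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual NCBI ordering of bases (T, C, A, G). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene starts with ATG. The function name `translate` is hypothetical, and only the first 30 and last 9 bases of the gene are used as examples.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGCTGAGC TACTGCGTGG CATCGGCGTC"))  # prints "MAELLRGIGV"
print(translate("GTCCCCTGA"))                         # prints "VP"
```

These outputs agree with the first ten and last two residues of the protein sequence listed above.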