Gene Sala_1493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1493 
Symbol 
ID4081170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1555610 
End bp1556950 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID638009859 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_616539 
Protein GI103486978 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.42537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC AAACCGCCAC ACCGCGCAGC TTTTCTGCCT CCGTTCCGTT GAAGGGCAGG 
ATCGCCATTC CGGGCGACAA GAGCATCTCG CACCGATCGC TGATGCTGTC GGCGCTGGCC
GTCGGCGAAA GCCGCGTCGC CGGCCTGCTC GAAGGGCATG ATGTCCTCGC GACCGCCGCC
GCGATGCGCG CCATGGGGGC CGATATCGCG CGCCGCGACG ACGGCGAATG GCGCATCCAC
GGCGTCGGCG TCGGCGGCTT GCTCCAGCCG CGGGGCGCGC TCGACATGGG GAACAGCGGC
ACGTCGACAC GTCTGCTGAT GGGGCTCGTC GCGAGCCACC CGATCACCGC CACTTTCGTC
GGCGACGCCA GCCTGTCGGG TCGCCCGATG GGGCGCGTCA TCGATCCGCT GACCCAGATG
GGCGCCGACA TCAGCGCCTC GCCGGGCGCC AGGGGGGCAA AAACTCTGCC GCTGATGGTC
CGCGGCCTCG CGCCCGCCAT TCCCCTCTCC TACCGCCTGC CGATGGCGTC GGCGCAGGTG
AAGAGCGCGA TCCTTCTCGC CGGACTCAAT ACGCCCGGCG TCACCGAAGT CATCGAGCCG
GTGCCCACGC GCGACCACAG CGAGCGGATG CTCGGCGCCT TTGGCGCCGA TCTGACCGTC
GACATCGACG CGGGCGGCAC GCGCCATATC CGTATCCGCG GCGAAGCCGA TCTCAAGCCG
CAGGCGATCA TCGTCCCCGG TGATCCCTCC TCGGCCGCCT TCTTTATCGT TGCGGCGCTC
ATCGTGCCCG GTTCGGACGT CACCATCGCC AACGTCGGTC TCAATCCGAC GCGCGCCGGG
CTGGTCGAGG TTCTGAAGGC GATGGGCGGC GACATCGAAC TGCTCGACCG GCGCGAAATC
GGCGGCGAAC CCGTCGCCGA CCTGCGCGTG CGCCACAGCG TGCTCAAAGG CATCGAGGTC
GACCCGGCGG TTGCGCCGAG CATGATCGAT GAGTTTCCAG TCCTCTTCGT TGCCGCGACG
CTCGCCGAAG GCCGCACGGT GACCACGGGG CTCGATGAAC TGCGCGTCAA GGAAAGCGAC
CGCCTTGCCG TCATGGCGAC CGGGCTCAAG GCCATCGGCG CGCGTGTCGA GGAAAGCCAA
GACGGCCTTG TCATTGATGG CACCGGCGGC GATCCGCTAG CCGGCGGCGC GACCATCGCC
GGCCATCTCG ATCATCGCAT CTGCATGAGC TTCGCAATCG CGGGGCTTGT CAGCAAGGCG
CCGGTGACGG TCGACGACAT CGCCCCCGTC GCAACGAGCT TCCCCAATTT CGAGGCATTG
CTTGCGGGTT TGCAACAATG A
 
Protein sequence
MTDQTATPRS FSASVPLKGR IAIPGDKSIS HRSLMLSALA VGESRVAGLL EGHDVLATAA 
AMRAMGADIA RRDDGEWRIH GVGVGGLLQP RGALDMGNSG TSTRLLMGLV ASHPITATFV
GDASLSGRPM GRVIDPLTQM GADISASPGA RGAKTLPLMV RGLAPAIPLS YRLPMASAQV
KSAILLAGLN TPGVTEVIEP VPTRDHSERM LGAFGADLTV DIDAGGTRHI RIRGEADLKP
QAIIVPGDPS SAAFFIVAAL IVPGSDVTIA NVGLNPTRAG LVEVLKAMGG DIELLDRREI
GGEPVADLRV RHSVLKGIEV DPAVAPSMID EFPVLFVAAT LAEGRTVTTG LDELRVKESD
RLAVMATGLK AIGARVEESQ DGLVIDGTGG DPLAGGATIA GHLDHRICMS FAIAGLVSKA
PVTVDDIAPV ATSFPNFEAL LAGLQQ