Gene Sala_0659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0659 
Symbol 
ID4082749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp669359 
End bp670435 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content68% 
IMG OID638009018 
Productchorismate synthase 
Protein accessionYP_615713 
Protein GI103486152 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.463158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.312815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCA ACAGCTTCGG ACGCGTCCTG CGCTTTACGA CCTGGGGGGA AAGTCACGGG 
CCCGCGCTCG GTGCGGTCGT CGATGGCTGC CCGCCGCGGC TGTCTCTGTC CGAAGCCGAC
ATCCAGCCCT TTCTCGACAA GCGTCGCCCC GGCCAGTCGC GCCACACCAC GCAGCGGCAG
GAGCCCGACC AGGTGCGTAT CCTGTCGGGG GTGTTCGAAG GCAAGACGAC AGGCACGCCG
ATCAGCCTGA TGATCGAGAA TGTCGATCAG CGATCGAAGG ATTATGGCGA GATCGCGCAA
GCCTGGCGTC CCGGCCACGC CGATTATGCC TATGACGCCA AATATGGCAT CCGCGATTAT
CGCGGCGGCG GCCGGTCGAG CGCGCGCGAG ACTGCGGCGC GTGTCGCGGC GGGCGCTGTC
GCGCGGCTGG TGATCCCGGA GGTGCAGATC CACGCCTGGG TCGCGGAGAT CGGCGGCGAT
GCGATCGATC CGGCGAACTT CGACCTCGAA GAAATCGATC GCAATCCCTT TTTCTGTCCC
GACCCCGCGG CGGCGCAGCG CTGGGAGGCG CTGATGGATT CCGCGCGCAA GGCGGGAAGC
TCGCTGGGCG CGGTCATCGA ATGCGCCGCG AGCGGCGTCC CCGCAGGCTG GGGCGCGCCT
GTCTATGCCA AGCTCGACAG TGACCTTGCG GCCGCGATGA TGGGGATCAA CGCGGTGAAG
GGCGTCGAGA TCGGCGCGGG ATTCGGCGTG GCGCGGCTGC GCGGCGAGGA AAATGCCGAT
CCGATGCGCC CCGCCAGCGA CGGCAGCAAC CGCCCCGATT TCCTGTCGAA CAATGCCGGC
GGCATCGCGG GCGGCATTTC GACCGGGCAG CCGGTCGTCG TGCGCGTCGC CTTCAAGCCG
ACGAGTTCGA TCCTGACCCC GGTGCCGACG GTGAACAAGG CGGGCGAGGC GACCGACATC
GTGACCAGGG GCCGCCACGA CCCCTGCGTC GGCATCCGCG GCGCGCCGGT GGTCGAAGCG
ATGATGGCGC TGACACTCGC CGACCACAAG CTGCTCCACC GCGCGCAGTG CGGATGA
 
Protein sequence
MSFNSFGRVL RFTTWGESHG PALGAVVDGC PPRLSLSEAD IQPFLDKRRP GQSRHTTQRQ 
EPDQVRILSG VFEGKTTGTP ISLMIENVDQ RSKDYGEIAQ AWRPGHADYA YDAKYGIRDY
RGGGRSSARE TAARVAAGAV ARLVIPEVQI HAWVAEIGGD AIDPANFDLE EIDRNPFFCP
DPAAAQRWEA LMDSARKAGS SLGAVIECAA SGVPAGWGAP VYAKLDSDLA AAMMGINAVK
GVEIGAGFGV ARLRGEENAD PMRPASDGSN RPDFLSNNAG GIAGGISTGQ PVVVRVAFKP
TSSILTPVPT VNKAGEATDI VTRGRHDPCV GIRGAPVVEA MMALTLADHK LLHRAQCG