Gene Saro_0688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0688 
SymbolpheS 
ID3918113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp725769 
End bp726869 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID640443419 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_495969 
Protein GI87198712 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.406637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGC TCGATCAACA GCAGGCCGAA ACGCTTTCGG CCATCGCGCA AGCGGCCACG 
CCCGAAGCAG TGGAGGCGAT CCGCGTCTCG GCGCTCGGCA AGCAGGGGTG GGTCAGCGCC
CTGCTCAAGT CGCTGGGCGG AATGACCCCG GAACAGCGCC AGAGCGAAGG CCCGAAAATC
CACGCCGCGC GTGAGGCCGT GACAACAGCC TTGGCCGAAC GCAAGTCTGC GCTCGAAGGG
GCAGCGCTCG AAGCGCGCCT CGCTGCCGAA ACGGTGGACC TCTCGCTCCC CGCGCCCGAC
CTGGCGAAAG GGTCGGTCCA CCCCGTGTCG CAGGTCATGG ACGAGCTGGC CGAGATCTTC
GCGGACATGG GCTTCGCCGT GGCCAGCGGG CCGGAGATCG AGGACGACTG GCACAACTTC
ACCGCGCTCA ACATGCCGGA AACCCACCCG GCGCGGGCGA TGCACGATAC CTTCTACTTC
CCCGACAAGG ACGCGGAAGG CCGCTCGATG CTGTTGCGCA CGCACACCTC GCCGGTGCAG
ATCCGCTCGA TGCTGAAAGC GGGCGCGCCC TTGCGCATCA TTGCGCCCGG CCGTGTCTAC
CGCTCGGATT CCGACGCGAC GCACACGCCG ATGTTCCACC AGATCGAAGG TCTCGTCATC
GACAAGGGCA TTCACCTCGG TCACCTCAAG TGGACGCTGG AGACCTTCCT CAAGGCCTTC
TTCGAACGTG ACGACATCGT CCTGCGCCTG CGACCCAGCT ACTTCCCCTT CACCGAACCA
TCGGTCGAAG TCGACGTCGG CTACACACTG GTGAACGGAA AGCGAGTCGT CGGCGGCAGT
GGCGATGCGG ACAACGGCGG CTGGATGGAA GTTCTGGGCT CCGGCATGGT CAACCGCAAG
GTCATCGAAT TCGGCGGCCT CGATCCCGAT GAATGGCAGG GCTTCGCTTT CGGCACCGGT
GTCGATCGCC TTGCCATGCT CAAGTACGGC ATGGACGACC TTCGCGCCTT CTTCGATGGC
GATGCGCGGT GGCTCGGCCA CTACGGTTTC GGCGCACTCG ACGTGCCGAC CCTTTCCGGC
GGTGTTGGAG TACGCTCGTG A
 
Protein sequence
MEQLDQQQAE TLSAIAQAAT PEAVEAIRVS ALGKQGWVSA LLKSLGGMTP EQRQSEGPKI 
HAAREAVTTA LAERKSALEG AALEARLAAE TVDLSLPAPD LAKGSVHPVS QVMDELAEIF
ADMGFAVASG PEIEDDWHNF TALNMPETHP ARAMHDTFYF PDKDAEGRSM LLRTHTSPVQ
IRSMLKAGAP LRIIAPGRVY RSDSDATHTP MFHQIEGLVI DKGIHLGHLK WTLETFLKAF
FERDDIVLRL RPSYFPFTEP SVEVDVGYTL VNGKRVVGGS GDADNGGWME VLGSGMVNRK
VIEFGGLDPD EWQGFAFGTG VDRLAMLKYG MDDLRAFFDG DARWLGHYGF GALDVPTLSG
GVGVRS