Gene Clim_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0152 
SymbolpheS 
ID6356122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp169656 
End bp170681 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content52% 
IMG OID642667779 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_001942230 
Protein GI189345701 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00308868 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAG CCATTCGCAG TCTGCAGCAG GAAATATCTG ACTTCGGGAT CCAAAGCAAT 
AAGGATCTCG AAGCGTTCAG ACTTAAATAT ACTGTTCGCA AAGGCCTCAT TGCCGATCTT
TTCGGACAGC TTAAAACGGT TGCTCCGGAT GAACGGCCAC GAATAGGCCA ACTGCTCAAT
ACGCTCAAAA AAAATGCCGA CGAAAAGCAG ACGGCAGCAG AAGCTGTCTT CTCGGCACAA
GCCGCCCGAA AAGCTCCCGC TCTTGATCTT ACCCTGCCGG GAAGACGGCA TTACACCGGC
AGCGAACATC CAGTGCAGAA GGTACTGGGC GACATGAAGC AGATCTTTCA CGCAATGGGC
TTCAGCATTG CAACCGGACC GGAACTTGAG CTCGACCGGT ATAACTTCGA CCTGCTGAAC
TTTCCGCCTG ACCATCCCGC TCGTGATATG CAGGATACCT TTTTTATCAC AAGGGGCAAC
CCTTCCGGCG ATGTGCTGCT GAGAACCCAC ACCTCGCCTG TACAGGTAAG GGTCATGCTC
GACAACCCTC CGCCCATACG CGTCATCTGC CCCGGTAAAG TCTATCGAAA CGAAGCCATC
AGCTCCCGGA GCTATTGCGT CTTCCATCAG CTTGAAGGGC TCTATATCGA TAAAAATGTC
TCTTTTGCCG ATCTGAAAGC CACGATCTTT TCATTTGCCC GACAGATGTT CGGCAAAGAT
GTTAAACTCC GTTTCAGACC GAGCTTTTTC CCCTTTACCG AACCCTCTGC CGAGGTCGAT
GTAACCTGCT ACCTCTGTGG GGGAAAAGGG TGCCGCGTCT GCAAGAAATC GGGATGGCTG
GAAATAATGG GTTGCGGCAT GGTACATCCG AACGTCATGC GCGACTGCGG TATCGATCCT
GAAGTCTGGT CCGGTTACGC TTTCGGCATG GGTGTTGACC GGACGGTACT GCTCCGTTAT
AAAATAGACG ATATTCGCCT TCTTTTCGAA AACGATATCC GCATGCTTCG CCAGTTCCCG
GCCTGA
 
Protein sequence
MEEAIRSLQQ EISDFGIQSN KDLEAFRLKY TVRKGLIADL FGQLKTVAPD ERPRIGQLLN 
TLKKNADEKQ TAAEAVFSAQ AARKAPALDL TLPGRRHYTG SEHPVQKVLG DMKQIFHAMG
FSIATGPELE LDRYNFDLLN FPPDHPARDM QDTFFITRGN PSGDVLLRTH TSPVQVRVML
DNPPPIRVIC PGKVYRNEAI SSRSYCVFHQ LEGLYIDKNV SFADLKATIF SFARQMFGKD
VKLRFRPSFF PFTEPSAEVD VTCYLCGGKG CRVCKKSGWL EIMGCGMVHP NVMRDCGIDP
EVWSGYAFGM GVDRTVLLRY KIDDIRLLFE NDIRMLRQFP A