Gene Anae109_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1094 
Symbol 
ID5376389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1237443 
End bp1238693 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content70% 
IMG OID640842602 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_001378286 
Protein GI153003961 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.63976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.992757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCGA CCGCGAAGCT GAGGAGCGCG ACCCCGGAGG AGCAGTTCCA GGAGGTCACC 
CGCGCCGCGG TGGACCTGCA CGTCGAGAAG GATCTCCGCG CGCGGCTGCA GAAGGCCTAC
GACACGGGCG TGCCGCTGCG CGTGAAGGCG GGCTTCGATC CGACCGCCCC GGACCTGCAC
CTCGGCCACA CGGTGCTCCT CTCCCGGATG CGCCGCTTCC AGCAGTTCGG GCACACCGTC
ATCTTCCTCA TCGGCGACTT CACGGGGCTC ATCGGCGATC CCACCGGCCG CAACTCCACG
CGGCCGCCGC TCACGCCCGA GCAGATCGCG CAGAACGCGG AGACCTACAA GAAGCAGGTG
TTCAAGATCC TCGACCCGGC CGCGACGGAG GTCCGCTACA ACTCCGAGTG GCTCGGCGCG
ATGGGCTTCG CCGACGTCAT CCGGCTCGCC TCGCGCTACA CCGTCGCGCG CATGCTCGAG
CGCGACGACT TCAAGAAGCG CTACACCGGC AACGTCCCCA TCAGCGTCCA CGAGTTCCTC
TACCCGCTCG CGCAGGCGTA CGACTCGGTG GCGCTGAAGG CCGACGTGGA GCTCGGCTCC
TCCGACCAGC TCTTCAACCT GCTCGTCGGC CGCGCCATCA TGCCGGACTA CCAGCTCACC
CCGCAGATCG TGCTCACCGG CCCCATCCTC GAGGGGCTGG ACGCCAAGCT CGATCCGGAG
TCCCGGCGGA TCGTCGGCAA CAAGATGTCG AAGTCGCTCG GCAACTACGT GGGGGTCGCC
GAGGCGCCGG AGGAGCAGTT CGGAAAGCTC ATGAGCGTCA GCGACGACCT CATGTGGCGC
TACTACGAGC TCCTCTCCGA CCGCACCAGC GCGGAGATCG CGGCGCTGCG CGCCGGCCAC
CCGAAGGACG CGAAGATCGC GCTGGCCAAG GAGATCGTCA CCCGCTTCCA CGGCGCCGAC
GCGGCCACGC GCGCGGAGGA GCACTTCGCC CAGGTGCACG CCCGCCGCGA GGTGCCCGAC
GAGGTGGAGG AGCGCGCCGT CGCGCTCGAC GGCCAGGCCG CCCTGCCTCT CGCGCGGCTC
CTCGCCGACG CCAAGGTGGT CGCCTCGGGC AGCGAGGCGC GCCGGCTCAT CGCCCAGGGC
GGCGTCTCGG TGAACGGCGA GCGCGTCTCG GACGAGAAGG CGACGCTGGG CGCGGGCGAG
TGGCTCGTGA AGGTGGGCAA GCGCCGCTTC GTGCGGCTCA AGCTGGCCTA G
 
Protein sequence
MDSTAKLRSA TPEEQFQEVT RAAVDLHVEK DLRARLQKAY DTGVPLRVKA GFDPTAPDLH 
LGHTVLLSRM RRFQQFGHTV IFLIGDFTGL IGDPTGRNST RPPLTPEQIA QNAETYKKQV
FKILDPAATE VRYNSEWLGA MGFADVIRLA SRYTVARMLE RDDFKKRYTG NVPISVHEFL
YPLAQAYDSV ALKADVELGS SDQLFNLLVG RAIMPDYQLT PQIVLTGPIL EGLDAKLDPE
SRRIVGNKMS KSLGNYVGVA EAPEEQFGKL MSVSDDLMWR YYELLSDRTS AEIAALRAGH
PKDAKIALAK EIVTRFHGAD AATRAEEHFA QVHARREVPD EVEERAVALD GQAALPLARL
LADAKVVASG SEARRLIAQG GVSVNGERVS DEKATLGAGE WLVKVGKRRF VRLKLA