Gene Athe_0572 details

Gene Information

Locus tag: Athe_0572
Symbol:
ID: 7406913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 644828
End bp: 646546
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 38%
IMG OID: 643714955
Product: prolyl-tRNA synthetase
Protein accession: YP_002572471
Protein GI: 222528589
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
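
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 644828
end_bp = 646546

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1719 bp
print(protein_length, "aa")  # 572 aa
```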


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00211854
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTTT CAGAGCTTTT TATGCCAACT TTGAAAGAGA CACCTTCAGA TGCAGAGATA 
GAATCACACA AGCTTATGCT AAGGTCTGGT TTTATGAGAC AGCTTTCCTC TGGCATTTAT
GTTTATCTTC CACTTGGTTA CAGAGTTTTA AGAAAGATAG AAAACATTGT AAGAGAAGAG
ATGGATAGAA GCGGTGCACA AGAGGTTCAT ATGTCTGCAC TTATGCCAAA AGAACTTTGG
GAAGAGTCGG GTAGGTGGGC TGTATTTGGC CCTGAAATGT TTAGGATAAA AGATAGAAAC
GAAAGGGAAT ACTGTTTGGG TCCCACTCAC GAAGAAGCAT TTACTTATAT AGTCAGAAAC
GAGATTTCAT CTTACAGGGA CCTTCCTAAA ATTTTGTATC AAATCCAAAC AAAGTTTCGT
GATGAGAGAA GACCGCGGTT TGGTGTTATG CGTTGCAGAG AGTTTACAAT GAAGGATGCT
TATTCTTTTG ACATTGATGA GAACGGCTTG GACATATCAT ACCAAAAGAT GTATGATGCA
TATGTGAGGA TATTCAAAAG ATGCGGGCTT GATGTAAAGA TTGTCGAGGC AGACACAGGT
GCAATGGGCG GGGCAAGCTC ACACGAGTTT ATGGTTCCAT CATCTGTTGG TGAGGCAGAG
ATTGCATATT GTAAAGCTTG CGGATATGCT GCAAACTTGG AAAAGGCAGA GTGTTTGGAT
GAGCCTGTTG AAAACAAAGA GGAGCCAAAA GAAAAACAGG AAGTCTACAC TCCAAATGTG
AGAACAATTG AGGAGCTTGT GAGTTTTCTT GGCATCGACA GCACAAGGTT TGTAAAAACA
ATGATTTACA AGGCAGATGA CAAGTTTGTT GCAGTGCTGG TAAGGGGAGA TAGAGAGGTA
AATGAGACAA AGCTAAAAAA TCTTTTAAAA GCTACAGACC TTGAACTTGC ATCGGCAGAG
GATGTAGAGA AAATTACAGG TGCAAAAGTA GGATTTGCCG GTCCAATTGG TCTTTCGATA
GATGTGTACG CAGACAATGA AGTAAAATAT CTCAAAAACT TTGTGGTGGG TGCAAATAAG
ACAGATTATC ATATAAAGAA TGTTAATCTT TCAGATTTTA AAGTAACAAA GTTTACAGAC
CTCAGGAATA TAACCCAAGA CGACCTTTGT CCAAAATGTC GCTCTCAGAA GGTGACAATT
GAAAGAGGAA TTGAGGTTGG ACATATATTT AAGCTTGGCA CTAAATATAC ACAGGCATTT
AACTGCGTGT ACACAGATGA AAAAGGCGAA AAGAAGCTCA TGATAATGGG ATGTTATGGT
ATTGGTATCA ACAGGACAGC TGCAGCTATC ATTGAACAGA TGCACGATGA AGATGGTATA
ATTTGGCCAA TTACAGTTGC ACCGTATGAG GTAATTATTG TGCCTGTAAA TGTAAAAGAT
GAAAATCAGA AAAAGATTGC ATTTGAGATT TATGAAAACC TTCAGAGAAA TGGAGTTGAG
GTTTTGATTG ATGACAGAGA TGAAAGAGCA GGTGTTAAGT TCAAGGATGC AGATTTGATA
GGAATTCCGT TTAGAGTTAC TGTTGGAAAG AAAATATCAG AGGGAAAACT TGAGATTAGA
AATAGAAGAA CAAAGGAATC TTTTGAAGTT GAAATTGAGA AAGCAATAGA GGTTGTAATT
AATCTAATCA GGGAAGAAAA AGCAAAATAC CAAATATAA
 
Protein sequence
MKVSELFMPT LKETPSDAEI ESHKLMLRSG FMRQLSSGIY VYLPLGYRVL RKIENIVREE 
MDRSGAQEVH MSALMPKELW EESGRWAVFG PEMFRIKDRN EREYCLGPTH EEAFTYIVRN
EISSYRDLPK ILYQIQTKFR DERRPRFGVM RCREFTMKDA YSFDIDENGL DISYQKMYDA
YVRIFKRCGL DVKIVEADTG AMGGASSHEF MVPSSVGEAE IAYCKACGYA ANLEKAECLD
EPVENKEEPK EKQEVYTPNV RTIEELVSFL GIDSTRFVKT MIYKADDKFV AVLVRGDREV
NETKLKNLLK ATDLELASAE DVEKITGAKV GFAGPIGLSI DVYADNEVKY LKNFVVGANK
TDYHIKNVNL SDFKVTKFTD LRNITQDDLC PKCRSQKVTI ERGIEVGHIF KLGTKYTQAF
NCVYTDEKGE KKLMIMGCYG IGINRTAAAI IEQMHDEDGI IWPITVAPYE VIIVPVNVKD
ENQKKIAFEI YENLQRNGVE VLIDDRDERA GVKFKDADLI GIPFRVTVGK KISEGKLEIR
NRRTKESFEV EIEKAIEVVI NLIREEKAKY QI
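
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the whitespace between 10-character blocks is purely cosmetic and uses the codon assignments of translation table 11, which are the same as those of the standard code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in TCAG order (table 11 shares them with the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spacing and line breaks used to display the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = clean("ATGAAAGTTT CAGAGCTTTT TATGCCAACT TTGAAAGAGA CACCTTCAGA TGCAGAGATA")
last_line = clean("AATCTAATCA GGGAAGAAAA AGCAAAATAC CAAATATAA")

print(translate(first_line))  # MKVSELFMPTLKETPSDAEI
print(translate(last_line))   # NLIREEKAKYQI*
print(round(gc_fraction(first_line), 2))
```

Applying `clean` and `translate` to the full gene sequence and dropping the trailing `*` should give the protein sequence listed above. Applying `gc_fraction` to the full sequence gives a value to compare with the GC content reported in the record.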