Gene Athe_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1410 
Symbol 
ID7409153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1494646 
End bp1495950 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content40% 
IMG OID643715773 
Productglycyl-tRNA synthetase 
Protein accessionYP_002573281 
Protein GI222529399 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0423] Glycyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00389] glycyl-tRNA synthetase, dimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000029424 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGACAA TGGATGAAAT AGTTGCCCTT TGCAAACGTC GTGGATTTAT ATTCCAATCA 
AGCGAGATAT ATGGTGGACT TAATAGCTGC TGGGACTATG GTCCTCTTGG TGTAGAGATG
AAAAATAACA TAAAAAGACT GTGGTGGAAA GCAAACGTCC AGCTCAGAGA CGACGTTGTG
GGGCTTGACT CGAGCATCTT GATGAACCCA AAGGTGTGGG AAGCAAGCGG ACACTTGAGC
AATTTTTCTG ACCCTATGGC TGACTGCAAG CTGTGCAAAA AAAGATGGAG AGTAGACCAG
TTACAAGAGT ATAAGTGTCC TGAATGCGGC GGTGAACTTA CCGAGGCAAG GATGTTCAAC
CTTATGTTCA AAACATTTAT GGGACCTGTG GAGGACGAGT CGGCAGTAGT ATATTTGAGA
CCTGAAACAG CACAAGGTAT TTTTGTAAAC TTTGTTAATG TCCAGCAGAC CATGAGAAAA
AAGATTCCTT TTGGAATTGC TCAGATTGGT AAGTCATTCA GAAACGAAAT CACTCCTGGT
AACTTTATTT TCAGGACAAG AGAGTTTGAA CAGATGGAAA TAGAGTATTT TGTAAAGCCA
GGGACTGATG AGTACTGGCA CAAACACTGG ATTGAGCAAA GGATAAACTG GTATTACAAT
CTGGGAATAA GAAAGGAAAA CTTAAGAGTT CGTGAGCACG GCAAGGACGA GCTTGCACAC
TATGCAAAAG CATGTGTGGA CATTGAATAT TTATTTCCGA TGGGATGGTC TGAACTTGAA
GGTATTGCAA ACAGGACAGA CTTTGACCTA ACACAGCATC AAAAATACAG TGGCGAAAAT
TTAACATACT TTGACGATGA GACAAAACAA AGGTATATAC CATATGTAAT TGAGCCATCT
GCAGGCGTGG ACAGATCTTT ACTTGCATTT TTAATTGATG CATATGAATA CCAGCAGATA
GATAAAGATG ACTTTAGAGT GGTACTTCAC CTTCATCCTG CGATTTCGCC TGTAAAAGCT
GCTGTGTTCC CGCTTATGAA AAAAGAAGAG CTTGTAAAAA AAGCAAGGGA AATTTACAAT
GAGCTTAAAT ATAAGTGGAT TGTTCAATAC GATGAAAGTG GTAGCATAGG TAAAAGATAT
AGACGACAAG ATGAGATAGG AACACCGTTT GGGATCACAG TGGATTATCA GACTTTAGAA
GATGAAACTG TTACAATAAG AGATAGGGAT ACAATGGAGC AAATAAGGGT GCATATAAAA
GAGATAATTC CTTATCTTGA AGAAAGAATT GAGGTAAAGT TTTAA
 
Protein sequence
MVTMDEIVAL CKRRGFIFQS SEIYGGLNSC WDYGPLGVEM KNNIKRLWWK ANVQLRDDVV 
GLDSSILMNP KVWEASGHLS NFSDPMADCK LCKKRWRVDQ LQEYKCPECG GELTEARMFN
LMFKTFMGPV EDESAVVYLR PETAQGIFVN FVNVQQTMRK KIPFGIAQIG KSFRNEITPG
NFIFRTREFE QMEIEYFVKP GTDEYWHKHW IEQRINWYYN LGIRKENLRV REHGKDELAH
YAKACVDIEY LFPMGWSELE GIANRTDFDL TQHQKYSGEN LTYFDDETKQ RYIPYVIEPS
AGVDRSLLAF LIDAYEYQQI DKDDFRVVLH LHPAISPVKA AVFPLMKKEE LVKKAREIYN
ELKYKWIVQY DESGSIGKRY RRQDEIGTPF GITVDYQTLE DETVTIRDRD TMEQIRVHIK
EIIPYLEERI EVKF