Gene Moth_1919 details


Gene Information

Locus tag: Moth_1919
Symbol:
ID: 3830843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1990752
End bp: 1991765
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 56%
IMG OID: 637829852
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_430762
Protein GI: 83590753
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
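
The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1990752, 1991765)
print(length, protein_length_aa(length))  # 1014 337
```

Both values match the Gene Length and Protein Length fields reported in the record.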


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00690514
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA TCCAAACCAA GAAAAAAGGA CGAATACTGA CCGGGGACCG GCCGACAGGG 
AAGTTGCACC TCGGACATTA TGTAGGCAGC CTCATCAACC GGGTGCGGTT ACAGGATGAG
TATGATACCT TTCTGATTAT CGCCGATGTG CAAGCATTGA CTACCAATTT CGAGGAACCC
GAAAAGCTGG CCCGTGATGT CCGGGAAGTC GCCCTGGACT ACCTGGCGGC AGGGATCGAC
CCGGAGAAAA GCACCATTTT CGTCCAGTCT CTGGTACCGG AAATTGCCGA ACTAACTATC
TTTTACTCCA TGATTATCAC CGTCAATACC CTGCGCCATA ACCCGACCAT CAAGTCAGAA
GCCGCCCAAA GGGGCTATAC CGACATGACC TACGGCTTCC TGGGTTATCC GGTAAGCCAG
GCGGCGGATA TTACTTTCTG CAAGGCCAAC CTGGTGCCCG TAGGTGAGGA CCAGTTGCCC
CACATTGAGT TGACCCGGAA GCTTGTCCGT CGCTTTAACA GCCTCTACGG CCCGGTCCTG
GTAGAGCCCG AGGCCCTGGT AGGGGAAGTG CCGCGTCTGG TTGGCCTGGA TGGAGCGGCC
AAGATGAGCA AATCCCTGGA TAATGCCATC AACCTATCCG ACCCGCCGGA AGAGGTCGAA
CGCCGGGTCA AAAATGCAGT AACTGACCCG GCCCGCATCC GGGCTACCGA TCCCGGCCAC
CCCGATATCT GTACCGTTTT TGCTTATCAT ACCGCTTTCA ATAAGCCGGT GATTCCGGAG
ATCGAAGAAT CTTGTAAAAA AGGCGCCATC GGCTGCGTGG CCTGTAAAAA GCGGTTAACA
GCCACCCTCA ACGAACTGCT GGAGCCCATG CGGGAACGAA GGGCCAGGTA CGAAGCCAAC
CCTAAATTGG TTGATGAAAT CCTCCTGGCC GGGACGGCGC GCGCCCGGGC GGTGGCGAAA
GAAACTATGG CCCAGGTCCG GGAAGCCATG AAGATTAATT ATTTTCCCGG TTAG
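
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a figure like the GC content field could be recomputed from such a block (the function name `gc_percent` is hypothetical, and the database's own method and rounding are not documented here):

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percentage of G and C bases in a whitespace-blocked DNA sequence."""
    seq = "".join(blocked_sequence.split()).upper()  # drop spaces and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence as a single multi-line string works the same way, since `str.split()` with no arguments splits on any run of whitespace.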
 
Protein sequence
MAEIQTKKKG RILTGDRPTG KLHLGHYVGS LINRVRLQDE YDTFLIIADV QALTTNFEEP 
EKLARDVREV ALDYLAAGID PEKSTIFVQS LVPEIAELTI FYSMIITVNT LRHNPTIKSE
AAQRGYTDMT YGFLGYPVSQ AADITFCKAN LVPVGEDQLP HIELTRKLVR RFNSLYGPVL
VEPEALVGEV PRLVGLDGAA KMSKSLDNAI NLSDPPEEVE RRVKNAVTDP ARIRATDPGH
PDICTVFAYH TAFNKPVIPE IEESCKKGAI GCVACKKRLT ATLNELLEPM RERRARYEAN
PKLVDEILLA GTARARAVAK ETMAQVREAM KINYFPG
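
The protein sequence corresponds to a translation of the gene sequence with translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons. The sketch below is a minimal illustration under that assumption, not the database's own pipeline; `translate` is a hypothetical helper, and it does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard codon assignments (shared by table 11), in NCBI TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a whitespace-blocked DNA sequence up to the first stop codon."""
    seq = "".join(blocked_sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTGAAA TCCAAACCAA GAAAAAAGGA"))  # MAEIQTKKKG
```

The printed residues agree with the first ten residues of the protein sequence listed in the record.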