Gene Moth_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1363 
Symbol 
ID3832285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1406008 
End bp1407159 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content64% 
IMG OID637829299 
Productpolynucleotide adenylyltransferase region 
Protein accessionYP_430219 
Protein GI83590210 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCCT ACCGGAAGCT TTTTAACCAG CACCAGCTAG CCATCCTGGC CCGGGCCAGG 
AAAATAGCCC ACCTCAAGGG CCAGCGAGTT TACCTGGTAG GGGGTACGAC CCGGGATCTC
TTGCTGGGAC GGCCGGATGT CGACCTGGAT CTGGTGGTCG AGGGGAACGG CCTCCTCCTG
GCCCGGGAAC TGGCCCGGCA ACTGGGAGGG CGCCTGGTCT ACCACGAGCG CTTCCTGACG
GCTACCATTT ACTGGGCCGG GGAGCGGGTG GATGTGGTCA CCGCCAGGAA GGAGTATTAT
CCCGAGCCGG GAGCGCTGCC GGAGGTCGAG GCCGCCACCC TGGCCGAGGA TCTGGCCCGG
CGCGATTTTA GCATTAACGC CATGGCCCTG CCATTAACGG CCCGGGACCT GTGGGAGATC
CGGGATCCCT TTAACGGCCG GGCCGACCTG GCTGCCGGTA AAATAAGGGT CCTCCACCAG
GCGAGTTTCC GCGACGATCC CACCCGTATC CTGCGGGGGG TACGCCTGGG AACCAGGTTA
CATTTCCAGC TGGCCAGTGA GACCAGGGAA CTAGCCCGGG AAGCCCTGGC TGCCGGCTAC
CTGGAACTGG TATCACCCCG GCGTTTCTGG CAGGAGTTTA TCCTGCTCCT CAAGGAATCG
CGGCCGGTTG CTGCCTGGGA AGCCCTGCTG GCCCTGGGAT GGCGGGGCCT GCCGGGGAGC
GGTCCCCCGG ATGTAGCTGC CGGTTACCGG GCGGAGAAAC TCCTGCGCCA GTGGCAGTGG
CTGGGCCAGC ACCCCCCTGA CGCCAGCCTG GTGTATTTTT TAACGGCCAC GGCTAATTTG
AGCCTGGCAG CCCTGGAGGA GGTTTTGGAT CACCTGAACT GGGGCCGCAG GCGTCGTCGG
GTAATTAACG CTGCCAGGGT TTTACAGCAA GGGGGCTGTT TTGGACCTGA ACCGGGTTTG
CCCCCGGGAG AAATGATGAC CGGGTCGCGC CCGGCCCCGG AGGCGCTGAT CTTTGCCCTG
GCCCAGGCCG GGTGGAGCCA GTTGCCGGGC TGGGCCCTAC CTGGTGGGAA AGGAGTCGGC
CATGCACCTG CAGAGCTTGT CCCAGTTAAT CCTGGGTATT CCAGCCATCC TGATGGCCCT
GACCTTTCAT GA
 
Protein sequence
MIPYRKLFNQ HQLAILARAR KIAHLKGQRV YLVGGTTRDL LLGRPDVDLD LVVEGNGLLL 
ARELARQLGG RLVYHERFLT ATIYWAGERV DVVTARKEYY PEPGALPEVE AATLAEDLAR
RDFSINAMAL PLTARDLWEI RDPFNGRADL AAGKIRVLHQ ASFRDDPTRI LRGVRLGTRL
HFQLASETRE LAREALAAGY LELVSPRRFW QEFILLLKES RPVAAWEALL ALGWRGLPGS
GPPDVAAGYR AEKLLRQWQW LGQHPPDASL VYFLTATANL SLAALEEVLD HLNWGRRRRR
VINAARVLQQ GGCFGPEPGL PPGEMMTGSR PAPEALIFAL AQAGWSQLPG WALPGGKGVG
HAPAELVPVN PGYSSHPDGP DLS