Gene Moth_1771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1771 
Symbol 
ID3831063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1825826 
End bp1826929 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content60% 
IMG OID637829696 
Productpeptidase M24 
Protein accessionYP_430615 
Protein GI83590606 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0652129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGCAC GTTATGCCAA CCGTATTGAG AAGGCCCGGG AACTGATGAT AGAAAAAGAC 
CTGGATCTGC TCTTTGTGGT CAACCGGGAG AACCTGATTT ACTTTACCGG CCTGACCCAG
ATCGAGTGCC TGGCCGTGCT TATCCCCAGG GAGGGAGAAC CATGTGCTGT GACCCTCTGG
CTGGATGCTG ATTATGTAGA ACGGGAGTCA GGGCTCACCA CCTATGGCTA TTACTTTCCG
CGGGAGAGCC TGGCCAGCAA AGTTGTGGAA CGCATCAAGG CCTATGGTTT CAAGGTACCG
CGTATAGGTT TTGAACGCTA CTTTGTCGAT TTTGCCGTCT ACGACGGCCT GCGCCGGGCC
TTTCCGGAGG CCAGCTTTAC CGGGGCAAGC GATCTCTTTT ATCGCCTTCG CTCCATTAAA
GAACCAACGG AAGTAGAACT CCTGCGGCGG GCGGCGGCGG CCGCCTGCCG CGGCATGGAA
GCGGCCATCA AAAGCGTCCG GCCGGGGGTC ACGGAGCTGG ACATCCTGGC CGAAGCGGAA
TACGCCATGT TGAAAGCAGG CTCAGGTGGG TCTTCCTTCC GGCCTCAGGT GGTCTCTGGG
GAACGGGTCC TCCTGACCCA CCCCTGTGCG AGCAATAAAA AGATTGCGCC GGGGGAGGCG
GTGGTCATCC ACCTGGGCGC GACTTACGAG GGTTACTGTG CCAAGATGTG CCGGACCGTG
GCTGTAGGCC GGATCCCTCC GGAGCAAGAA AATATCTACT ATCTCCTGCT GGAGGCCCAG
GGCCGGGCCA TAGCCGCTTT AAGGCCCGGG GTCACGGCAG GGACGGTGGA TGCCGCCGCC
AGGCAGGTTG TAGAAGTCGC CGGCTATGGC GATAGTTACC TGGAGGTGGT GGGTTACGGC
GTGGGCCTGC GCCAGTCGGA GTTCTACCCC ATTGTCGGTA GAGGGCGGGA GGAGGTTATC
GAGGCCGGCA TGGTAGTAGA CCTGCTCCTG CCGACCATCT ACCGTCCCGG CATTGGCGGG
CCCAGGGTGA CGGATGTTAT CTATGTCGGC CGGGAAAAGA ACGAGATCCT GACGGATTAC
CCGCGGGAAC TGGTACGGGT GTAG
 
Protein sequence
MTARYANRIE KARELMIEKD LDLLFVVNRE NLIYFTGLTQ IECLAVLIPR EGEPCAVTLW 
LDADYVERES GLTTYGYYFP RESLASKVVE RIKAYGFKVP RIGFERYFVD FAVYDGLRRA
FPEASFTGAS DLFYRLRSIK EPTEVELLRR AAAAACRGME AAIKSVRPGV TELDILAEAE
YAMLKAGSGG SSFRPQVVSG ERVLLTHPCA SNKKIAPGEA VVIHLGATYE GYCAKMCRTV
AVGRIPPEQE NIYYLLLEAQ GRAIAALRPG VTAGTVDAAA RQVVEVAGYG DSYLEVVGYG
VGLRQSEFYP IVGRGREEVI EAGMVVDLLL PTIYRPGIGG PRVTDVIYVG REKNEILTDY
PRELVRV