Gene Moth_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0738 
Symbol 
ID3831130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp771443 
End bp773122 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content62% 
IMG OID637828669 
Productpeptidase M1, membrane alanine aminopeptidase 
Protein accessionYP_429599 
Protein GI83589590 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00291351 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCT TAAGCCTCGC CATCCTGGCT TTCTTTTTTT ATCCCCGGCT CGTCCCGGAG 
CCGGCGGCGA TTGCGACGGA TGCTACTTTC AAACAGGTGG TCCTGGACCC ACCGCCCCTG
GAAGCAGACG GGCAGGTACT CGTTTCGGCG GGGGATTTTC TACCCCTGGC CGGCGGCAGC
GCCGACTGGT ATAATGGCCA GGGCGTCCTC AGGGGGGCGG GGAGCGCCAG AGTTGTTGCC
GGCAGCCCGG TGGCTACCCT CAACGGCGAG CCCCGGCAGC TCAAAGTACC CCCCCGCCTG
GTGGGCGATA AACTCTACAT CCCCCTCCAG CTGGCCCTGG CAGTGCTGGG GACTTCGAGT
AGCCAGGGGG GATCTGACCT GCCCCTGCCG CCCCTCCGGG GCAGCACAGG GGTCCGTTTT
CGTTACTACC CCGTCTACGA CCTCCAGGCC CGTTACGACC CGGCCAGCGG CGAGATTCAG
GGAGAACTGC TCCTGGCTTA CCAGAATCCT TTCCTCACTC CTTTGCGGCA GCTCAATTTC
AACCTCCCGG CCAATGCTCC CTTCGGTAAT GGCGCCAGCC TGGCGGTCAC CAGGGCGGTC
GTAAACAACC GGCCGGTGGC GGTACACTTC AAGGGCAGCC GGTTGGAGGT GCCCCTACCA
CGGGCCCTGG CCCCTAGGGA GACCCTTTCC GTGGTCCTTT CCTTTAAGAC TATAGTACCG
CCGGGGGATA TGCGCCTGGG CCGGGATGGG AACCTGAGCA CGGTATCCGG CTGGTACCCT
ATCCTGGCTC CCCCCACGGA GGATACCTGG GCCGGGGTGG CCGGTACGGC CTATGGCGAC
CCTTACTTCG CCGGCGCAGC CTATTACCTG GTGCGGCTCA CCCTGCCATC CGGCTATCAG
GTCCTGGCCA GTTCCCGGCT GACGGGACGC CAGGAAAGGG GAGAATGGAC AGACTGGTCT
TTTAATAGCG ACCAGCCGGT ACGGGAGTTT GCCTTTACAG CGGCCCCGGA CTGGCAATTC
ACCACCCGCC AGGCGGGCAG GGTGCAACTG GTAGTTGCGT CCCGGGGGGA GATGGCGCCG
GCGGTCCTGG ATGTCGCCGC CCGCGCTCTG GAGTTTTTCC AGAAGCTCTA TGGACCTTAT
CCTTACAGCT ACCTGCATAT AGCCTTTGTC CCCCTGGACA ACCTGGCCGG CATGGAATAC
CCTGGCCTGC TGCTCCTGAG CAACCGCAAA CCATATAACC CTGCCGTAGT CGTTCACGAG
GTCGCCCACC AGTGGTGGTA CAATCTGGTG GGTAACGACA CCCGGCAGGC GGCCTGGATT
GACGAGGGCT TGGCCGAGTA CAGCACCCTC CTGTTTTACC GGCACTTCGA TCCCGGCCTT
TATCAGGCCA AACTGGCCGA GATTACCCAA CTCGCTGCCC GCACCGGTGC GCCCATCAAC
CTCCCCCTGG AGGAATACGG TAGCGAACAG GACTACCGCC AGGCTGTCTA CAACCGGGGG
GCGATGTTCT GGCTGGAACT GGAAAGGATG GCCGGTGAAG AACAATTAAA GGAGGCCCTG
GCCTATGTCC AGCGCTATTA CCGTTACGAG ATTATCCCGC CCAGGGCCCT GCTGACAATT
ATTACGTATT ATGGCAAGCT AGATTCCAAC AATTTCAGTC CGTTTTTACG GAACAAATAA
 
Protein sequence
MAVLSLAILA FFFYPRLVPE PAAIATDATF KQVVLDPPPL EADGQVLVSA GDFLPLAGGS 
ADWYNGQGVL RGAGSARVVA GSPVATLNGE PRQLKVPPRL VGDKLYIPLQ LALAVLGTSS
SQGGSDLPLP PLRGSTGVRF RYYPVYDLQA RYDPASGEIQ GELLLAYQNP FLTPLRQLNF
NLPANAPFGN GASLAVTRAV VNNRPVAVHF KGSRLEVPLP RALAPRETLS VVLSFKTIVP
PGDMRLGRDG NLSTVSGWYP ILAPPTEDTW AGVAGTAYGD PYFAGAAYYL VRLTLPSGYQ
VLASSRLTGR QERGEWTDWS FNSDQPVREF AFTAAPDWQF TTRQAGRVQL VVASRGEMAP
AVLDVAARAL EFFQKLYGPY PYSYLHIAFV PLDNLAGMEY PGLLLLSNRK PYNPAVVVHE
VAHQWWYNLV GNDTRQAAWI DEGLAEYSTL LFYRHFDPGL YQAKLAEITQ LAARTGAPIN
LPLEEYGSEQ DYRQAVYNRG AMFWLELERM AGEEQLKEAL AYVQRYYRYE IIPPRALLTI
ITYYGKLDSN NFSPFLRNK