Gene Moth_1273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1273 
SymbolaspA 
ID3832913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1316165 
End bp1317595 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content60% 
IMG OID637829209 
Productaspartate ammonia-lyase 
Protein accessionYP_430130 
Protein GI83590121 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1027] Aspartate ammonia-lyase 
TIGRFAM ID[TIGR00839] aspartate ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000429353 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTACCC GCCAGGAACA CGACCTGCTG GGCACCAGGG AAGTACCAGC TACTGCTTAT 
TATGGTATCC ATACCCTGCG GGCCGCAGAA AACTTCAACG TCAGCCGGGC CAGGGTCCAT
CCGGAATTGA TTAAAGCCCT GGCTACTGTA AAAGAAGCCG CGGCCAGGGC TAACCTGGAC
CTGGGTTACC TGCCGGCCGA AAAAGGCCGG GCCATCATCA CCGCCTGCCA GGAAGTGGCC
CGGGGTGAAC TGGCCGACCA GTTTTTCCTC GACGCCTACC AGGGCGGCGC CGGTACCTCA
ACCAATATGA ACGTCAACGA GGTAATCGCC AACCGCGCCC TGGAAATTCT GGGTCGCCCC
AAAGGCGATT ACGCTACCAT CCATCCCATC GATCACGTTA ACCTGCATCA GTCCACTAAC
GATGTCTACC CCACGGCCAT GCGGGTGGCG GCCATCCGCC TGTTGCTGCC CCTGGCGGAT
GAACTGGCGA AACTCCAGGA AGCCCTCCAG GAGAAAGAGG CCGCCTTCGC CGGGGTGGTC
AAAATCGGTC GTACCGAGCT CCAGGACGCC GTGCCGGTAA CCCTGGGGCA GGAATTCGGC
GCTTACGCCC AGGCCATTTC CCGGGACCGC TGGCGCCTCT ATAAAGTTGA AGAGCGCCTG
CGCCAGGTAA ACCTGGGTGG CACCGCCACC GGCACCGGCC TTAACGCCCC CCTGAAGTAC
ATCTACCTGG TCAACGACTA CCTGCGCCGC CTTACAGGAA TAGGCCTGGC CCGGGCGGAG
AATATGATTG ACGCCACCCA GAATATGGAC GTCTTTGTGG AGGTCTCTGG TCTGGTCAAG
GCTGCCGCCG TTACCATGCA CAAAATAGCC TCCGACCTGC GTTTTATGGC CGCCGGCCCC
CGGGGCGGCC CGGCGGAGAT CAATTTGCCG GAACGCCAGG CGGGATCCTC CATCATGCCC
GGCAAAGTCA ATCCCGTCAT CCCGGAGATG GTCAGCCAGG TAGCCATGCA GGTCATGGCC
AATGATTACT TGATCGCCAT GGCTGCCAGT CAGGGCCAGC TGGAGCTCAA TCCCTTTGCC
CCCCTTATTG CCCATACCTT GCTGGAATCC CTGGCCATGC TGGCGGCAGC GGCCCGGATA
TTCCGCACCG AGTGTATCAC GGGTATAACC GCCAACCCCG AGCGCTGCCA GGAACTCCTG
GCCGTGAGCC CGGCCCTGGC TACGGCCCTG CTGCCCTATA TTGGCTACGA GAAGGCCACG
GAAGTAGTGC GGGAAGCCGT GGTTTCCGGC CGATCAATAA AAGAAATAGT TCTAGAAGAA
GGGTATTTGA CCTCTGACGA ACTGGAAAAC GTCTTAACCC CGGCCGCCAT GACCAAACCG
GGAACCGTCG GAGCCGTAAA GCAGGGAATA AAGGAGAAGG GAAAAGCGTA A
 
Protein sequence
MSTRQEHDLL GTREVPATAY YGIHTLRAAE NFNVSRARVH PELIKALATV KEAAARANLD 
LGYLPAEKGR AIITACQEVA RGELADQFFL DAYQGGAGTS TNMNVNEVIA NRALEILGRP
KGDYATIHPI DHVNLHQSTN DVYPTAMRVA AIRLLLPLAD ELAKLQEALQ EKEAAFAGVV
KIGRTELQDA VPVTLGQEFG AYAQAISRDR WRLYKVEERL RQVNLGGTAT GTGLNAPLKY
IYLVNDYLRR LTGIGLARAE NMIDATQNMD VFVEVSGLVK AAAVTMHKIA SDLRFMAAGP
RGGPAEINLP ERQAGSSIMP GKVNPVIPEM VSQVAMQVMA NDYLIAMAAS QGQLELNPFA
PLIAHTLLES LAMLAAAARI FRTECITGIT ANPERCQELL AVSPALATAL LPYIGYEKAT
EVVREAVVSG RSIKEIVLEE GYLTSDELEN VLTPAAMTKP GTVGAVKQGI KEKGKA