Gene Moth_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2284 
Symbol 
ID3831316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2395356 
End bp2396750 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content62% 
IMG OID637830204 
Productargininosuccinate lyase 
Protein accessionYP_431114 
Protein GI83591105 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0400918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCT GGGGCGGACG TTTTACCAGG ACAACCGACC GGCTGGTGGA GGATTTCCAC 
TCCTCCATCA GCTTTGACCA GCGCCTCTAT AAAGAGGATA TAGCCGGTTC TATCGCCCAC
GCCCGCATGC TGGCAGCGGT GGGCCTCATC ACACCGGCCG AAGGTGAAGC CATTATCAAA
GGCCTGGAGG AAATCCGGGC CGACATCGAG GCGGGCCGGG TGACCTTCGA CGTGGGCGCC
GAGGACATCC ACATGAACAT CGAAAAGCTC CTCACCGAGC GCATCGGCGA GGCTGGCAAG
AAGCTGCACA CGGCTCGCAG CCGCAACGAC CAGGTGGCCC TGGACCTGCG GCTCTATTTA
AAAGAAGAGA TTCCTGCCGT CAAAAAGCTC CTGGCCGGCC TGCAGCAGGT CCTGGTGGAT
CTGGCCGCGC AGCACCTTCA GACAATCATG CCCGGCTATA CCCACCTGCA GAAGGCCCAG
CCCGTCACCC TGGCCCACCA CCTGATGGCC TACTTTGAGA TGTTCTACCG TGACCAGCAG
CGCCTGGACG ATTGCCTTGA CAGGGTCGAC GTCATGCCCC TGGGCGCCGG GGCCCTGGCC
GGTACCACCC TGCCTATCGA CCGGGAGCTG GTTGCCCGGG AGCTGGGCTT TAAGGCTATC
AGCGCCAATT CCCTGGACGC CGTTAGCGAC CGCGATTTCG TGGTCGAGTT CCTGGCAGCC
GCCTCCCTTA TTATGATGCA CCTGAGTCGC CTGGCAGAAG AGGTCATTTT CTGGTCCAGC
GAGGAGTTCG GTTTTCTGGA ACTGGATGAC GCCTACAGCA CCGGTTCCAG TATGATGCCA
CAAAAGAAAA ACCCTGATGT CGCCGAACTG GTACGGGGTA AGACCGGCCG GGTGTACGGC
CACCTCATGG GCATGCTGGC GGTTTTAAAG GGCCTGCCCC TGGCCTACAA CAAGGACCTG
CAGGAGGATA AGGAAGCCCT CTTTGACACC CTGGACACCG TCAAGGGCTG TCTCATGGTC
TTTACCCCCA TGCTGGCCAC GGCCAGGTTC CGGGTGGAGC GCATGCGGGC CGACGCCTCC
CGGGGGTTTG CCGCGGCTAC CGATGTGGCC GAGTACCTGG TGCGTAAGGG CCTGCCCTTC
CGCGAGGCCC ATGCCGTGGT CGGTTCCCTG GTCCTCCACT GCCTCCGGGA GGGCCGCAGC
TTCCAGGATT TAAGCCTGGA GGAGTGGCAG TCCTTCTCGC CGCTGTTTGA CAATGATATT
TTCGGGTGCC TCGAGGCCGA GGCCTGCGTC AACGGCCGCA ACCTCCCCGG CGGCCCGGCC
CCGGAGGCCG TGGGGAAAGC CATAGAGCGG GCCAGGGAGA TCCTTGCCGG GATTCAGGCT
GGACTTTCAA GATAG
 
Protein sequence
MKLWGGRFTR TTDRLVEDFH SSISFDQRLY KEDIAGSIAH ARMLAAVGLI TPAEGEAIIK 
GLEEIRADIE AGRVTFDVGA EDIHMNIEKL LTERIGEAGK KLHTARSRND QVALDLRLYL
KEEIPAVKKL LAGLQQVLVD LAAQHLQTIM PGYTHLQKAQ PVTLAHHLMA YFEMFYRDQQ
RLDDCLDRVD VMPLGAGALA GTTLPIDREL VARELGFKAI SANSLDAVSD RDFVVEFLAA
ASLIMMHLSR LAEEVIFWSS EEFGFLELDD AYSTGSSMMP QKKNPDVAEL VRGKTGRVYG
HLMGMLAVLK GLPLAYNKDL QEDKEALFDT LDTVKGCLMV FTPMLATARF RVERMRADAS
RGFAAATDVA EYLVRKGLPF REAHAVVGSL VLHCLREGRS FQDLSLEEWQ SFSPLFDNDI
FGCLEAEACV NGRNLPGGPA PEAVGKAIER AREILAGIQA GLSR