Gene Athe_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0474 
SymbolaspA 
ID7407553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp544462 
End bp545847 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content38% 
IMG OID643714862 
Productaspartate ammonia-lyase 
Protein accessionYP_002572379 
Protein GI222528497 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1027] Aspartate ammonia-lyase 
TIGRFAM ID[TIGR00839] aspartate ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.180708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGAA TAGAAAAAGA TTTCTTGGGC AGTATTGAGC TTTCTGACCT TGAGCTTTAT 
GGAATTCACA CAAAACGCGC TTTTGCTAAT TTCAATGTTT CTGGAAAGAG CGTTGACAAA
GATTTAATAA AATCGCTTGT CATGGTCAAA AAAGCGTGCG CAATTGCAAA TTATGAAGTT
GGTCTTTTGG ATGAAAAAAT TAAAGATGCT ATTGTCTTTG CATGTGACGA AATTCTGGCA
GGAAAATATG ATGACCAGTT CATTGTAGAC AGATTCCAGG GCGGTGCGGG AACATCTACA
AATATGAATG TAAACGAAGT TATTGCAAAC GTAGCCTTAA TTCACATTGG AAGAAAACCG
GGTGAGTATG ACATAATTCA TCCAATCAAC CATGTTAATA TGTCACAGTC AACAAACGAT
GTGTACCCTA CAGCCTTGCG AATTGCCACT ATATGGAATG TAAGAGAACT TTCAGAAGAA
TGTGCAGAGC TTCAAAAAAG CCTTCAGAAA AAAGAGCATG AATTTGAAGA TGTAATCAAG
GCAGGAAGAA CACAGCTGCA GGATGCCCTG CCTGTAACAC TTGGTCAGGA GTTTGGTGCA
TATGCCCAAG CTATCTCACG CGACAGATGG AGACTATACA AGGTTGAAGA GCGGCTAAGA
GTGGTCAATC TTGGTGCAAC TGCTGTTGGC ACAGGAGTAA ACGCACCTTT GAAATACATT
TTTAAGGTGA TAGAAATATT AAGAACTTTA ACCAAAATCG GCTTGGCTCG TTCAGACTAT
CTTATGGACG CAACACAGAA CGCAGACGTT TTTGTTGAAT GCTCTGGGCT TTTGAAAGCA
TTAGCAGTAA ATCTCTCAAA AATTGCAAAT GATCTTCGTC TTCTTTCCTC TGGCCCAAAC
ACGGGCTTTA ATGAGATAAA CCTGCCAGCT GTTCAGGCAG GTTCAAGTAT TATGCCAGGA
AAGGTAAATC CTGTTATACC AGAGCTTATA AACACAGTAG CTTTTCAGGT GATGGCAAAT
GACTTTGCGA TAACTTTAGC AGCACAAGCT GGTCAGCTTG AGCTGAATGC TTTTTTACCT
CTGATAGCAA ACAATCTTCT TGAAAGTCTT AAAATTCTCA AAAACGGTAT TAAAATTTTC
AGGCAGCAGT GTATAGATGG TATAACAGCA AACAAAGAAA AATGTTTAGA GTATGCAAAA
AAGACTCCTG CTATTGCAGC AAGCTTAATT GACAGGATTG GATATGACAA GGCAGCAGAA
ATTGCAAAAA AGGCTATTCT TGAGAACAAA CAGATAATTG ATGTTGTCAA AGAGCTAAAT
ATTATGGATG AAAAAGAAGC ACAAGAGCTT TTGAATCCTT TTGAGTTTAT AAAGTTTAAA
GAATGA
 
Protein sequence
MSRIEKDFLG SIELSDLELY GIHTKRAFAN FNVSGKSVDK DLIKSLVMVK KACAIANYEV 
GLLDEKIKDA IVFACDEILA GKYDDQFIVD RFQGGAGTST NMNVNEVIAN VALIHIGRKP
GEYDIIHPIN HVNMSQSTND VYPTALRIAT IWNVRELSEE CAELQKSLQK KEHEFEDVIK
AGRTQLQDAL PVTLGQEFGA YAQAISRDRW RLYKVEERLR VVNLGATAVG TGVNAPLKYI
FKVIEILRTL TKIGLARSDY LMDATQNADV FVECSGLLKA LAVNLSKIAN DLRLLSSGPN
TGFNEINLPA VQAGSSIMPG KVNPVIPELI NTVAFQVMAN DFAITLAAQA GQLELNAFLP
LIANNLLESL KILKNGIKIF RQQCIDGITA NKEKCLEYAK KTPAIAASLI DRIGYDKAAE
IAKKAILENK QIIDVVKELN IMDEKEAQEL LNPFEFIKFK E