Gene Athe_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1248 
Symbol 
ID7409722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1337446 
End bp1338834 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content33% 
IMG OID643715613 
Productargininosuccinate lyase 
Protein accessionYP_002573121 
Protein GI222529239 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000860987 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC TGAAACTCTG GGAAGGCAGG TTCTCAAAAT CTACAGCGCA GATTTTTGAC 
CTTTTTAATG CTTCTATTAT GACTGATATA AAACTCTTTG AATACGACAT TCTTGGATCT
GTTGCACATG TTAAAATGCT TGCAAAGTGC AATATTATCC GTGAGGATGA GGCAAAACTT
ATCATAGATA GTCTCTATCA AATATTAGAA GACTTTAAAT TGGGTAAAAT TGTATTTGGA
ATTTCTGATG AAGATGTACA CATGCTGATT GAAAAAGAGC TCATTAAAAG AATAGGAGAG
GTTGGCAAAA AAGTCCATAC AGCCAGAAGC AGAAATGATC AAGTTGCTCT TGATGAAAGA
TTATTTTGTC GTGAAAAGAA TTTGTATCTT CAAGAACTAA TAAAAACGCT AATAAATACC
ATCACAACCT TAGCTGAAGA AAATATCGAT GTAATTATGC CAGGGTTTAC TCATCTGCAA
AAGGCCCAGC CTATACTTTT TTCTCATTAT ATTCTTGCAT ATGCTCAAAT GCTAAAAAGA
GATTTGTTAA GACTTAGACA CAACTACAGT ATGACAAATT CCAGCCCGCT TGGCAGCGCT
GCTTTGGCAG GAACCACATT TGAAATAGAC AGATTTTTTG TAGCAAGTGA ACTCGGTTTT
GAAAGTGTTA CAGAAAACAG TGTCGACACT GTCTCTGACA GGGATTTTAT ACTTGAAATG
TTATTTTCAC TTGCCATGAT TCAGATGCAT TTGTCACGAC TTGCGGAAGA TTTTATTATT
TTTAATACTG ATGAATTTAA ATTTATTGAA CTTGACGATA GTTTCTGTTC AGGCAGTAGT
ATTATGCCTC AAAAGAAAAA TCCTGACGCT TTAGAGTTAA TACGTGGTAA AACAGGAAGA
GTATATGCAG ACTTAATTGG ACTTTTAACA GTACTAAAGG GTTTACCTCT TTCATACAAC
AAAGATTTGC AGGAAGACAA AGAATTCTTA TTTGATTCAA TTGAAACAGT AGAAATGAGT
TTAATAGTAA TAAATGAAAT ACTTAAAACT CTTAAAATTG ACAAAGAAAA TATGGTTAAT
TCCTGTAAAT CTGGATTTAT CAATGCAACA GACCTTGCAG ATTACTTGGT GACAAAAGGA
GTACCTTTTA GAGATGCCCA CTTTATTGTA GGAAACATTG TAAAGTACTG TATTGAAAGC
GACAAAACAT TGGAAGATTT ATCGTTAGAG GAATATAAAA GATTTTGTGA GAAGATTCAA
GAGGATGTAT ATCAATTTAT AAAGATTGAA ACCTGTGTAA ACCGCAGAAA AAGCTATGGC
GGAACTTCGC TGGAGAGTGT AAGAAAACAA ATTGATAATC TAAAGGAATT TTTGAATAAG
TTAAAATGA
 
Protein sequence
MSNLKLWEGR FSKSTAQIFD LFNASIMTDI KLFEYDILGS VAHVKMLAKC NIIREDEAKL 
IIDSLYQILE DFKLGKIVFG ISDEDVHMLI EKELIKRIGE VGKKVHTARS RNDQVALDER
LFCREKNLYL QELIKTLINT ITTLAEENID VIMPGFTHLQ KAQPILFSHY ILAYAQMLKR
DLLRLRHNYS MTNSSPLGSA ALAGTTFEID RFFVASELGF ESVTENSVDT VSDRDFILEM
LFSLAMIQMH LSRLAEDFII FNTDEFKFIE LDDSFCSGSS IMPQKKNPDA LELIRGKTGR
VYADLIGLLT VLKGLPLSYN KDLQEDKEFL FDSIETVEMS LIVINEILKT LKIDKENMVN
SCKSGFINAT DLADYLVTKG VPFRDAHFIV GNIVKYCIES DKTLEDLSLE EYKRFCEKIQ
EDVYQFIKIE TCVNRRKSYG GTSLESVRKQ IDNLKEFLNK LK