Gene Athe_1249 details

Gene Information

Locus tag: Athe_1249
Symbol:
ID: 7409723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1338827
End bp: 1340038
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 38%
IMG OID: 643715614
Product: argininosuccinate synthase
Protein accession: YP_002573122
Protein GI: 222529240
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
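
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (which is consistent with the 1212 bp reported here) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1338827, 1340038)
print(length, protein_length_aa(length))  # 1212 403
```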


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000123934
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGA ACAAAGTTGT CTTGGCATAT TCAGGTGGAC TTGATACCTC TGTAATCATT 
CCCTGGCTTA AAGAAAACTT TGACTGTGAA GTAATTGCGG TGGTTGTTGA TGTTGGACAG
GAAGATGACT TTGACGCCAT AAAAGAAAAA GCTTACAAGA CAGGTGCTTC AAAGGTTTAC
ATTGAAGATG CAAAAGAAGA GTTTGTGAAT GAATATATAT TCCCCACTTT AAAAGCTGGA
GCTATTTATG AGGGAAAATA TCTGCTTGGA ACATCAATGG CAAGACCTTT AATCGCTAAA
AAACTGGTCA ATATTGCAAA AAAAGAAAAC GCTGATGCAA TAGCACATGG GGCAACTGGA
AAAGGAAACG ATCAGGTAAG ATTTGAAGTG ACAATTAAAG CGCTTATGCC ACAAATAAAG
ATAATAGCTC CATGGCGAAT TTGGAATTTA AAATCGCGCG AGGATGAGCT CAATTATCTT
ACCCAAAAAG GAATTGATAT TCCTTTTAAA AAAGAAGAAA GTTACAGCAT GGACGGGAAC
ATATGGCATC TTTCTCATGA AGGGCTTGAC TTAGAAGACC CATGGAACAT GCCTGACTTT
GATAAGGTAC TAAAGATTAC AAAAAATCCC CTTAAACTTG CTGATTTACC AGAGACTGTG
GAGATTGAAT TTGAAAAAGG AATACCTGTG AAAGTAAATG GTCAGCAAAT GGGTGGAGTT
GAACTTTTGA AAACTTTGAA CAAAATAGGA TCAAATCATG GAATTGGTAT TGCGGACATA
GTTGAAAACA GGCTTGTTGG AATGAAATCG CGCGGCGTGT ATGAAACCCC TGGTGGAACA
ATTCTTTATT ATGCTCACAG GGAATTGGAA TATCTCTGCC TTGACAGAGC TACTTTACAC
TTTAAAGACA TGGTTGCAAT TAGATTTGCT GAACTTGTTT ATGATGGGCT TTGGTTTTCA
CCGTTAAGAG AAGCACTTTC AGCATTTGTC GACAAAACCC AAGAGGTTGT AAATGGCACA
GTAAGGTTGG TACTATATAG AGGTAATATC TACTCTGCTG GTTCAAAATC ACCAAATTCG
CTATATATCA AAGACCTTGC AACCTTTGAA GAAGACCAGA TGTACAATCA AAAGGATGCG
GAAGGATTTA TAAACCTGTT TGGCTTGCCT TTGAAGGTAT TTGGAATGGT GAACAGAAAG
GAGGATGAGT AA
 
Protein sequence
MKLNKVVLAY SGGLDTSVII PWLKENFDCE VIAVVVDVGQ EDDFDAIKEK AYKTGASKVY 
IEDAKEEFVN EYIFPTLKAG AIYEGKYLLG TSMARPLIAK KLVNIAKKEN ADAIAHGATG
KGNDQVRFEV TIKALMPQIK IIAPWRIWNL KSREDELNYL TQKGIDIPFK KEESYSMDGN
IWHLSHEGLD LEDPWNMPDF DKVLKITKNP LKLADLPETV EIEFEKGIPV KVNGQQMGGV
ELLKTLNKIG SNHGIGIADI VENRLVGMKS RGVYETPGGT ILYYAHRELE YLCLDRATLH
FKDMVAIRFA ELVYDGLWFS PLREALSAFV DKTQEVVNGT VRLVLYRGNI YSAGSKSPNS
LYIKDLATFE EDQMYNQKDA EGFINLFGLP LKVFGMVNRK EDE
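
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python, under stated assumptions: the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG), and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Pass the full gene sequence to `gc_content` to compare against the reported value.

```python
# Codon table built from the standard assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAACTGA ACAAAGTTGT CTTGGCATAT"
print(translate(first_blocks))  # MKLNKVVLAY
```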