Gene Athe_0507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0507 
Symbol 
ID7408631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp576528 
End bp578108 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content41% 
IMG OID643714889 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_002572406 
Protein GI222528524 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAATA ATAAGACCAT CATCATCTAT GACTCAACCT TAAGAGACGG TGCTCAGGCT 
GGTGGAATTT CATATACTCT GGAAGATAAG CTCAAGATTG TAGAGAGGCT TGACAAATTT
GGTGTGAAAT TTATCGAAGC AGGGAATCCC GGTTCTAACA TCAAAGACCA GGAATTTTTT
GCAAGAGTTA AAAAGATGAG ATTGAAAAAC GCAAAGCTTA TCGCCTTTGG TTCAACAAGG
CGAGTGGGGA TTAACGTGAA AGATGACCCT AACATTCAGT CACTCATTGC GGCTGATACC
GAAGCTGTTG CAATCTTTGG CAAGTCATGG GATTTTCATG TTAAAGAGGT CTTGAAAACA
ACAGAGGATG AAAATCTTCA GATGATTTAT GATACTATAA AATATTTAAA GTCATTGGGG
AAGTATGTTG TATTTGATGC AGAGCACTTT TTTGATGGTT ATAAAAATAA CAAAAAGTAC
GCTTTGGAGA CTTTAAAGGT TGCAAAAGAA GCAGGTGCAG ACTCTTTGGA CCTGTGCGAT
ACAAATGGCG GTACTTTCCC AATGGATGTT TACAACATCA CGAAAGAAGT TGTTGAGATG
TTTCCTGGGA CGATGATTGG AATTCACTGT CATAACGACA CAGGCATGGC TGTTGCAAAC
TCAGTCATGG CGGTTTTGGC AGGAGCTCGT CAGGTTCAGG GGACTATAAA CGGATATGGT
GAGAGATGTG GAAACGCAGA CCTTATTACA CTCATACCAA ATCTTCAGCT AAAGCTTGGC
TTTAAATGTG TACCAGATGA GAACATAAAA CACCTGACAT CACTTTCAAG GTATGTTGCA
GAGATTGCCA ACATGATTCC AAACGAGCGC GCACCATATG TTGGAGCTTA TGCGTTTACT
CACAAGGCTG GTATGCACAT TGATGCTGTC AAGAAAAATC CAGCTTCGTT TGAGCATATT
AACCCTGAGA TTGTTGGAAA CACAAGAAGA ATAGTACTGT CTGAGGTTGC AGGAAGGGCT
ACAATTCTTG ACAAGATTCG CGAGATTGAC CCGACAGTTA CAAAAGACTC ACCTGTCACA
AAAGAGATTA TTGATGAGCT AAAGCGTCTT GAAAATGAAG GGTATCAGTT TGAGTCTGCA
GAGGCTTCAT TTGAGATGTT AATTAGAAAA AAACTGGGAC TTTACCAGCC GTTCTTTACT
CTCAAAGAAT TTAAAGTTCT CATTAATGAA CCGGCAGTAG AGTACAGCTC ATCTGCAATT
GTAAAGATTG CGGTAGATGG GGTTACAGCA ATCACTGCTG CAGAAGGTGA TGGTCCTGTT
CATGCTTTAG ATAGTGCTTT GAGAAAGGCA TTGGAAAAGT TCTACCCAGA GCTCAAAGAG
GTTCATCTTG TTGACTACAA AGTAAGAGTG CTCAACGCCG AGACTGCAAC TGCTGCAAAG
GTAAGGGTTC TGATTGAGTC AACAGACGGC AAAGACACAT GGACAACTGT AGGTGTTTCA
ACCGACATTG TAAATGCAAG CTGGATTGCA CTTGTTGACT CACTGGAGTA TAAGCTTTGC
AAAGAAAAAG TGGGAAAATA A
 
Protein sequence
MENNKTIIIY DSTLRDGAQA GGISYTLEDK LKIVERLDKF GVKFIEAGNP GSNIKDQEFF 
ARVKKMRLKN AKLIAFGSTR RVGINVKDDP NIQSLIAADT EAVAIFGKSW DFHVKEVLKT
TEDENLQMIY DTIKYLKSLG KYVVFDAEHF FDGYKNNKKY ALETLKVAKE AGADSLDLCD
TNGGTFPMDV YNITKEVVEM FPGTMIGIHC HNDTGMAVAN SVMAVLAGAR QVQGTINGYG
ERCGNADLIT LIPNLQLKLG FKCVPDENIK HLTSLSRYVA EIANMIPNER APYVGAYAFT
HKAGMHIDAV KKNPASFEHI NPEIVGNTRR IVLSEVAGRA TILDKIREID PTVTKDSPVT
KEIIDELKRL ENEGYQFESA EASFEMLIRK KLGLYQPFFT LKEFKVLINE PAVEYSSSAI
VKIAVDGVTA ITAAEGDGPV HALDSALRKA LEKFYPELKE VHLVDYKVRV LNAETATAAK
VRVLIESTDG KDTWTTVGVS TDIVNASWIA LVDSLEYKLC KEKVGK