Gene Athe_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1447 
Symbol 
ID7408105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1531435 
End bp1532460 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content37% 
IMG OID643715810 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002573318 
Protein GI222529436 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAT ATAAAGATGC AGGAGTAAAT ATTGAAGAAG GTTACAAAGC GGTGAACTTG 
ATTAAAAGTT TAGCGAGAGA AACTTTTGAT TCAAATGTTA TTACTGACAT AGGTAGTTTT
GGGAGTATGT ATCTTTTAAA TATTGGAAAT TCTGAATATA TTTTGGTTTC TGGCACAGAT
GGAGTTGGTA CTAAGCTGAA GATTGCGTTT TACCTTGATA AACATGACAC TGTTGGAATT
GACTGTGTTG CCATGTGTGT CAATGACATT TTATGTCACG GTGCAAAACC ACTTTTCTTC
TTAGACTATA TTGCATGTGG TAAACTAAAC AGTAGCAAGG TTGCAAACAT CGTGAAAGGC
ATTGCTGAAG GTTGCAAAAT GGCAGGATGC TCGCTTGTTG GCGGAGAGAC TGCTGAGATG
CCAGGATTTT ATAAAGAAGA TGAGTATGAT TTGGCAGGGT TTGTTGTTGG AATTGTTGAA
AGACAAAAAG CGGTGTGTGG CAAGGATGTA AACACAGGAG ATGTATTAAT TGGACTTGCT
TCAAGTGGTG TTCACAGCAA TGGTTATTCA CTTGTGAGAA AAGTTTTTGG GATAGATGAT
AATCCAAAAG TGCTTGAAAA AATATATGAA GAGCTTGGAT TGTCCCTTGG GGAAGAGCTA
TTGAAGCCAA CAAGGATATA TGTAAAACCT GTTTTGAAAG TGCTTGAAAG GGTAAATGTT
AAAGGAATAG CCCATATAAC AGGCGGTGGA TTTTTTGAAA ATATACCTCG TGCTTTTCCG
AAAGGTTACT TTGCCATCAT CGAAAAAGGT AGTTGGGAAG TGCCTGCTAT ATTTAGGTTG
ATTCAGGAAT ATGGAAAAGT AGAAGAAAGA GAGATGTTTT CAACATTTAA CATGGGAATA
GGTATGGTTC TAATAGTTTC TGAAGAAGAT GTGGATTTGA CAATGAAGAT TTTAGAACAA
GAGAAAGTAA ATGCATGGGT AATAGGTACA ATTCAAAAAG GTGAAGACGG AGTTGTTTTA
AAATGA
 
Protein sequence
MTTYKDAGVN IEEGYKAVNL IKSLARETFD SNVITDIGSF GSMYLLNIGN SEYILVSGTD 
GVGTKLKIAF YLDKHDTVGI DCVAMCVNDI LCHGAKPLFF LDYIACGKLN SSKVANIVKG
IAEGCKMAGC SLVGGETAEM PGFYKEDEYD LAGFVVGIVE RQKAVCGKDV NTGDVLIGLA
SSGVHSNGYS LVRKVFGIDD NPKVLEKIYE ELGLSLGEEL LKPTRIYVKP VLKVLERVNV
KGIAHITGGG FFENIPRAFP KGYFAIIEKG SWEVPAIFRL IQEYGKVEER EMFSTFNMGI
GMVLIVSEED VDLTMKILEQ EKVNAWVIGT IQKGEDGVVL K