Gene Athe_1694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1694 
SymboltrpD 
ID7409204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1780795 
End bp1781805 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content37% 
IMG OID643716065 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_002573561 
Protein GI222529679 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATCG ATGTGCTTGA GCTTGTGACA AACAAAAAGG ATTTAGAATA TGACCAGGTA 
AAAAATCTTC TTGACAACAT CTTGGAAGGA GAACTTGATG AGATAAAGTT TGGCGCATTT
TTGGCAGCGC TAAAAACAAA GGGTGAAACA GAAAAAGAAA TTTCGGCATT TGTTGATGCT
TTTTATGACA AGGCAAAAAA ACTTAATTTT GACCATCCAG CTACAATTGA TACATGTGGA
ACAGGTGGCG ATGGAAAAGG CACATTTAAC ATCTCAACAG CTGCAGCTAT TGTCCTTAGC
TGTTTTGACG TAAAAGTAGC AAAACACGGT AACAGAAGCA TTACAAGCAA CTCAGGCTCA
GCTGATATCT TAGAAAAACT TGGAATTGAT ATTCAAGCTG AGGAAGATAA AATTTTAAAA
GGGCTTGAGA AACTCAACTT TGCGTTTTTA TTTGCACCAC TGTATCATCC TGCTATGAAA
AAGGTTGCAA ATTTAAGAAG ATCACTTGGT ATCAGAACTG TTTTTAACAT CTTAGGTCCC
CTTTTGAACC CTGTACCACT TAAATATCAG GTTGTTGGGA CATTCAGCTT TGATGCGCAG
GATAAAGTGG CTTCTGTGTT AAGAGGTAAC AGAAAAAAAG CTGCTGTCAT ACATAGTCTT
GACGGACTTG ACGAGATTTC CATTTCTCAA AAGACAAGGG TGCTTGAGAT ACAAGACAAG
AATATCAAAG AATATTATAT TGACCCAAAA GACTATGGAA TTAAATTTGA TACAAACTCA
ATCAGAGGCT TTTCGCCTGA AGAAAATGCA AGGATTTTGA TAAGTGTGCT TGAGGGAGAA
AAGTCGCCTT ATTTTTGGGC TGTTGTTTTG AATTGTGGAT TTGCACTTTA CATTTGTGAG
GTTGCAAAAG ATGTTGAAGA AGGAATTAAG CTTTCCTCAA AAGCAATCGA GAGCAAAAAA
GCTTATTTAA AGCTCAAAGA GCTTAGGCAG TTTTATAAAT CGGGAGTGTA A
 
Protein sequence
MLIDVLELVT NKKDLEYDQV KNLLDNILEG ELDEIKFGAF LAALKTKGET EKEISAFVDA 
FYDKAKKLNF DHPATIDTCG TGGDGKGTFN ISTAAAIVLS CFDVKVAKHG NRSITSNSGS
ADILEKLGID IQAEEDKILK GLEKLNFAFL FAPLYHPAMK KVANLRRSLG IRTVFNILGP
LLNPVPLKYQ VVGTFSFDAQ DKVASVLRGN RKKAAVIHSL DGLDEISISQ KTRVLEIQDK
NIKEYYIDPK DYGIKFDTNS IRGFSPEENA RILISVLEGE KSPYFWAVVL NCGFALYICE
VAKDVEEGIK LSSKAIESKK AYLKLKELRQ FYKSGV