Gene Athe_0733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0733 
Symbol 
ID7408427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp824014 
End bp825021 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content32% 
IMG OID643715105 
Productaminodeoxychorismate lyase 
Protein accessionYP_002572621 
Protein GI222528739 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.224262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTAT GGCAAAAAAG TTTTAAATAT TATTTGGTTG TATTACTCTT CTTAGTCTTA 
ATGGTTTCAC TTATTTATGT ATTCTTCAAA CCTCAAAAAG AAAAAGTAAT CGAAGCTATG
GTTGAGATTC CACAAAATAC ATCCACAAAA GATGTTGCTA TGATTTTAAA GAAAAATGGA
ATTATTGAAA ACCCATACTT TTTTATGTTT TACGTCAAAC TCAACAACTA TAAAATAGCA
GCAGGAAAAT ACAAACTTTC ATCTGATATG ACATATAGAG AGCTTTGCAA AGTTCTTGAA
AAAGGTTTTG TTCCAAAAGT TGCTATTAAG TTTACTATTC CAGAAGGATT TACGGTCCAG
CAAATTGCCA AAAAACTTCA AAGTCTTGGA CTTGTAGATG AAAACAAGTT TTTGGAAACT
GCTAATAGTT ACGACTTTAA TTTTAAGTAT AAATACAGTT CGAAAGAAGT AAAGTATAAG
CTTGAAGGGT TTTTGTTTCC AGATACATAT GAAGTATATC CCGGCGCTTC TGAAAAGGAT
ATTATAAAAA TGATGCTAAA TAGATTTTTA GAAGTATATG AAAACATAAA AATTAAAAAG
ACAACAAATT TAGATGATAT TCAAACAGTT ATACTTGCTT CAATTGTTGA AAAAGAGGCA
AAAAAAGATA GCGAAAGAGG GATTATTGCT GGTGTGTTTT CAAACAGGCT ACAAAGAGGC
ATAAAACTTG AAAGCTGTGC AACGGTAGAA TATGTGTTGC CTGTTCACAA AGAGGTTCTT
TCTTTGCAGG ATGTTAGAAT AGAATCTCCG TACAATACAT ACCTAAAAAA AGGACTGCCG
CCTTCTGCTA TCTGCAGCCC TGGCAGAAAA AGTCTTCTTG CAGCTTTAGC TCCTGAAAAA
ACAGATTATC TATTTTTTGT TGCCAAAAAG GATGGAACTC ATATATTTTC AAAGACATTT
GAAGACCATT TGAGGGCTCA AAAACAAATA GAAGAAGGGA AAAAATAA
 
Protein sequence
MRLWQKSFKY YLVVLLFLVL MVSLIYVFFK PQKEKVIEAM VEIPQNTSTK DVAMILKKNG 
IIENPYFFMF YVKLNNYKIA AGKYKLSSDM TYRELCKVLE KGFVPKVAIK FTIPEGFTVQ
QIAKKLQSLG LVDENKFLET ANSYDFNFKY KYSSKEVKYK LEGFLFPDTY EVYPGASEKD
IIKMMLNRFL EVYENIKIKK TTNLDDIQTV ILASIVEKEA KKDSERGIIA GVFSNRLQRG
IKLESCATVE YVLPVHKEVL SLQDVRIESP YNTYLKKGLP PSAICSPGRK SLLAALAPEK
TDYLFFVAKK DGTHIFSKTF EDHLRAQKQI EEGKK