Gene Msed_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1689 
SymboltrpD 
ID5105335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1627506 
End bp1628537 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content47% 
IMG OID640507583 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001191768 
Protein GI146304452 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.304746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0692576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCTA AGGAGGTCCT CAAGAGATTA ACCGAAGGCG TATCCTTGTC TCAAGAGGAG 
GCAAAGGAAT TAGCCGATTT AATCATGGAG GGATCAATAC CAGAACCTTT GGTTGCTGGT
ATCTTAGTTG CACTAAAGAT GAAAGGAGAG ACCCCAGACG AAATAATAGG ATTTGTCAAT
TCCATGAGGC AACATGCGCT AAAACTGGAC TTGAGGAACA CCCTCGACAC GGCTGGCACG
GGAGGAGACG GTATAGGAAC AATTAACGTT AGCACGGCTA CTGCTCTGGC CGTTAGCTCT
GTCTTTCCTG TGGCGAAACA TGGTAATAGG GCTGCAAGTA GTAGAAGCGG AAGCGCGGAC
TTCCTTGAAT CCCTTGGGTA CAATATCCAA GTTCCTCCAG AGAAGGCCAA GGATCTACTT
TCGCGGAACA ATTTCGTGTT TCTCTTTGCA CAGCTATATC ATCCTTCCAT GAAGAACGTT
GCACCTGTAA GGAAGGTCCT AGGAGTTAGA ACTATCTTCA ATCTCCTGGG CCCTCTCACC
AACCCAGCAG GATCCGAGAG GCAGGTCATG GGAGTATACT CTCTGCCTTT CATGAGAAAG
CTAGCTGAAG CAGCACTTAA GCTAGGTTAC GTCAAGCTTG TACTTGTTCA CGGGGAGCCT
GGACTGGATG AGGTCAGCCC TCAAGGGAAG ACATATATCA CAGAGGTAAC GGGGAGTAAG
GTTGAGGAAT ATACCTATGA TTTCTCTGAA ATTATAGGTC AACCAGTTCC TGTTTCCAGA
TTGACGACCA CAGATCCTCT CGACTCCGTG AGAAGAGTTC TGATGGCGTC CATGGGGAGG
GATGAGGCCG TAGAGAAGTT CATTAGAATA AACGTGAGCG TCGCACTTTA CACTGCGGGA
CTTGTTTCTG ATTTTAAGGA TGGATTTGAA TTATCAGAGG AACTTGTAAG AAAGTTGCCA
GATCGAATTG AGACTTTGGT AAGGGACAAT GGAGATCCTA GTAAGATCAA GGCCATAAAG
GGATCCCTAT GA
 
Protein sequence
MDPKEVLKRL TEGVSLSQEE AKELADLIME GSIPEPLVAG ILVALKMKGE TPDEIIGFVN 
SMRQHALKLD LRNTLDTAGT GGDGIGTINV STATALAVSS VFPVAKHGNR AASSRSGSAD
FLESLGYNIQ VPPEKAKDLL SRNNFVFLFA QLYHPSMKNV APVRKVLGVR TIFNLLGPLT
NPAGSERQVM GVYSLPFMRK LAEAALKLGY VKLVLVHGEP GLDEVSPQGK TYITEVTGSK
VEEYTYDFSE IIGQPVPVSR LTTTDPLDSV RRVLMASMGR DEAVEKFIRI NVSVALYTAG
LVSDFKDGFE LSEELVRKLP DRIETLVRDN GDPSKIKAIK GSL