Gene Msed_1687 details


Gene Information

Locus tag: Msed_1687
Symbol:
ID: 5105333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1625666
End bp: 1626928
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 48%
IMG OID: 640507581
Product: anthranilate synthase component I
Protein accession: YP_001191766
Protein GI: 146304450
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR01820] anthranilate synthase component I, archaeal clade
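
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1625666, 1626928)
print(length)                           # 1263
print(expected_protein_length(length))  # 420
```

Both values agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.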


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00216782
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0764877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCT ACCCTATCAC GGCATTTGCT CAGCCCTATG AGGTTTATCA GTGCATTGAG 
AGAGATCAGG AGATTGCTGC TCTCATGGAG AGTGTAGAGG GCTCTCAAAA CACCGCCAGA
TATAGCGTAA TTGCCTGGGG GGTTAAGAGG AAGGTACAGG TGAATAGGGG AGATGACCTG
GAGGAATCGA TACTGAACGC TCTAAGAGGT GTAGAGGAGG GCGAGCTTAG GTTTTCAGGT
GGTCTTCTAG GTTACATATC GTACGACGCG GTGAGAAGAT GGGAAACAGT TAGGGACTTG
AAGCCTGCAA TAGAGGATTG GCCAGACGCC GAGTTCTTCC TTCCAGAGAA CGTTTTGGTC
TTCGATCATG CACTGGGAAA GGTGTTTGTG GAGGGAGATA TACCATCAAT AGCTGGATGT
TTTGAACAGG GGGAATTCAA GGTGACTCCC CATGACGAGT CGATGACTAA ACAAGAGTAT
GAGTCAGGGG TTAACTCGAT ACTAGAATAC ATCAAGTCAG GATACGCATT TCAGGTTGTC
CTCTCCAGGT TCTATAGATA CGCTGTCCAG GGTGACCCAA TGAGACTTTA CAGAAACTTG
CGAAAGATTA ATCCATCTCC CTACATGTTT TACATTAAAT TTGGGGAGAG GAAACTCATT
GGATCCAGTC CGGAGCTTCT ATTCTCAGTT CAAAGGGGGA TCGCTGAAAC TTTCCCGATC
GCGGGCACTA GACCTAGGGG AAAGACCAGT GAAGAGGATT TTGAACTGGA ACAGGAACTT
CTATCCTCTG AGAAGGAGAT GGCCGAGCAC CTAATGCTTG TGGATTTGGC CAGAAACGAC
ATAGGAAAGT CCTGTGTACC AGGAACTGTG AAGGTCCCAG AATTTGCCTA CGTTGAGAAG
TACAGCCACG TACAACACAT TGTTAGTAGA GTGGTGGGAA CCCTGAGGAA GGATGCAAAT
TCCTTGGATG TTCTAAAGTC CATGTTCCCT GCCGGTACAG TCAGCGGTGC TCCAAAGCCC
ATGGCAATGA ACATAATAGA GTTGCTAGAG CCTTACAAGA GGGGTCCCTA TGCTGGTGCA
GTGGGTTTCA TCTCAAGGAA TTCAGCGGAG TTTGCAATCA CCATCAGAAC CGCAATGATT
AACAGGGATA TTCTTCGCAT ACAAGCTGGA GCTGGGATAG TCTACGATTC AGTTCCTGAG
CAGGAGTACT ATGAAACTGA GCATAAAATG AGAGCCCTTA AGGTGGCACT TGGGGTGAGC
TAA
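
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used here only as a short example.
print(gc_percent("ATGAAGACCT ACCCTATCAC"))  # 45.0
```

Running the function on the full gene sequence gives a value that can be compared with the GC content field.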
 
Protein sequence
MKTYPITAFA QPYEVYQCIE RDQEIAALME SVEGSQNTAR YSVIAWGVKR KVQVNRGDDL 
EESILNALRG VEEGELRFSG GLLGYISYDA VRRWETVRDL KPAIEDWPDA EFFLPENVLV
FDHALGKVFV EGDIPSIAGC FEQGEFKVTP HDESMTKQEY ESGVNSILEY IKSGYAFQVV
LSRFYRYAVQ GDPMRLYRNL RKINPSPYMF YIKFGERKLI GSSPELLFSV QRGIAETFPI
AGTRPRGKTS EEDFELEQEL LSSEKEMAEH LMLVDLARND IGKSCVPGTV KVPEFAYVEK
YSHVQHIVSR VVGTLRKDAN SLDVLKSMFP AGTVSGAPKP MAMNIIELLE PYKRGPYAGA
VGFISRNSAE FAITIRTAMI NRDILRIQAG AGIVYDSVPE QEYYETEHKM RALKVALGVS
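
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch that builds the codon table by hand, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGACCT ACCCTATCAC GGCATTTGCT CAGCCCTATG AGGTTTATCA GTGCATTGAG"
print(translate(first_line))  # MKTYPITAFAQPYEVYQCIE
```

The output matches the first 20 residues of the protein sequence (MKTYPITAFA QPYEVYQCIE). The same function can be applied to the full gene sequence, whose final codon TAA is a stop codon.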