Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1874 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1665737 |
End bp | 1667002 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | anthranilate synthase component I |
Protein accession | ACX92086 |
Protein GI | 261602483 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGTTC ATCCAATAAG TGAATTTGCC TCACCATTCG AAGTATTTAA GTGTATAGAG AGAGACTTTA AAGTAGCTGG ATTACTAGAG AGCATAGGTG GCCCTCAATA TAAGGCGAGA TATAGTGTGA TAGCTTGGTC AACTAATGGG TATCTGAAAA TTCATGACGA CCCTGTAAAT ATTCTTAATG GTTATTTGAA AGATTTGAAA TTAGCAGATA TACCGGGGTT ATTCAAAGGA GGTATGATAG GATACATAAG TTACGATGCA GTAAGATTTT GGGAGAAAAT AAGAGACTTA AAGCCAGCAG CTGAAGATTG GCCTTATGCG GAATTCTTTA CTCCAGATAA CATCATAATC TATGATCATA ATGAGGGCAA AGTATACGTT AATGCTGATT TAAGCTCTGT AGGGGGATGT GGTGATATAG GGGAGTTTAA AGTAAGCTTT TACGATGAGT CTCTTAATAA GAACAGTTAT GAGAGGATTG TTTCCGAATC ATTAGAGTAT ATAAGATCTG GTTACATATT TCAAGTTGTA TTGTCTAGAT TTTACAGATA TATATTTAGT GGAGATCCAT TAAGAATATA TTATAATCTA AGGAGAATAA ATCCATCCCC TTACATGTTT TATCTCAAAT TTGATGAAAA ATACTTAATA GGATCTAGTC CGGAATTACT GTTCAGAGTT CAAGATAATA TAGTTGAAAC CTATCCCATA GCTGGCACTA GACCTAGGGG CGCTGATCAA GAGGAAGATC TTAAATTGGA ATTGGAATTA ATGAACTCAG AAAAGGATAA AGCTGAGCAC TTAATGCTGG TTGATTTGGC TAGAAATGAT CTAGGTAAAG TATGCGTTCC AGGGACTGTA AAAGTACCAG AATTAATGTA TGTCGAGAAG TATAGCCATG TCCAACACAT AGTATCAAAA GTGATTGGGA CCTTAAAGAA GAAGTATAAC GCGTTAAACG TTTTATCGGC TACATTCCCA GCAGGTACAG TAAGCGGAGC ACCTAAACCA ATGGCAATGA ATATAATTGA AACGTTAGAG GAGTATAAAA GGGGTCCTTA TGCAGGTGCT GTAGGTTTTA TCTCAGCTGA TGGTAACGCA GAGTTCGCAA TAGCGATAAG AACTGCATTT CTAAACAAAG AGTTATTACG AATACATGCC GGTGCTGGTA TAGTATATGA CTCTAATCCA GAATCTGAAT ATTTCGAAAC TGAACATAAA CTAAAAGCAC TAAAAACAGC AATAGGGGTG AGGTAA
|
Protein sequence | MEVHPISEFA SPFEVFKCIE RDFKVAGLLE SIGGPQYKAR YSVIAWSTNG YLKIHDDPVN ILNGYLKDLK LADIPGLFKG GMIGYISYDA VRFWEKIRDL KPAAEDWPYA EFFTPDNIII YDHNEGKVYV NADLSSVGGC GDIGEFKVSF YDESLNKNSY ERIVSESLEY IRSGYIFQVV LSRFYRYIFS GDPLRIYYNL RRINPSPYMF YLKFDEKYLI GSSPELLFRV QDNIVETYPI AGTRPRGADQ EEDLKLELEL MNSEKDKAEH LMLVDLARND LGKVCVPGTV KVPELMYVEK YSHVQHIVSK VIGTLKKKYN ALNVLSATFP AGTVSGAPKP MAMNIIETLE EYKRGPYAGA VGFISADGNA EFAIAIRTAF LNKELLRIHA GAGIVYDSNP ESEYFETEHK LKALKTAIGV R
|
| |