Gene Ssol_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1874 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1665737 
End bp1667002 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content36% 
IMG OID 
Productanthranilate synthase component I 
Protein accessionACX92086 
Protein GI261602483 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTTC ATCCAATAAG TGAATTTGCC TCACCATTCG AAGTATTTAA GTGTATAGAG 
AGAGACTTTA AAGTAGCTGG ATTACTAGAG AGCATAGGTG GCCCTCAATA TAAGGCGAGA
TATAGTGTGA TAGCTTGGTC AACTAATGGG TATCTGAAAA TTCATGACGA CCCTGTAAAT
ATTCTTAATG GTTATTTGAA AGATTTGAAA TTAGCAGATA TACCGGGGTT ATTCAAAGGA
GGTATGATAG GATACATAAG TTACGATGCA GTAAGATTTT GGGAGAAAAT AAGAGACTTA
AAGCCAGCAG CTGAAGATTG GCCTTATGCG GAATTCTTTA CTCCAGATAA CATCATAATC
TATGATCATA ATGAGGGCAA AGTATACGTT AATGCTGATT TAAGCTCTGT AGGGGGATGT
GGTGATATAG GGGAGTTTAA AGTAAGCTTT TACGATGAGT CTCTTAATAA GAACAGTTAT
GAGAGGATTG TTTCCGAATC ATTAGAGTAT ATAAGATCTG GTTACATATT TCAAGTTGTA
TTGTCTAGAT TTTACAGATA TATATTTAGT GGAGATCCAT TAAGAATATA TTATAATCTA
AGGAGAATAA ATCCATCCCC TTACATGTTT TATCTCAAAT TTGATGAAAA ATACTTAATA
GGATCTAGTC CGGAATTACT GTTCAGAGTT CAAGATAATA TAGTTGAAAC CTATCCCATA
GCTGGCACTA GACCTAGGGG CGCTGATCAA GAGGAAGATC TTAAATTGGA ATTGGAATTA
ATGAACTCAG AAAAGGATAA AGCTGAGCAC TTAATGCTGG TTGATTTGGC TAGAAATGAT
CTAGGTAAAG TATGCGTTCC AGGGACTGTA AAAGTACCAG AATTAATGTA TGTCGAGAAG
TATAGCCATG TCCAACACAT AGTATCAAAA GTGATTGGGA CCTTAAAGAA GAAGTATAAC
GCGTTAAACG TTTTATCGGC TACATTCCCA GCAGGTACAG TAAGCGGAGC ACCTAAACCA
ATGGCAATGA ATATAATTGA AACGTTAGAG GAGTATAAAA GGGGTCCTTA TGCAGGTGCT
GTAGGTTTTA TCTCAGCTGA TGGTAACGCA GAGTTCGCAA TAGCGATAAG AACTGCATTT
CTAAACAAAG AGTTATTACG AATACATGCC GGTGCTGGTA TAGTATATGA CTCTAATCCA
GAATCTGAAT ATTTCGAAAC TGAACATAAA CTAAAAGCAC TAAAAACAGC AATAGGGGTG
AGGTAA
 
Protein sequence
MEVHPISEFA SPFEVFKCIE RDFKVAGLLE SIGGPQYKAR YSVIAWSTNG YLKIHDDPVN 
ILNGYLKDLK LADIPGLFKG GMIGYISYDA VRFWEKIRDL KPAAEDWPYA EFFTPDNIII
YDHNEGKVYV NADLSSVGGC GDIGEFKVSF YDESLNKNSY ERIVSESLEY IRSGYIFQVV
LSRFYRYIFS GDPLRIYYNL RRINPSPYMF YLKFDEKYLI GSSPELLFRV QDNIVETYPI
AGTRPRGADQ EEDLKLELEL MNSEKDKAEH LMLVDLARND LGKVCVPGTV KVPELMYVEK
YSHVQHIVSK VIGTLKKKYN ALNVLSATFP AGTVSGAPKP MAMNIIETLE EYKRGPYAGA
VGFISADGNA EFAIAIRTAF LNKELLRIHA GAGIVYDSNP ESEYFETEHK LKALKTAIGV
R