Gene Ssol_0034 details

Gene Information

Locus tag: Ssol_0034
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 30333
End bp: 31463
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: aminotransferase class V
Protein accession: ACX90339
Protein GI: 261600736
COG category:
COG ID:
TIGRFAM ID:
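
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumed included).
    return gene_len_bp // 3 - 1


length = gene_length_bp(30333, 31463)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields of this record.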


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.163206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTAA GAGATCCGAG AGAATTTAGG GAAAACGTAC CCGTTACCAG AAAATACGTA 
TATTTAAATC ATGCATCAGT GTCACCCACA CCTTTACCGT CGTTATTTGA GGCTTACAGA
TATTTATATG AAGTTGCAAA TAGGGGAAGC ATAGCTGTTA ATGAAGAGGA AGAGGATGAA
CTGTATCACA TAAGGTCTAA AATATCCAAT TTGGTAGGAG CATACTCAGA TGAGATTTCG
CTAATTCCAA ATACTAGTTA TGGGGTAAAC TTAGTTGCAC ATGGGCTAGA ATGGAAGGGA
GATGATAATA TAGTAACAGA TAACCTTGAG TTCCCAACTG TAGTGTACCC ATTTTTGAAA
TTAACGAAAA AAGGAGTCAA GATAAATATA GTAGAGACTA ATCCCTATAC CTTTGAGGAA
GATATAATAT CACATATTGA TAAAAATACT AGATTAGTTG CAATAAGCCA TGTTAGCTTT
AATACTGGTC TGAAAGTAGA TGTTAGAAAA ATTGTAAAAG CCGCAAGGGA GAACAATACT
CTAGTTCTAT TAGATATCAT ACAGAGTGCT GGTGCAGTCA AAATAAATGT AAAGGAACTT
GGTATAGATT TCGCTATTGC TGGAGGATAT AAATGGTTAA TGAGTCCACA AGGATCCGGA
TTTATCTATG TTAAAAGAGG ATTGATAGAA GATCCACCGT TTTATGGATG GAAAACTAGT
GCTGATTACT TGGATTTTAA TCCAAATAAG TTTACATTAG AGAAGGGTCC TAGAAGATTT
GAAATAGGTA CAGTAGATTT AGCTGCAAAC TTATCACTTG CAAAGTCTTG CGAAATAATA
GGCGAAAATA TGGAATTAAT TGAGAGTTCA GTGACGAATC TTTCCCAATT TGCAATAAGA
TTAGCAAAGG ACCATAGCAT GGAGGTAATC ACTCCAGAGG ATAAGAGAGC TGGAATTGTC
ATAGTAAAGG TTAAAAAACC TAAAGAGATA GCGAAGGAAC TATTAAAGGA AAACATAGTT
GTGTCGCCAA GAGGAGAAGG GATAAGGATA TCAACGCACT TCTACAATAC AGAGGAGGAA
GTTCAAAAGA CTATTGAGAA AATCTCAGAA CTCGAAAGAA AATTCAACTA G
 
Protein sequence
MRLRDPREFR ENVPVTRKYV YLNHASVSPT PLPSLFEAYR YLYEVANRGS IAVNEEEEDE 
LYHIRSKISN LVGAYSDEIS LIPNTSYGVN LVAHGLEWKG DDNIVTDNLE FPTVVYPFLK
LTKKGVKINI VETNPYTFEE DIISHIDKNT RLVAISHVSF NTGLKVDVRK IVKAARENNT
LVLLDIIQSA GAVKINVKEL GIDFAIAGGY KWLMSPQGSG FIYVKRGLIE DPPFYGWKTS
ADYLDFNPNK FTLEKGPRRF EIGTVDLAAN LSLAKSCEII GENMELIESS VTNLSQFAIR
LAKDHSMEVI TPEDKRAGIV IVKVKKPKEI AKELLKENIV VSPRGEGIRI STHFYNTEEE
VQKTIEKISE LERKFN
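
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. Only the first row of the gene sequence is used as input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listed above, copied verbatim.
first_row = "ATGAGATTAA GAGATCCGAG AGAATTTAGG GAAAACGTAC CCGTTACCAG AAAATACGTA "
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would allow a comparison with the GC content field of the record.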