Gene Ssol_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1703 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1528199 
End bp1531354 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content37% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionACX91920 
Protein GI261602317 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAA CTCCAAAAAA GGTTCTGGTT ATTGGATCTG GACCTATAAA AATAGCTGAA 
GCTGCAGAGT TTGATTACAG TGGAAGTCAA GCGTTAAAGG CATTAAAAGA GGAAGGAATA
GAGACAGTAT TAGTAAATTC TAACGTGGCA ACAGTACAGA CTAGCAAGAA ATTTGCTGAT
AAATTATACA TGTTACCAGT AGTCTGGTGG GCTGTAGAGA AGGTTATAGA GAAGGAGAGA
CCAGATGGTA TAATGATAGG GTTTGGAGGA CAAACTGCGT TGAATGTTGG AGTAGACTTG
CATAAAAAGG GAGTATTACA GAAATATAAT GTCAAAGTAT TAGGAACGCA AATTGATGGA
ATAGAGAAAG CACTGAGCAG AGAAAAGTTT AGAGAAACAA TGATAGAGAA CAATTTACCA
GTACCCCCAA GTTTATCTGC AAGGAGTGAA GAAGAGGCGA TAAAGAACGC AAAAATCGTG
GGATATCCAG TAATGGTGAG AGTAAGCTTT AATCTAGGCG GAAGAGGCTC TATGGTAGCG
TGGACTGAAG AAGACTTGAA GAAAAATATT AGGAGAGCGT TATCCCAAAG TTACATAGGA
GAAGTTCTGC TTGAGAAATA TCTTTACCAT TGGATAGAAT TAGAATATGA GGTAATGAGA
GACAAAAAGG GCAATTCTGC TGTAATAGCA TGTATTGAAA ATCTAGATCC GATGGGAGTC
CACACTGGAG AGTCAACAGT AGTTGCTCCT TGCCAGACTT TAGATAATCT TGAATATCAA
AATATGAGAA CCTATACAAT AGAAGTTGCA AGATCAATTA ATCTAATCGG GGAGTGCAAT
GTACAATTTG CGCTTAACCC TAGAGGCTAC GAATATTACA TTATAGAAAC TAACCCGAGA
ATGTCTAGAT CAAGTGCTTT AGCTAGTAAG GCTACTGGTT ATCCACTAGC ATATGTTTCT
GCTAAACTAG CACTAGGATA CGAACTCCAT GAAGTTATAA ATAAAGTCTC TGGAAGGACT
TGTGCATGTT TTGAACCAAG TTTAGACTAT ATAGTAACTA AAATCCCAAG ATGGGATCTA
AGCAAGTTTG AGAACGTCGA TCAATCGTTA GCAACTGAGA TGATGAGCGT GGGAGAGGTC
ATGAGTATAG GAAGATCGTT TGAGGAGAGT TTGCAGAAAG CTATAAGGAT GCTAGATATT
GGTGAACCTG GGGTCGTAGG TGGAAAGGTA TATGAGTCCA ATATGAGTAA AGAAGAAGCG
CTAAAGTACC TCAAAGAAAG AAGACCATAC TGGTTCTTAT ACGCAGCTAA AGCCTTCAAA
GAGGGAGCAA CAATTAATGA AGTATATGAG GTTACTGGAA TTAACGAGTT CTTTTTAAAT
AAAATCAAAG GATTAGTAGA TTTCTACGAA ACTCTCAGAA AATTGAAAGA GATCGATAAG
GAAACATTAA AACTCGCTAA GAAATTGGGA TTTAGCGATG AGCAGATATC TAAAGCGCTA
AATAAGTCTA CTGAATACGT GAGAAAAATT AGATACGAAA CCAACACAAT ACCAGTGGTA
AAGCTTATAG ACACGTTAGC TGGCGAATGG CCTGCAGTTA CTAATTACAT GTATTTAACA
TATAATGGTA CGGAAGATGA TATAGAATTC TCACAGGGGA ACAAATTACT AATAATTGGT
GCAGGAGGTT TTAGAATAGG AGTTTCAGTT GAGTTTGATT GGAGTGTTGT ATCTCTAATG
GAAGCAGGAT CAAAGTACTT TGATGAAGTA GCAGTACTTA ATTATAATCC GGAAACTGTC
TCAACTGATT GGGATATTGC GAGGAAGCTT TATTTTGACG AAATTAGCGT GGAAAGAGTA
TTGGATTTAA TTAAGAAGGA GAAGTTTAGA TATGTTGCCA CGTTTTCAGG TGGACAAATA
GGCAATTCAA TAGCTAAAGA GTTGGAGGAA AATGGAGTAA GGTTATTAGG AACATCGGGT
AGCAGTGTAG ATATCGCAGA AAATAGGGAA AAGTTCTCAA AACTATTAGA TAAGTTAGGT
ATTTCGCAGC CGGATTGGAT ATCTGCAACT TCATTAGGCG AAATTAAGAA GTTTGCCAAT
GAAGTAGGAT TTCCAGTTCT TGTAAGACCT AGTTATGTTC TAAGTGGATC GTCAATGAAA
ATAGCTTATT CAGAAGAGGA GCTTTATGAA TATGTAAGAA GAGCTACTGA GATTTCTCCT
AAATATCCAG TAGTGATTTC AAAGTACATC GAGAACGCTA TAGAAGCTGA GATTGATGGA
GTTTCAGATG GAAATAAGGT ATTAGGAATA ACATTAGAGC ATATAGAAGA GGCTGGAGTC
CATAGCGGAG ATGCAACCAT GTCAATTCCC TTTAGAAAAT TGTCTGAAAA TAATGTGAAT
AGAATGAGAG AAAATGTGTT AAATATAGCT AGAGAACTGA ATATTAAAGG GCCATTCAAC
GTACAATTTG TGGTAAAAGA AAATACTCCA TATATAATCG AATTAAATCT TAGAGCAAGT
AGATCAATGC CATTTAGTAG CAAAGCAAAG GGTATAAATC TAATAAATGA GTCAATGAAA
GCGATATTTG ATGGTCTAGA TTTCTCCGAG GATTATTATG AACCACCATC CAAGTACTGG
GCGGTGAAGA GTGCTCAATT CTCTTGGTCT CAATTAAGAG GAGCTTACCC ATTCTTGGGA
CCAGAAATGA AAAGTACTGG AGAGGCTGCA TCATTTGGAG TGACATTTTA TGACGCATTA
CTAAAGAGTT GGCTTTCGTC TATGCCTAAT AGGATACCAA ATAAAAATGG AATAGCGTTA
GTTTATGGGA ATAAAAATTT GGATTACTTA AAGGATACTG CAGATAATTT AACTAGGTTT
GGACTAACGG TCTATAGTAT ATCTGAGCTT CCATTACAAG ATATAGAAAC AATAGACAAA
ATGAAGGCAG AGGAGCTGGT AAGAGCCAAG AAGGTAGAGA TAATAGTAAC GGATGGTTAT
CTAAAGAAAT TTGATTATAA CATTAGAAGA ACAGCTGTTG ATTATAATAT TCCGATAATT
CTTAACGGTA GATTAGGTTA TGAGGTGAGT AAGGCTTTCC TAAATTATGA CTCACTTACT
TTTTTTGAAA TATCTGAGTA TGGAGGAGGA ATATGA
 
Protein sequence
MRETPKKVLV IGSGPIKIAE AAEFDYSGSQ ALKALKEEGI ETVLVNSNVA TVQTSKKFAD 
KLYMLPVVWW AVEKVIEKER PDGIMIGFGG QTALNVGVDL HKKGVLQKYN VKVLGTQIDG
IEKALSREKF RETMIENNLP VPPSLSARSE EEAIKNAKIV GYPVMVRVSF NLGGRGSMVA
WTEEDLKKNI RRALSQSYIG EVLLEKYLYH WIELEYEVMR DKKGNSAVIA CIENLDPMGV
HTGESTVVAP CQTLDNLEYQ NMRTYTIEVA RSINLIGECN VQFALNPRGY EYYIIETNPR
MSRSSALASK ATGYPLAYVS AKLALGYELH EVINKVSGRT CACFEPSLDY IVTKIPRWDL
SKFENVDQSL ATEMMSVGEV MSIGRSFEES LQKAIRMLDI GEPGVVGGKV YESNMSKEEA
LKYLKERRPY WFLYAAKAFK EGATINEVYE VTGINEFFLN KIKGLVDFYE TLRKLKEIDK
ETLKLAKKLG FSDEQISKAL NKSTEYVRKI RYETNTIPVV KLIDTLAGEW PAVTNYMYLT
YNGTEDDIEF SQGNKLLIIG AGGFRIGVSV EFDWSVVSLM EAGSKYFDEV AVLNYNPETV
STDWDIARKL YFDEISVERV LDLIKKEKFR YVATFSGGQI GNSIAKELEE NGVRLLGTSG
SSVDIAENRE KFSKLLDKLG ISQPDWISAT SLGEIKKFAN EVGFPVLVRP SYVLSGSSMK
IAYSEEELYE YVRRATEISP KYPVVISKYI ENAIEAEIDG VSDGNKVLGI TLEHIEEAGV
HSGDATMSIP FRKLSENNVN RMRENVLNIA RELNIKGPFN VQFVVKENTP YIIELNLRAS
RSMPFSSKAK GINLINESMK AIFDGLDFSE DYYEPPSKYW AVKSAQFSWS QLRGAYPFLG
PEMKSTGEAA SFGVTFYDAL LKSWLSSMPN RIPNKNGIAL VYGNKNLDYL KDTADNLTRF
GLTVYSISEL PLQDIETIDK MKAEELVRAK KVEIIVTDGY LKKFDYNIRR TAVDYNIPII
LNGRLGYEVS KAFLNYDSLT FFEISEYGGG I