Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1703 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1528199 |
End bp | 1531354 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | ACX91920 |
Protein GI | 261602317 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAAA CTCCAAAAAA GGTTCTGGTT ATTGGATCTG GACCTATAAA AATAGCTGAA GCTGCAGAGT TTGATTACAG TGGAAGTCAA GCGTTAAAGG CATTAAAAGA GGAAGGAATA GAGACAGTAT TAGTAAATTC TAACGTGGCA ACAGTACAGA CTAGCAAGAA ATTTGCTGAT AAATTATACA TGTTACCAGT AGTCTGGTGG GCTGTAGAGA AGGTTATAGA GAAGGAGAGA CCAGATGGTA TAATGATAGG GTTTGGAGGA CAAACTGCGT TGAATGTTGG AGTAGACTTG CATAAAAAGG GAGTATTACA GAAATATAAT GTCAAAGTAT TAGGAACGCA AATTGATGGA ATAGAGAAAG CACTGAGCAG AGAAAAGTTT AGAGAAACAA TGATAGAGAA CAATTTACCA GTACCCCCAA GTTTATCTGC AAGGAGTGAA GAAGAGGCGA TAAAGAACGC AAAAATCGTG GGATATCCAG TAATGGTGAG AGTAAGCTTT AATCTAGGCG GAAGAGGCTC TATGGTAGCG TGGACTGAAG AAGACTTGAA GAAAAATATT AGGAGAGCGT TATCCCAAAG TTACATAGGA GAAGTTCTGC TTGAGAAATA TCTTTACCAT TGGATAGAAT TAGAATATGA GGTAATGAGA GACAAAAAGG GCAATTCTGC TGTAATAGCA TGTATTGAAA ATCTAGATCC GATGGGAGTC CACACTGGAG AGTCAACAGT AGTTGCTCCT TGCCAGACTT TAGATAATCT TGAATATCAA AATATGAGAA CCTATACAAT AGAAGTTGCA AGATCAATTA ATCTAATCGG GGAGTGCAAT GTACAATTTG CGCTTAACCC TAGAGGCTAC GAATATTACA TTATAGAAAC TAACCCGAGA ATGTCTAGAT CAAGTGCTTT AGCTAGTAAG GCTACTGGTT ATCCACTAGC ATATGTTTCT GCTAAACTAG CACTAGGATA CGAACTCCAT GAAGTTATAA ATAAAGTCTC TGGAAGGACT TGTGCATGTT TTGAACCAAG TTTAGACTAT ATAGTAACTA AAATCCCAAG ATGGGATCTA AGCAAGTTTG AGAACGTCGA TCAATCGTTA GCAACTGAGA TGATGAGCGT GGGAGAGGTC ATGAGTATAG GAAGATCGTT TGAGGAGAGT TTGCAGAAAG CTATAAGGAT GCTAGATATT GGTGAACCTG GGGTCGTAGG TGGAAAGGTA TATGAGTCCA ATATGAGTAA AGAAGAAGCG CTAAAGTACC TCAAAGAAAG AAGACCATAC TGGTTCTTAT ACGCAGCTAA AGCCTTCAAA GAGGGAGCAA CAATTAATGA AGTATATGAG GTTACTGGAA TTAACGAGTT CTTTTTAAAT AAAATCAAAG GATTAGTAGA TTTCTACGAA ACTCTCAGAA AATTGAAAGA GATCGATAAG GAAACATTAA AACTCGCTAA GAAATTGGGA TTTAGCGATG AGCAGATATC TAAAGCGCTA AATAAGTCTA CTGAATACGT GAGAAAAATT AGATACGAAA CCAACACAAT ACCAGTGGTA AAGCTTATAG ACACGTTAGC TGGCGAATGG CCTGCAGTTA CTAATTACAT GTATTTAACA TATAATGGTA CGGAAGATGA TATAGAATTC TCACAGGGGA ACAAATTACT AATAATTGGT GCAGGAGGTT TTAGAATAGG AGTTTCAGTT GAGTTTGATT GGAGTGTTGT ATCTCTAATG GAAGCAGGAT CAAAGTACTT TGATGAAGTA GCAGTACTTA ATTATAATCC GGAAACTGTC TCAACTGATT GGGATATTGC GAGGAAGCTT TATTTTGACG AAATTAGCGT GGAAAGAGTA TTGGATTTAA TTAAGAAGGA GAAGTTTAGA TATGTTGCCA CGTTTTCAGG TGGACAAATA GGCAATTCAA TAGCTAAAGA GTTGGAGGAA AATGGAGTAA GGTTATTAGG AACATCGGGT AGCAGTGTAG ATATCGCAGA AAATAGGGAA AAGTTCTCAA AACTATTAGA TAAGTTAGGT ATTTCGCAGC CGGATTGGAT ATCTGCAACT TCATTAGGCG AAATTAAGAA GTTTGCCAAT GAAGTAGGAT TTCCAGTTCT TGTAAGACCT AGTTATGTTC TAAGTGGATC GTCAATGAAA ATAGCTTATT CAGAAGAGGA GCTTTATGAA TATGTAAGAA GAGCTACTGA GATTTCTCCT AAATATCCAG TAGTGATTTC AAAGTACATC GAGAACGCTA TAGAAGCTGA GATTGATGGA GTTTCAGATG GAAATAAGGT ATTAGGAATA ACATTAGAGC ATATAGAAGA GGCTGGAGTC CATAGCGGAG ATGCAACCAT GTCAATTCCC TTTAGAAAAT TGTCTGAAAA TAATGTGAAT AGAATGAGAG AAAATGTGTT AAATATAGCT AGAGAACTGA ATATTAAAGG GCCATTCAAC GTACAATTTG TGGTAAAAGA AAATACTCCA TATATAATCG AATTAAATCT TAGAGCAAGT AGATCAATGC CATTTAGTAG CAAAGCAAAG GGTATAAATC TAATAAATGA GTCAATGAAA GCGATATTTG ATGGTCTAGA TTTCTCCGAG GATTATTATG AACCACCATC CAAGTACTGG GCGGTGAAGA GTGCTCAATT CTCTTGGTCT CAATTAAGAG GAGCTTACCC ATTCTTGGGA CCAGAAATGA AAAGTACTGG AGAGGCTGCA TCATTTGGAG TGACATTTTA TGACGCATTA CTAAAGAGTT GGCTTTCGTC TATGCCTAAT AGGATACCAA ATAAAAATGG AATAGCGTTA GTTTATGGGA ATAAAAATTT GGATTACTTA AAGGATACTG CAGATAATTT AACTAGGTTT GGACTAACGG TCTATAGTAT ATCTGAGCTT CCATTACAAG ATATAGAAAC AATAGACAAA ATGAAGGCAG AGGAGCTGGT AAGAGCCAAG AAGGTAGAGA TAATAGTAAC GGATGGTTAT CTAAAGAAAT TTGATTATAA CATTAGAAGA ACAGCTGTTG ATTATAATAT TCCGATAATT CTTAACGGTA GATTAGGTTA TGAGGTGAGT AAGGCTTTCC TAAATTATGA CTCACTTACT TTTTTTGAAA TATCTGAGTA TGGAGGAGGA ATATGA
|
Protein sequence | MRETPKKVLV IGSGPIKIAE AAEFDYSGSQ ALKALKEEGI ETVLVNSNVA TVQTSKKFAD KLYMLPVVWW AVEKVIEKER PDGIMIGFGG QTALNVGVDL HKKGVLQKYN VKVLGTQIDG IEKALSREKF RETMIENNLP VPPSLSARSE EEAIKNAKIV GYPVMVRVSF NLGGRGSMVA WTEEDLKKNI RRALSQSYIG EVLLEKYLYH WIELEYEVMR DKKGNSAVIA CIENLDPMGV HTGESTVVAP CQTLDNLEYQ NMRTYTIEVA RSINLIGECN VQFALNPRGY EYYIIETNPR MSRSSALASK ATGYPLAYVS AKLALGYELH EVINKVSGRT CACFEPSLDY IVTKIPRWDL SKFENVDQSL ATEMMSVGEV MSIGRSFEES LQKAIRMLDI GEPGVVGGKV YESNMSKEEA LKYLKERRPY WFLYAAKAFK EGATINEVYE VTGINEFFLN KIKGLVDFYE TLRKLKEIDK ETLKLAKKLG FSDEQISKAL NKSTEYVRKI RYETNTIPVV KLIDTLAGEW PAVTNYMYLT YNGTEDDIEF SQGNKLLIIG AGGFRIGVSV EFDWSVVSLM EAGSKYFDEV AVLNYNPETV STDWDIARKL YFDEISVERV LDLIKKEKFR YVATFSGGQI GNSIAKELEE NGVRLLGTSG SSVDIAENRE KFSKLLDKLG ISQPDWISAT SLGEIKKFAN EVGFPVLVRP SYVLSGSSMK IAYSEEELYE YVRRATEISP KYPVVISKYI ENAIEAEIDG VSDGNKVLGI TLEHIEEAGV HSGDATMSIP FRKLSENNVN RMRENVLNIA RELNIKGPFN VQFVVKENTP YIIELNLRAS RSMPFSSKAK GINLINESMK AIFDGLDFSE DYYEPPSKYW AVKSAQFSWS QLRGAYPFLG PEMKSTGEAA SFGVTFYDAL LKSWLSSMPN RIPNKNGIAL VYGNKNLDYL KDTADNLTRF GLTVYSISEL PLQDIETIDK MKAEELVRAK KVEIIVTDGY LKKFDYNIRR TAVDYNIPII LNGRLGYEVS KAFLNYDSLT FFEISEYGGG I
|
| |