Gene Ssol_2401 details

Gene Information

Locus tag: Ssol_2401
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2201943
End bp: 2204063
Gene Length: 2121 bp
Protein Length: 706 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: nitric oxide reductase large subunit-like protein
Protein accession: ACX92559
Protein GI: 261602956
COG category:
COG ID:
TIGRFAM ID:
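
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 2201943
end_bp = 2204063

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the last codon being a stop codon.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2121
print(protein_length)  # 706
```

Both results agree with the Gene Length (2121 bp) and Protein Length (706 aa) fields.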


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAA GAATTAAGGG TGACGTATGG TCAAACTTAG TGTTGGTCGC GACAGTCCTT 
GTATACGTAG TTTACATTGC GTTAGCTGGT TATACGTTAA CGCATTTACC TCCAATTCCA
TCAGTAGTTG AAACCGAAAA CGGTACTGTA TTATTTACGG GAGGTGAAGT CATAAGCGGA
AAGGTACTAA TGCAGAAATA TGGACTATTC GATTACGGTA GCTTTTGGGG ATTCGGAGGT
TATTATGGAA CCGATTTTAC TGCTCTAGCC TTAAAAGTTA TAAATCAAAC TACAGATCCA
CCTACGATAA AGGTAGACGG TCCAGCTTAC TCCTCCATAA CTGATTCAGA GACAAGTAGA
TGGGTAGTGT CCAACAATTA CGTAAAAGCT TACAACACGC TTTACAATGA GCTTTGCAAT
ATTTTGTACA ACAATTCCTC CAACTATGGA CTTAAGCCTA ACTTGGTTAG TCCCAATGAT
TTGAGAAATA TAACGGCATT TATTTTATGG GGTGCTATGA TTTCTCTCCT AGGCTATACT
AATGGTTTTC CCTACATACC TCAACAAACT CAACCATCCG TAAACGTTAG TCTATCAACG
TGGATAATGG TAATCGTCCT TTTAGCGGTC TTAGTTTCAA TGGTGAGCTA CGTTAGTCTA
AAAATACTTG ACCATTGGAG AGATCCTAGA ATATCTGTCC CCCTTCCTCC ACCATCTGCA
TCACAAAGGA TAGGTTTAAT AGGAGTATTC TTCGCCTCAG TCCTCGCGGG TATACAAGGT
TTGTTAGGAT ATTTGGCTAT GCATTATTAC GTAGACCCAG AGGGAATATT AGGTTTAATA
AACTTCCTAC CTTTTAACAT TACTAGGGCC TTACACTTAA ACATGGCAGT GGTGTGGATA
GCACTTACAT GGATTGCCTT TTCAATCTTT GCGTTACCAT ATCTGGGTGT TCCCTTATCC
AGAAAGCTCT CATTTGCAAT TTTAGGCTTA ACGCTATTTG CCGGAGTAGG TCTGCTTCTA
GGAATATTGC TCTCCTATAA TGAATTGATC CCATCACCCT ATTGGTTCAT TTTCGGTGCT
CAAGGAAGGC CAAATGACGC TGATCAAGGT ACCTTCTGGT TGCTCCTAGT TGCGCTAATA
CTCCTTCTAG CCTCTTCATT ATTCTTCAAA GCTTCAAAGT CAACCGCGGA ACCTTTAAGA
CCATTAACGA GAATTACTGC AATAGGTCTC TTGGGATCTG GAATAGGTGC AATCTTCGGA
TCATTACCAA TAATTGCTCC ATGGCCTAAC TTCACCGAAG ACCAGTTCTT CTTATGGATT
ATGATACACT CGTTCGTTGA GGGATTCTGG CCCTCTATAG TAATACCAGT AGTATTAATA
CTTTTAGTTG TCAACAATTT AGTACCACCA AGTCTAGCTA CGATGGCAGC GAGTATAGAT
TCCGCTAGCG AAATACTTTC AGGTATGATA GGTACTGCGC ATCACTATTA TTTTGGAGGT
GAACCAGTAT TCTGGATGTA TTTGGGAGCA TCAGCAGCCA TATTAGAGGT TGTCCCTATT
CTTTTCTTAA CTTATTACGC ATTTTTGCTG TGGAGAAGAG GTGAAGCTAA GACCGAATTT
CAAAAGACGT TAGTAGCTAC AACTTTAATT TCAGCCATTG GAGGGGGATT TGTAGGTGCG
ATTATCGGCG GAGCTTCAAT ATTGAACGCT CCAATAATAA ACTATTATGT CCATGGGTTA
CAATTTACCA TGGCCCATGC GCATCTTGCG TTCCCATTAG TGTGGGGTTT AACTGCTATT
CTAATGTGGA TTGCAGCATT ATATCTCTCC AATGGGATTA AAGAAAACGA GCTAAAGACG
TTAAGAATTA TGATATTAAT TTACGCTATC GGTTTCATAC TTCAAGGAAT AGACTTATGG
GCCTTGGGCG CGGTTCAATT AGCTACAGTT TTAAGGGTAG GTTATTGGGC GGCTAAAGGT
ACACTATTCT ATCTACAACC TATTCTAAAT TTGATAGTTT GGCTTAGAAT AGTTGGCGAT
ATAGTCGCAG GTTTCGCTGC TACTGTAATT ATAATTTACA CGCTCAAGGG TGTAATTAAA
TCTTACAAGA TCAAAATATA A
 
Protein sequence
MAKRIKGDVW SNLVLVATVL VYVVYIALAG YTLTHLPPIP SVVETENGTV LFTGGEVISG 
KVLMQKYGLF DYGSFWGFGG YYGTDFTALA LKVINQTTDP PTIKVDGPAY SSITDSETSR
WVVSNNYVKA YNTLYNELCN ILYNNSSNYG LKPNLVSPND LRNITAFILW GAMISLLGYT
NGFPYIPQQT QPSVNVSLST WIMVIVLLAV LVSMVSYVSL KILDHWRDPR ISVPLPPPSA
SQRIGLIGVF FASVLAGIQG LLGYLAMHYY VDPEGILGLI NFLPFNITRA LHLNMAVVWI
ALTWIAFSIF ALPYLGVPLS RKLSFAILGL TLFAGVGLLL GILLSYNELI PSPYWFIFGA
QGRPNDADQG TFWLLLVALI LLLASSLFFK ASKSTAEPLR PLTRITAIGL LGSGIGAIFG
SLPIIAPWPN FTEDQFFLWI MIHSFVEGFW PSIVIPVVLI LLVVNNLVPP SLATMAASID
SASEILSGMI GTAHHYYFGG EPVFWMYLGA SAAILEVVPI LFLTYYAFLL WRRGEAKTEF
QKTLVATTLI SAIGGGFVGA IIGGASILNA PIINYYVHGL QFTMAHAHLA FPLVWGLTAI
LMWIAALYLS NGIKENELKT LRIMILIYAI GFILQGIDLW ALGAVQLATV LRVGYWAAKG
TLFYLQPILN LIVWLRIVGD IVAGFAATVI IIYTLKGVIK SYKIKI
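
The relationship between the two sequences above can be checked by translating the gene sequence codon by codon. The following is a rough sketch rather than the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are allowed), and the helper names `clean`, `translate`, and `gc_content` are hypothetical:

```python
# Standard genetic code laid out in TCAG order.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "ATGGCTAAAA GAATTAAGGG TGACGTATGG"
print(translate(first_bases))  # MAKRIKGDVW
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.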