Gene Ssol_2788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2788 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2549576 
End bp2551402 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content39% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionACX92871 
Protein GI261603268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.374642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATTTA ATCCAGATAA GGAATGGATA GAGAACAGTA ACGTATATAA GTTCATGATT 
AAAAGGAACC TTAATAGATT AGAAGACTTC GTAAGGTATA CATATGAGAA TCCTGAATTT
TGGGACCAGT TTGTAAAGCT TATTGGAGTG GAATTTAAAG AACCTTATGC CAAGGTTTTG
GATCTGAGCA GAGGAAAACA ATGGCCACAA TGGTTCATAG GAGGGAAGTT AAACATTGGA
GATCAATTAC GTGATAGTTC TGATGTGTTC ATTAAGTGGA TGGATGAAGA CCTCAACACC
AGAACTGTAA CTTATTCTCA AATACTAAAT GAGAGCAAGT CTATTGCAAG TTGGTTAAAG
AAGATAGGTT TGAAGAAGGG AGATAGGGTC GCTATTTACA TGCCAATGAT CCCGGAAATA
GTATCAGTAA TGCTGGGAGC AATAAGGGTA GGGACGATAA TCGTACCCTT ATTTTCCGGA
TTTGGTCCAG AGCCCATAAG GGTTAGAGTG GAGGATAGTG AGGCGAAAGT AATCTTCACA
GTTGATAAGA GCATTAGAAG AGGAAAAGAA GTCGACATGT TAAAAAATCT AGAAGGACTA
AACGATAATA TAACTAAAGT AGTATTGAAT AGAGGAGGCA CTAAGGGTGA TTTTTATGAG
TATAAGGATG TCATAAAAAC TGCTGGAGAT TATGTAGAAG ATACCAGTAC TGAAGACCCA
ATGATGATAA TTTACACTTC TGGAACTACT GGAAAACCCA AAGGATGCGT TCATACTCAC
GATGGGTTTC CAATAAAAGC TTCAGCTGAC ATTTACTTCC AATTCGATTT GAAAAATGGA
GAGACGTTAA TGTGGGTTAC TGACATGGGG TGGATGATGG GACCCTGGAT GGTATTTGGA
TCACTCTTAC TAAACGCTAA AATGGGAATG ATTGAAGGAT ATACAAGTGG GGAAGTCCTA
CAGAAATTCG TTGAAGATAT GAAAGTTGAT GTTTTAGGGG TCTCAGCTAG TCTAGTTAGG
GCACTGAGAA GTCAAGGCGA AGTTAAGCTA AACGTTAGGT TAACTGGAAA CACTGGAGAA
CCAATTGATT CGGAAAGCTG GTATTGGTTA TTCAACGCCA GTGGAAAGAA CCCCATAATA
AACTACTCTG GAGGTACTGA AATCTCTGGA GGTATTTTGG GGAATTACGT TATAAAGAAG
ATAAAGCCAT CTTCATTTAA TGGACCTTCT CCGGGAATTA ACGCCTCAGT ATTTAATGAG
GAAGGAAAAG ACGCCCCACC AAATGTTGAA GGAGAGTTAG TCGTATTAAG TGTTTGGCCA
GGTATGACTA GAGGATTTTG GAGAAACCCG GAAAGGTATA TAGAGACCTA CTGGTCAGTG
TGGAAAGATG TATGGGTTCA TGGAGATTTG GCTTATAGAG ACGAAGAAGG GTATTTTTAC
ATCGTAGGTA GGAGTGATGA TACGATAAAG GTTGCTGGGA AGAGAGTAGG TCCGGCTGAA
ATCGAAAGCG TATTAAATTC ATTCCCAAAC GTAGTGGAAT CAGCATGTAT AGGAATACCT
GATCCGATGA AGGGGGAGAA AATAGTCTGT TTTGTAGTCT CTAAGGTTAG TGGGATTGAA
AATCAATTAA TAGAATACAC AGAGGATAAA CTGGGTAAGG CATTTGCACC ATCTGAAATA
AAGATTGTTA AGGAGCTACC TAAGACTAGA AATGCTAAAA TAATGAGGAG ATTGATAAGA
GCCATATATT TAAATAAACC CTTAGGAGAT ATATCCTCAC TGGAAAATCC GTCAGCTTTG
GAGGAAATTA AGAAGGCCAT CAGTTAG
 
Protein sequence
MVFNPDKEWI ENSNVYKFMI KRNLNRLEDF VRYTYENPEF WDQFVKLIGV EFKEPYAKVL 
DLSRGKQWPQ WFIGGKLNIG DQLRDSSDVF IKWMDEDLNT RTVTYSQILN ESKSIASWLK
KIGLKKGDRV AIYMPMIPEI VSVMLGAIRV GTIIVPLFSG FGPEPIRVRV EDSEAKVIFT
VDKSIRRGKE VDMLKNLEGL NDNITKVVLN RGGTKGDFYE YKDVIKTAGD YVEDTSTEDP
MMIIYTSGTT GKPKGCVHTH DGFPIKASAD IYFQFDLKNG ETLMWVTDMG WMMGPWMVFG
SLLLNAKMGM IEGYTSGEVL QKFVEDMKVD VLGVSASLVR ALRSQGEVKL NVRLTGNTGE
PIDSESWYWL FNASGKNPII NYSGGTEISG GILGNYVIKK IKPSSFNGPS PGINASVFNE
EGKDAPPNVE GELVVLSVWP GMTRGFWRNP ERYIETYWSV WKDVWVHGDL AYRDEEGYFY
IVGRSDDTIK VAGKRVGPAE IESVLNSFPN VVESACIGIP DPMKGEKIVC FVVSKVSGIE
NQLIEYTEDK LGKAFAPSEI KIVKELPKTR NAKIMRRLIR AIYLNKPLGD ISSLENPSAL
EEIKKAIS