Gene Ssol_1740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1740 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1557954 
End bp1559933 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content36% 
IMG OID 
Productglutamate synthase alpha subunit domain protein 
Protein accessionACX91957 
Protein GI261602354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.627915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGACT ATTATCCTTC TGGTTGTGGT GTTTTTGGGA TCTTAAGGAA AAGAGATGCC 
CCTAAAGTTA AAGGCGATTT AGTTGTAAGA GCAATAGATA GAGTTAGATA TAGGGGTAGT
GATAAGGGTG CAGGATTTGC TGTATTTAAT TTAGAAAAAA GAAACTATTA TGTTATTAAA
GCATTTTATA ATGGAAACCC AAGTGAACTA AAGGAGGTAT TTAGTAAGTA TGGTATAGAA
GTTAAGAATG TTGAGTTAGT AAGTAAGTAT TCGAATTTGT GTGATTGTAA TCTAATTGCT
TTAGGCGATA TAAATGAAGT TAGGAAGGCT ATAAGGAATA TAAATGAGAT TATGTGGAAT
GGTAAAGAGA GGAAAGGCAG AGTATATAGC GTTGGTAGTT CTCTTCATGT TTATAAGGGT
GTTGGATATC CTAGAGATGT AGCTGAACAA TATCGTGTTG AGGAAATCGA GGGTGATTTA
TGGTTAGCAC ATACTAGACA GCCCACGAAT TCTCCTGGTT ATTATCCATT TTGGTCTCAT
CCTTTTTCCT CATTTAATAT TGCTATAGTA CATAATGGTG ATGTTAGCTC ATTTGGCGCA
AACGTTGAAT ATCTAAACTC AAGGGGGTTA AATAGTTTTG TAGGAACTGA CAGTGAAGTA
TTGGCTTTCT TATTTGAGGA ACTCATCGCA GAAGGCTTAA CTGTTGAAGA AGCGGTAAAG
ATTCTAATTA ATCCATCTAG AAGATTCGAT GGCTTACCTA AAGACGTTGA TTATCTATAT
AGAAACGCTA GACTTGATGG TCCATTTACT GCAGTTATTG GTTATGATTC TGGTGATGAT
TTATATTTGA TAGCTATTGC TGATAGATCT AAATTTAGGC CGGCCATAGT TGGTGAGGAT
GATTCGTATT ATTATATAGC AAGTGAGGAG AATGAGATTA GGGAAGTAAG CCCTAAGGCC
AAAGTTTGGA CGCTCAAACC CGGTTCTTAT TTTATAGCGT CTTACAAAAA GGGAGTCATA
TCGTACGGTA GGAGCAATGA AGAGTTAAAG ACATTTTCTC CTCCCCCAAT AATGGTTCCA
GAAAAGTATG ATATTAATGC CTATAATATA GGGTATAAGG AGTTAAATTA TGAGATCCTT
AAGTTAGCTG AAAAAGGAAA GAGGGAAATA ACAGTTGCCA ATGTTTTAGG TCATAGATAT
ATTGGGATAA ATCTACCTGC TAGAGGTATA AATAATTTGA GGATTAACCT TTATGGTGTG
ATCGGAAACG CTATGGCAAA CTTAAATGAG GGTAATGAGT TTTATGTTTA TGGAAATGTC
GCTGATGATT GTTGCGATAC CATGCATGGA GGGAAGGTAG TAATTTACGG CGATGCAAGA
GATGTTTTAG CTCAGACTTT TCAGAACGGT AAGATTTTCG TTAAGGGTAA TGCCGGGAAT
AGAGTCGGTA TTCAGATGAG AGAATATAAG GATAAAAGAC CATATCTCAT AATAGGCGGT
ATTGTCGATG ATTATCTAGG AGAATATATG GCTGGAGGTT CAATAATAGT GTTTGGTAAG
GGATACAATG GAGAACCAGT AGGAAATTTT GTAGGAACGG GAATGGTAGG GGGTAGGATA
TACATAAGAG GTAAAGTTTC CCCATCAAAG TTAGGATTAC AACCACCTAA GTATGAGGTG
ATGAGACTAA TAAAAGCGCT ATTCTTAGAA GGTTTAATTT CTAGTGAAGA ATATGATTCA
TTAAAGAATG AAGAGTATAT AGGGATTGTT AATAAGTTAA AAGGAGAAGC CAAGGAATAC
GCGAAGAAAT TGTTTGAGGA GAAAATTGGA GTTCCGATGT ACGAATATAG AGAATTGACT
GAGGAGGAGT TTAAGGAGTT GTCCCCAGTA GTTAATGAGT ATTCTAAGGA CATGATGGAC
CACTCTTATG AAGAACTTTT AAAGGAAAAG TTTACTGTTG TAACTGCTAG AAAATTATAG
 
Protein sequence
MVDYYPSGCG VFGILRKRDA PKVKGDLVVR AIDRVRYRGS DKGAGFAVFN LEKRNYYVIK 
AFYNGNPSEL KEVFSKYGIE VKNVELVSKY SNLCDCNLIA LGDINEVRKA IRNINEIMWN
GKERKGRVYS VGSSLHVYKG VGYPRDVAEQ YRVEEIEGDL WLAHTRQPTN SPGYYPFWSH
PFSSFNIAIV HNGDVSSFGA NVEYLNSRGL NSFVGTDSEV LAFLFEELIA EGLTVEEAVK
ILINPSRRFD GLPKDVDYLY RNARLDGPFT AVIGYDSGDD LYLIAIADRS KFRPAIVGED
DSYYYIASEE NEIREVSPKA KVWTLKPGSY FIASYKKGVI SYGRSNEELK TFSPPPIMVP
EKYDINAYNI GYKELNYEIL KLAEKGKREI TVANVLGHRY IGINLPARGI NNLRINLYGV
IGNAMANLNE GNEFYVYGNV ADDCCDTMHG GKVVIYGDAR DVLAQTFQNG KIFVKGNAGN
RVGIQMREYK DKRPYLIIGG IVDDYLGEYM AGGSIIVFGK GYNGEPVGNF VGTGMVGGRI
YIRGKVSPSK LGLQPPKYEV MRLIKALFLE GLISSEEYDS LKNEEYIGIV NKLKGEAKEY
AKKLFEEKIG VPMYEYRELT EEEFKELSPV VNEYSKDMMD HSYEELLKEK FTVVTARKL