Gene Information |
Locus tag | PICST_53564 |
Symbol | |
ID | 4851864 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3039931 |
End bp | 3042828 |
Gene Length | 2898 bp |
Protein Length | 766 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393572 |
Product | predicted protein |
Protein accession | XP_001387148 |
Protein GI | 126275832 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
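The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the reported Gene Length), and that the reported span contains the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3039931
END_BP = 3042828
PROTEIN_LENGTH_AA = 766


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)      # genomic span of the gene
cds_length = coding_bases(PROTEIN_LENGTH_AA)     # coding bases incl. stop codon
non_coding = gene_length - cds_length            # bases in the span that are not translated

print(gene_length, cds_length, non_coding)  # 2898 2301 597
```

Under these assumptions the span is 2898 bp, of which 2301 bp would encode the 766 aa protein plus a stop codon. The remaining 597 bp would be consistent with the record marking the gene as spliced, although the record itself does not list intron coordinates.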
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.422768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.419656 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGTTT TCCCCGCGTT CGACTCGAAC TACAAGACGC GCAAGCGCAC GTTCAAGTGC TGTCAGAACT GCCGTGTGAA GAGAGTCAAG TGTCAGATCA CTTCCACCGA CTACGAGTCC TTGGGATGTG TGAATTGCCG CAAGAACAAA TGGACTTGTT CGCTTGCGAA GCAGCAGGCC GTACAGAGCC AGACTCCGGA CCAAAACCAA AACCAGCAAA TTCAGATTCA GTCCACGATT AAGAATGAAT TGGGAAATAT TGACGAAATA AGCCAAAGTC ACGATATACA CCAAGATATA AACCAAAATC AGATCCAGAC CCAGACCCAG ACCCATAACC AAAACTACAT TGGTAGTCAA ATTCAAGATA ATCTCAACAG TAACAACATC AACAATTACA ATATTAACAA TAACAACAAT ACCAATAACA TTAACAGCCA TACTAGCAAT AATAGCAATA TCCTCTCCAA TGGCCAACAC CATCTTCCCA TCCCGATTGA ATCGATTTCT GAGAACAGGA CTCTTGTGCA GAATACTTCC ATCCATTCCG ACACCATCAC ATCAGCAGTA GACATCAGTT CGGCTACGGT CCCCATGAAC GGCAACAGTA ACACTTCCAA CGCTGCTGCT CTTACAGGTA ACAATACTAC CATTGAAAAT GGTACTGCTA CTACCAATAT GAAAGTCTCT AAGCGCAATT CGCACGTTGG TTCGTTTCCG CAACAGGCGG TAGAATTTGT GCCTCTGCCC GTGTACCCAA AATCAAAGAG CATGTCTGAA ATCAAGCAGT CGCCCAATCA TACGAATACC GGCAATAGCC ATCATAGTAG TATCAGCAAT GGAAACGATA TTGCCAACGG TGAAGTTTTC AATCGTCCTC AGAGAATCAA TATGAAGACC ATTACTCCAC AGTACCTTAA GCAGACTTTC AACTTCAATG TACTGGGCAA TGATCTGGGC TCGAACTATC AATATCTTTT CCACGGTCAT CCTAAGGTAA TTATTGCCAT GTCAGACGAC CAGACCATCT GGCATGAGTC TGGAGTCTAC GTGAAGACTA CTAAAGAAAA TCAGAACACC AAGGAAGGCG TAGAAGGCAA ATCCTACAAG CTCAAGAACT TTGGGAGTGG TACTTTCAAG GAGTTCTACA TCCGTAACGA AAATGTCTAC AACTTCCTCT TGCTGATAAA TGCGTTCACT TTAGAATCTC CAGCATACCC GTTTGAAAGC GACGAAGTAC GTCAACTTAT AGAGTTGTAC TTTTATAAGT TGAACTCCAT CTTTCCTTTG GTTCATGAAA ATGGCTTCTG GGACGATTAC CGCAACAACA AGGCACAAAA TGTTTTGATC TACGTTGTGG TGCTAGCCAT TCTGAGGGAT AAAATGGCAG AGTCAATTCT TAAGAAGGTA TTTTTGCGTG GTACTCTCCA TCGGCACGGA CAAGTCATGT CTGACATGTC GCAGCAGCAA TTCAATGAAG ATCTCGTCTC ATTTATGTCT GAATTAGAAT ACAAAATCAG GCAAATTTTG CTTATACTTC CTCAGTTAGG AGACGAAGAC AAGTTTTCAC GTTTAGTGGT ATCGCTTGTT CTTTCGACCC ATTTCAACTA TGACAAGTTG GGATGTGAAA ACTCTTCCCA CGATCTCACA GATGCAATTA ACTTGGCGAC TTCTATTGGT ATCCACATGA AGAGACTTCT GTTGAATGCT GAGCCTCTGA AAGTTGAGTA TTCTTCCAAT TTGTGGTGGT GCTGCTACAT CTTTGATCGC TTCAACGGTC TCGTAAATGC CCGTCCTGTC 
TTTATTAGAC AAGAGGATTT CAATGTCGAT CTTCCCTACA ACAATATCAA CTTGTTGAAG ATGGTACAGC TTGCACGTTC GCTTGAAAAC ATGTTTTTTG CAATCTTTCA GCCCTTCAAC AACAACAACG TCATAGGAAC AAACAACTTG AACAATAACA TGGTCAGATA CAAGATGTTT GACACGGATG AATTCCAGCG AATTGAATTT GAGCTTTGCG ATAAGGAACG CTCTAGAAAT AGGGTTGCCT ACGATTCCAT GTATCCGGTA GCTTCCAGAG TTGGTGATAA TCCCTTCACC GACTACGTAG GAAATACCAT TCATTTCATG ACAAGAGTGG TGAACAATGT CATTATCTTA GCTTCGCAGA AGGCCAAGTA CGATAACCCT CAGATACCTA ACCATATTCC AGAAGCTGTA GCACTTAGAG CCTCGCTGAA TATTTTGTGG TATCTTGTTC AGATGAAGGA CGAGTTTGTC ATCAATATTC CGATGGTTCC ATGGTGCATG TCTTTGGCCA TGGCTGTTGC TCTCAAGAAG AAGGCGAGAA TGTGTCTCAA GGATGGTGAA TATGAGGAAT ATAAAGTGTA CCAAGATCCG TTTGAATTCA AAGACTACAT CAACGAGTTG GAAAAGTTCT CACTGACCTG GTGGGTGGTT GATGAAATCT GTAGATTGAC CAGAGATTTC ACCAACACGT TGGACTCCAA GTCTAAGAGT AGAAAACGTA GACAAAGGGC CAAGGCAGCA TCAGCGGCTG GAACTTCGAA GAAGAAGCAA AAGATGGAGC TGAGAAGCGA CTGGGTCAAA CTGGCATTGC CACAACCAGT CTCAAGTCCA GTTCCTAAAT CAGATGCTAT TCCTTCTATC AGAAATATGT TGCAACCACC TTCACAGACC CTGACGGAGT TAAATGCCTA TGTAGCAAGC ACCAATGGAA CTACACCCTA CATGCAAACA GACTCTTCCT TCAGTCCTAA TCTGATGAAT CTGAGTGACG CCAACCAATA TGACCAATAC TTCGAACTGA TGCAGATCGA CATTTTCAAC AACGACTTCT TCAAGGATGT TCCCAATGTC ATCAATTTGT TGAAGTAA
|
Protein sequence | MFVFPAFDSN YKTRKRTFKC CQNCRVKRVK CQITSTDYES LGCVNCRKNK WTCSLAKQQA VQSQTPDQNQ NQQIQIQSTI KNELGNIDEI SQSNNTTIEN GTATTNMKTI TPQYLKQTFN FNVLGNDLGS NYQYLFHGHP KVIIAMSDDQ TIWHESGVYV KTTKENQNTK EGVEGKSYKL KNFGSGTFKE FYIRNENVYN FLLLINAFTL ESPAYPFESD EVRQLIELYF YKLNSIFPLV HENGFWDDYR NNKAQNVLIY VVVLAILRDK MAESILKKVF LRGTLHRHGQ QFNEDLVSFM SELEYKIRQI LLILPQLGDE DKFSRLVVSL VLSTHFNYDK LGCENSSHDL TDAINLATSI GIHMKRLLLN AEPLKVEYSS NLWWCCYIFD RFNGLVNARP VFIRQEDFNV DLPYNNINLL KMVQLARSLE NMFFAIFQPF NNNNVIGTNN LNNNMVRYKM FDTDEFQRIE FELCDKERSR NRVAYDSMYP VASRVGDNPF TDYVGNTIHF MTRVVNNVII LASQKAKYDN PQIPNHIPEA VALRASLNIL WYLVQMKDEF VINIPMVPWC MSLAMAVALK KKARMCLKDG EYEEYKVYQD PFEFKDYINE LEKFSLTWWV VDEICRLTRD FTNTLDSKSK SRKRRQRAKA ASAAGTSKKK QKMELRSDWV KLALPQPVSS PVPKSDAIPS IRNMLQPPSQ TLTELNAYVA STNGTTPYMQ TDSSFSPNLM NLSDANQYDQ YFELMQIDIF NNDFFKDVPN VINLLK
|
| |
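The sequences above are printed in space-separated blocks of ten characters, so they need normalizing before any length or composition check. The following is a minimal sketch of how that might be done; the helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full record. The full sequence would have to be pasted in to compare against the reported Gene Length and GC content.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example = "ATGTTCGTTT TCCCCGCGTT"
seq = normalize(example)

print(len(seq), round(100 * gc_fraction(seq)))  # 20 50
```

Applied to the complete gene sequence, `len(seq)` would be expected to match the Gene Length field and the GC percentage to land near the reported value, assuming the record computes GC content over the same span.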