Gene Information |
Locus tag | PICST_28496 |
Symbol | |
ID | 4851272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1364643 |
End bp | 1366640 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | |
GC content | 43% |
IMG OID | 640392980 |
Product | hypothetical protein |
Protein accession | XP_001387490 |
Protein GI | 126274258 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.543168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.515447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAGG AACCATTACT TTCCGGCCCG GGTGCAAACT CGGCAGCTGC GTTCACTGAT AAGTTTATCA ACCCTCAGAA CCATTTCACA ATAAGAGACT ATTTGGTATC TCCAGAAACT GGAAGGGGTA GTGTAACGTT TTACAAACAG ACGAGAACCA AGTTGCAATA CATTGTGCTT GTGGGCCTCG GCTTCGCCGG TTTCTATAAG CTTGAATGTT CATCTTTGAG AGCAGCTTCA TTATCCCTTG TGTTTCCTGG TGCTGGTTAT GTTGCAACAC CTTCGGCTTT GAACTGGTCA TTGTTGGGGG CCGCTCTTTT CTTGTTGCCA GTATCGTTCT TCTGTTGGTT TGCATCTGGT GCCATCACCA TTCCAATTTT GGTGTATGCC ATTCCTATTG GTCTTTCTGC CTATCTAGCC AGGGGCAAGG AAGTCTGGGT AAACGCTCCT TATGTTGTGG GAGCATTTTT GGTTGTTTTT TTCTCCATTG TCACCTTGAT CTCATCTAGA TTACAAAGTG CAGCTCAAGA GAAAGCCAAG AAGAGAAATG CCTACTTGCC CAAGCTTATT CCTGAATTGT ACGAGAAGAA GAGACTTCCA GAAGATGACC GTGAAATCGA CGTCAAGACC TTGAGGTTCC TTCAGTACTA TATTGAGTTA GGTTTGCAGG ATGTCGACGA CTGGTCCAAT TTCAACGACG TTGATCAGTT CCAAACTGCT TCTTTGAGAT ATCAATTGTA CGAAATCGTT TACTCTTTGG CAACATATCA AGCCATCTAT GCTCCCTCCC TTAGAGGTAA AATCTGTGAG TCAATGAGAA ATTCTATTGA AAAGTCATTG ACTTCAAAGG TCATGGGATA CTGGAAATGG GAATCTCTTC TTGGAAAATG GACTTTAGAC TGGGATCCAA TCAAAAAGGA CAACATCATG GTATCTGGTT ATATTTTGAA GGCTCTTGCT TTGTACACTG GAAATACTGG TGATATGAGA TACACCGAGC CAGACTCTTT GCCATTCCAA ATCACCAAGA ACGCTGTGTA CAAGCATGAC ATTCATTCTA TTAACAAGTA TGTGTTGAGA AACTGGGACG ACTCTCAGTT CTGTATCTAC CCATGTGAAC CAAACTGGAA CTATACTGCT TGTAACGTTA TTGCAGCTGC TGGTGCCGTT GGTTATGACA GAGTTTTCAA CCAGAGTAAC TTTGACAAGA GACGTGGTAG GTTCTTAAAG CATCTTAAAG AAGACATGTG TGACGAGTCC GGTACTCTTT TGCCAATCAA GTCTACTTTC ACCGGTTTCA CTATTCCAGG GATTCTTGGT CCTGTGGGTG AGTCTGTCAG TGCTGTTGAT ACTGCTTTTA TTGACGTCCC AACCTCTGCT AGAGTTTGGA GTGTTATCAA ACATGAAAAC CTCGAATTCG ATTCGAGCAC TGGCAATTAC AAGACGAAGG GTCTTTCAGG TGCTGACTAC ATCGATATGG GTTCTTACAA GCCCTCCAAT GGTACCGCAT TGCTTACTTT TGCTTTCCTT GCTAGCGAAT ATGGCGACAG TGACATTGCA GAAAACTTAT TGAAGCAAGT CGACGAAGGA GAAAATGGAA TAGAAGAATT GCCAACTGGT TCTTACAGAA ACAAGGGTAT TTCTGTCTGG GTTGGTGGTT TTGGTCTTCG TGCTCGTTTG ATTCGTTTCA GAGACTGGAC TAACACTGTT GTTAATGGTC CTCCCAAGCA ATGTGTGAAT GGTCCAGTTT TGGATGCATA CGATTTCCAA AACGTATTGG TTGCAAAGGC TTACTCTAAC 
GGTGAAGACT TGGATTTGGT GTTGCACAAC GGTAGAAGTC CTGGTAACTT CACCTTGAGA TTAGCACAAT TGACACCAGG TTCTACTTAC AAGACTTCTA CTGGTTCTTC TTTTGTTGCC GACAGTAATG GTACTGGTAT CATTTCAGTT AACATCAATG GTCGTACTCC AGTTTACGTT GAGAAAGAAA AGGCTTAG
|
Protein sequence | MSKEPLLSGP GANSAAAFTD KFINPQNHFT IRDYLVSPET GRGSVTFYKQ TRTKLQYIVL VGLGFAGFYK LECSSLRAAS LSLVFPGAGY VATPSALNWS LLGAALFLLP VSFFCWFASG AITIPILVYA IPIGLSAYLA RGKEVWVNAP YVVGAFLVVF FSIVTLISSR LQSAAQEKAK KRNAYLPKLI PELYEKKRLP EDDREIDVKT LRFLQYYIEL GLQDVDDWSN FNDVDQFQTA SLRYQLYEIV YSLATYQAIY APSLRGKICE SMRNSIEKSL TSKVMGYWKW ESLLGKWTLD WDPIKKDNIM VSGYILKALA LYTGNTGDMR YTEPDSLPFQ ITKNAVYKHD IHSINKYVLR NWDDSQFCIY PCEPNWNYTA CNVIAAAGAV GYDRVFNQSN FDKRRGRFLK HLKEDMCDES GTLLPIKSTF TGFTIPGILG PVGESVSAVD TAFIDVPTSA RVWSVIKHEN LEFDSSTGNY KTKGLSGADY IDMGSYKPSN GTALLTFAFL ASEYGDSDIA ENLLKQVDEG ENGIEELPTG SYRNKGISVW VGGFGLRARL IRFRDWTNTV VNGPPKQCVN GPVLDAYDFQ NVLVAKAYSN GEDLDLVLHN GRSPGNFTLR LAQLTPGSTY KTSTGSSFVA DSNGTGIISV NINGRTPVYV EKEKA
|
| |
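The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record for PICST_28496.
START_BP = 1364643
END_BP = 1366640

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = expected_protein_length_aa(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1998 bp) and Protein Length (665 aa) fields of the record.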