Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66704 |
Symbol | CMC1 |
ID | 4851925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3195361 |
End bp | 3198245 |
Gene Length | 2885 bp |
Protein Length | 721 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393633 |
Product | Mitochondrial aspartate/glutamate carrier protein Aralar/Citrin (contains EF-hand Ca2+-binding domains) |
Protein accession | XP_001387183 |
Protein GI | 126276038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.186703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCCGTGGCTC GTCCACCTTA TCTGTATATA CCTTTTGTTT GTCTGTGACT GTTTTGTCTG TTTGACTCGT ATGGCCATTT TCTTCTGTCC GACTTCTCAA CTTCTATAGC CCTCTATATA GTTGACCTTT CAGACAGTGT GTCCCATCTG GCTCATAATA TTTATCGTTT TCTCCACCAG TACCAGACCT TCTTGCCGTT GTCAAGTTCC ATCCCGCAGA TACTGTCTTG ACTGAATTTG TGATTTTCAT TTTGCTTCTG AATTTTTGAA TTTTTGAATT TTTAGAATTC TGACTTGTCT CTCAGATTTG ACTCGCTGCC AGTACGCTGT TACTAGTATT TTTGAACTGA GATTTCGCCA GATAACCCAA AATTGCAGAT TTCTATCATT TCTCGCAACT CCATACAGCT GAGAACCCCG GAAACCCCAA CCTTCAGAAC CTTCACCAAT CTGCCACTGC CTATGAACCA GTAGCTGAGC TGTTTTGATA CCACTTCCTT TTCTCCGCGA CCGCCCACGA AAATTTTTCA GCTAGCTTGT CAAGTTTGCA CCCAGCGAAA TCGACACTAG AACTAGTCCC AGAGGTCTAC AAACCAAACC TATAGACTCT ACTGTATCAA AGTAGATTTC CACACCTGCT GACGTTATTT AAACTGATTT GTGATATTTG TCCATTTCCA TTTTTTCGTT TTTCATTCTA ATTTTTCAAC CTATTCACCA TGTCCGTTGA GATAAAAGCC CGTGAGATCT TCAACCAGTT TGCAACTGTT GGAGCCAACG ATCAGAAAAT CTTGACGTTG CCCGACTTCA TCAACGTGTT GTCGCCTTTT GAGACCGATT TGCCAAAACC AACGTATTCG TTGCTTTACC TCATTGCCGA CGAGTCCAAA AAAGGTTATG TTACCGAGCA GGACTGGGTT TCCTTCATCA ACACGATGAC TTCGCCTGAT GGTAGCTTCA AGTTGCTTTA CCAGTTCATA GCCTCGTCCT CCTCCAATAA GACGAAGATG AGCTACAACC AGTGTATCGA AGCATTGAAC AAGTTGAACA CTTCCATAAA TCCAAGCTAC CAACAGAATT TGGTGAAGAT GAACTGGATT TACTTGCCTC GTTTCTTTGA GCCCAACGGT TCCATCCAGT TCAATGACTT CATCACCATG ATCTCGTACT TGCCTGTGAC GAAGTTGATT GGTAATTTTG AGAAGGAAGC CAACAAGAAA AAGACTATTG ACAGCACTCA ATTGGTCGAC TTGCTTTCCA CCAACTTGTC CCACAAGATC TCCAACAAGT TGAAGGTCAA CTTATCAAAC ATTACGGACT ACTTTGAAGG CCAGAACGAG TTCTCGTTAT CAAACTTGCT CTTTGTGTAC GACACGTTGA ACAAGATCGA CTTGATCAAT GAAGTCATTG CCAACACTCC TCCTACCACA GAAGACAAGG ACGACATCTT GATCAACAAA ATGGATTTAT ACACTCATTT GAACGATCCA TTGTTGAAAT CCGCCAACTT CAAGCCTGTA TCGATGTTGG AGATCGATCT CTTGTTCTAC TTGATCAACA AGAAGTCCGG TGACAATATT CCCAGAAAGG AGTTGATCTC GTACTTGAAT CCATCGTACA ACAATAACCT TAAGACGTTG CCTCTGATGT TTGAGAACTC TAACTCGAAG CACAGTCTCC ACACCCAGGA CGATAACTTT TCGTTATGGC CCATCTATGA CTCACTCTAC TCATTCTTCT TAGGCTCCAT AGCTGGCTGT ATAGGTGCCA CTGTAGTGTA TCCTATCGAT ATGGTGAAAA CCAGAATGCA AGCTCAAAAG CACAAGGCGC TTTACGACAA TTCCTTCGAC TGTTTCAAGA AAATTATCAA GAATGAAGGT TTCAAGGGTT TATACTCTGG TTTGGGAGCC CAGTTGGTGG GTGTTGCACC TGAAAAGGCC ATTAAGTTGA CTGTCAATGA CTTGGTGCGT AGAATCGGTA CCAACGAGGA TGACGGCACC ATTACCATGG GCTGGGAAAT TCTCGCAGGC TCTTCAGCTG GTGCCTGTCA GGTCATTTTT ACCAATCCTT TGGAAATCGT CAAGATCAGA TTGCAGATGC AAGGCAAATC TAAGGTTATC AAAGCTGGTG AAATCCCACA TAAGCATTTG AGTGCTTCTC AGATTATCAA GCAGTTGGGT TTGAAGGGGT TGTACAAGGG TGCCAGTGCC TGTTTACTCA GAGACGTGCC CTTTTCGGCC ATCTACTTCC CTACCTATGC CAATTTGAAG AAGGTTTTGT TCGGCTTTGA TCCTTCAAAC ACCAACTCCA ACAAGAAGTT GAGCACATGG CAGTTGTTGG TCTCGGGTGC TCTTGCGGGT GCCCCTGCAG CCTTCTTCAC AACCCCAGCT GATGTGATAA AGACAAGATT ACAAGTAGAA AGTAAACAAC ACGACATAAA GTACAGCGGC ATTTCCCACG CTTTCAGAGT AATCTTGAAG GAAGAAGGGG TTACTGCCTT CTTTAAGGGT TCGCTTGCCA GAGTGTTTAG ATCTTCGCCT CAGTTCGGTT TCACCTTGGC CTCGTACGAG TTGTTGCAGA ACATGTTCCC CTTGCACCCT CCTTTAACCA AGGACTCCAA CTTTAAGGCC ATCACCGGTT ACCCAGGTAT CTACAACTTG ACCAACGATC AAGTGTACAA CTCGCAGAGC AGAAACGACA GAATCATGTA CTTGAACAAG TCGGACATTT CGCCAGACGT ACAGAAAATC AATGACGCCT TGGTCAAGTT GCCCGCAGAA TACGTGTACA AGGCTCAAGA CGCAGTTAGA TTGTTGTTAG ATATCGACTA CAAGTTTGGA AACTTCAACT ACAATTCCTA CTTGAACTTT ATCCAAAAGA AATAG
|
Protein sequence | MSVEIKAREI FNQFATVGAN DQKILTLPDF INVLSPFETD LPKPTYSLLY LIADESKKGY VTEQDWVSFI NTMTSPDGSF KLLYQFIASS SSNKTKMSYN QCIEALNKLN TSINPSYQQN LVKMNWIYLP RFFEPNGSIQ FNDFITMISY LPVTKLIGNF EKEANKKKTI DSTQLVDLLS TNLSHKISNK LKVNLSNITD YFEGQNEFSL SNLLFVYDTL NKIDLINEVI ANTPPTTEDK DDILINKMDL YTHLNDPLLK SANFKPVSML EIDLLFYLIN KKSGDNIPRK ELISYLNPSY NNNLKTLPLM FENSNSKHSL HTQDDNFSLW PIYDSLYSFF LGSIAGCIGA TVVYPIDMVK TRMQAQKHKA LYDNSFDCFK KIIKNEGFKG LYSGLGAQLV GVAPEKAIKL TVNDLVRRIG TNEDDGTITM GWEILAGSSA GACQVIFTNP LEIVKIRLQM QGKSKVIKAG EIPHKHLSAS QIIKQLGLKG LYKGASACLL RDVPFSAIYF PTYANLKKVL FGFDPSNTNS NKKLSTWQLL VSGALAGAPA AFFTTPADVI KTRLQVESKQ HDIKYSGISH AFRVILKEEG VTAFFKGSLA RVFRSSPQFG FTLASYELLQ NMFPLHPPLT KDSNFKAITG YPGIYNLTND QVYNSQSRND RIMYLNKSDI SPDVQKINDA LVKLPAEYVY KAQDAVRLLL DIDYKFGNFN YNSYLNFIQK K
|
| |