Gene Information |
Locus tag | PICST_33268 |
Symbol | NAR1 |
ID | 4840427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 164892 |
End bp | 166529 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391742 |
Product | nuclear architecture related protein |
Protein accession | XP_001386245 |
Protein GI | 150866595 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
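The start, end, gene length, and protein length fields above are related by simple arithmetic, so they can be cross-checked. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single untranslated stop codon. The variable names are hypothetical and are not fields defined by the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 164892
end_bp = 166529

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which is counted in the gene length but not in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the 1638 bp and 545 aa listed in the record.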
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCAA TACTATCTGC CGACGATCTC AACGACTTCA TTTCACCCGG AGTGGCGTGT ATAAAGCCTC CAGCACAGAA TAGCGATCAA AAGTTCAACC TGCTAAACGA GAATGGAGAA GTAGAAATAC AGATAGATAG CGAGGGCAAC CCTTTGGAGA TTTCAAAAAT CGACGGGAAG CAGACAAACT TGCTGCCTGC CCAAATATCG TTGGCAGACT GTTTGGCATG CTCTGGCTGT ATCACATCTG CTGAAGAAGT GTTGGTAGCT CAGCATTCGC ACGAAGAGTT GATCAAAGCG TTGAATGAGA AAGTTGACAA TAATAGTACC AAAGTATTTG TAGCGAGCAT ATCACACCAG TCTCGTGCTT CATTAGCTAC AGCGTATAAC TTGTCTATCG AAGAGATCGA CAAACTCCTC ATCAACCTAT TCATCAACCA GATGGGGTTT AAGTATATTG TAGGGACTTC TATAGGGAGA AAGCTTTCTT TGATCAACGA AGCGCAGAAT TTGATTGAAA AGAAGGAATC CGAGTTTGAC GGCCCTGTTC TTTCATCCAT TTGTCCTGGT TGGGTGTTAT ATGCGGAAAA AACTCATCCT TACGTTTTGC CCAGAATGTC CACTGTGAAG TCCCCTCAGC AGATCACTGG ATGTTTGTTA AAGACGTTAG CAGCGCACGA GCTTGGAGTC ACCAGAAACG ATATATACCA TCTATCCATA ATGCCATGTT TCGACAAAAA GTTGGAAAGC GCAAGGCCAG AAAAGTACGG AGAACAAAAT ACTTCCAACG ATGTAGACTG TGTTCTCACA GCAAAAGAAT TGGTCACCTT GCTTGAACAG CATTCTGATA AGTTTCAGTT AATACCACCG CAAGCACATA CTATCACCAA CTCTGCCATC CCTGTAGTAG ATTTGTACAG TAAATGTGCA CCTCGAACAT GGCCCCTCGT GCAATACTCT TGGTCCAATG ATAGCGGTTC TGCTTCAGGA GGCTACGGTT ACAACTATTT AAAGATGTAC CAGAATCATT TGATAATGAA GCATCCGACA AAGTACCAGC AAGAAGGATT TTCTATCGAC TATGTAAAGG GCCGTAATAC CGATCTCACA GAAATGAGGT TGATGTATGG AAGCGAAAAG CTTGCTAGTT CTGCCATCGT AAATGGGTTC AGAAACATTC AAAATTTAGT TCGCAAGTTG AAACCTACAG TCAAGCCGGG TTCGACTACA GGCAAAGGAA ATGCTTTAGT GGCTCGCCGC AGAGCTAGGG TTGCTGGAGG AATAACAAAG GCTAGCTCAC CTGCTGGTTC AGACGAAAGC GCAGATGCTT CCAAGTGCGA CTATGTAGAG ATCATGGCAT GTCCAAATGG TTGTATAAAT GGCGGAGGCC AGATCAATCC TCCTGAAGAT GTTTCTGAAA AGGATTGGCT TTCTGCAAGT CTTGAAAAGT ACAACCTGAT TCCATTGTTG GACTTGGCAG CAATGGAAAA TGTGGATACG GTGGCCGAGA TTATGCAATG GAGTTGTTTA TTCCGCGAGG AGTTTGGAGT CTCGGAAAAT AGGCTCTTGA AGACGTGGTT TAATGAAGTC GAGAAGCCCA CAGACTCGGC TTCTATTTTA TTGGGTGCCA GGTGGTAG
|
Protein sequence | MSAILSADDL NDFISPGVAC IKPPAQNSDQ KFNSLNENGE VEIQIDSEGN PLEISKIDGK QTNLSPAQIS LADCLACSGC ITSAEEVLVA QHSHEELIKA LNEKVDNNST KVFVASISHQ SRASLATAYN LSIEEIDKLL INLFINQMGF KYIVGTSIGR KLSLINEAQN LIEKKESEFD GPVLSSICPG WVLYAEKTHP YVLPRMSTVK SPQQITGCLL KTLAAHELGV TRNDIYHLSI MPCFDKKLES ARPEKYGEQN TSNDVDCVLT AKELVTLLEQ HSDKFQLIPP QAHTITNSAI PVVDLYSKCA PRTWPLVQYS WSNDSGSASG GYGYNYLKMY QNHLIMKHPT KYQQEGFSID YVKGRNTDLT EMRLMYGSEK LASSAIVNGF RNIQNLVRKL KPTVKPGSTT GKGNALVARR RARVAGGITK ASSPAGSDES ADASKCDYVE IMACPNGCIN GGGQINPPED VSEKDWLSAS LEKYNSIPLL DLAAMENVDT VAEIMQWSCL FREEFGVSEN RLLKTWFNEV EKPTDSASIL LGARW
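The record lists translation table 12, in which CTG is read as serine rather than leucine. This is why the protein sequence shows S at positions where the gene sequence has CTG. The following is a minimal sketch of that translation for short fragments of the gene sequence. The function name translate_table12 and the lookup-table layout are illustrative assumptions, not part of the record; the amino-acid string follows the NCBI convention of listing codons in TCAG order.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (first, second, third base).
# This is the standard code with one change: CTG (index 19) is S, not L,
# as in translation table 12.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TO_AA = dict(zip(CODONS, TABLE_12))


def translate_table12(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Any character other than A, C, G, or T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TO_AA[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = translate_table12("ATGAGTGCAA TACTATCTGC CGACGATCTC")

# Bases 91-111 of the gene sequence, which contain a CTG codon.
ctg_block = translate_table12("AAGTTCAACC TGCTAAACGA G")

print(first_block, ctg_block)
```

The two fragments translate to MSAILSADDL and KFNSLNE, matching the start of the protein sequence and the KFNSLNE stretch in its fourth block. With the standard code, the second fragment would read KFNLLNE instead.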