Gene PICST_33268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33268 
SymbolNAR1 
ID4840427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp164892 
End bp166529 
Gene Length1638 bp 
Protein Length545 aa 
Translation table12 
GC content44% 
IMG OID640391742 
Productnuclear architecture related protein 
Protein accessionXP_001386245 
Protein GI150866595 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAA TACTATCTGC CGACGATCTC AACGACTTCA TTTCACCCGG AGTGGCGTGT 
ATAAAGCCTC CAGCACAGAA TAGCGATCAA AAGTTCAACC TGCTAAACGA GAATGGAGAA
GTAGAAATAC AGATAGATAG CGAGGGCAAC CCTTTGGAGA TTTCAAAAAT CGACGGGAAG
CAGACAAACT TGCTGCCTGC CCAAATATCG TTGGCAGACT GTTTGGCATG CTCTGGCTGT
ATCACATCTG CTGAAGAAGT GTTGGTAGCT CAGCATTCGC ACGAAGAGTT GATCAAAGCG
TTGAATGAGA AAGTTGACAA TAATAGTACC AAAGTATTTG TAGCGAGCAT ATCACACCAG
TCTCGTGCTT CATTAGCTAC AGCGTATAAC TTGTCTATCG AAGAGATCGA CAAACTCCTC
ATCAACCTAT TCATCAACCA GATGGGGTTT AAGTATATTG TAGGGACTTC TATAGGGAGA
AAGCTTTCTT TGATCAACGA AGCGCAGAAT TTGATTGAAA AGAAGGAATC CGAGTTTGAC
GGCCCTGTTC TTTCATCCAT TTGTCCTGGT TGGGTGTTAT ATGCGGAAAA AACTCATCCT
TACGTTTTGC CCAGAATGTC CACTGTGAAG TCCCCTCAGC AGATCACTGG ATGTTTGTTA
AAGACGTTAG CAGCGCACGA GCTTGGAGTC ACCAGAAACG ATATATACCA TCTATCCATA
ATGCCATGTT TCGACAAAAA GTTGGAAAGC GCAAGGCCAG AAAAGTACGG AGAACAAAAT
ACTTCCAACG ATGTAGACTG TGTTCTCACA GCAAAAGAAT TGGTCACCTT GCTTGAACAG
CATTCTGATA AGTTTCAGTT AATACCACCG CAAGCACATA CTATCACCAA CTCTGCCATC
CCTGTAGTAG ATTTGTACAG TAAATGTGCA CCTCGAACAT GGCCCCTCGT GCAATACTCT
TGGTCCAATG ATAGCGGTTC TGCTTCAGGA GGCTACGGTT ACAACTATTT AAAGATGTAC
CAGAATCATT TGATAATGAA GCATCCGACA AAGTACCAGC AAGAAGGATT TTCTATCGAC
TATGTAAAGG GCCGTAATAC CGATCTCACA GAAATGAGGT TGATGTATGG AAGCGAAAAG
CTTGCTAGTT CTGCCATCGT AAATGGGTTC AGAAACATTC AAAATTTAGT TCGCAAGTTG
AAACCTACAG TCAAGCCGGG TTCGACTACA GGCAAAGGAA ATGCTTTAGT GGCTCGCCGC
AGAGCTAGGG TTGCTGGAGG AATAACAAAG GCTAGCTCAC CTGCTGGTTC AGACGAAAGC
GCAGATGCTT CCAAGTGCGA CTATGTAGAG ATCATGGCAT GTCCAAATGG TTGTATAAAT
GGCGGAGGCC AGATCAATCC TCCTGAAGAT GTTTCTGAAA AGGATTGGCT TTCTGCAAGT
CTTGAAAAGT ACAACCTGAT TCCATTGTTG GACTTGGCAG CAATGGAAAA TGTGGATACG
GTGGCCGAGA TTATGCAATG GAGTTGTTTA TTCCGCGAGG AGTTTGGAGT CTCGGAAAAT
AGGCTCTTGA AGACGTGGTT TAATGAAGTC GAGAAGCCCA CAGACTCGGC TTCTATTTTA
TTGGGTGCCA GGTGGTAG
 
Protein sequence
MSAILSADDL NDFISPGVAC IKPPAQNSDQ KFNSLNENGE VEIQIDSEGN PLEISKIDGK 
QTNLSPAQIS LADCLACSGC ITSAEEVLVA QHSHEELIKA LNEKVDNNST KVFVASISHQ
SRASLATAYN LSIEEIDKLL INLFINQMGF KYIVGTSIGR KLSLINEAQN LIEKKESEFD
GPVLSSICPG WVLYAEKTHP YVLPRMSTVK SPQQITGCLL KTLAAHELGV TRNDIYHLSI
MPCFDKKLES ARPEKYGEQN TSNDVDCVLT AKELVTLLEQ HSDKFQLIPP QAHTITNSAI
PVVDLYSKCA PRTWPLVQYS WSNDSGSASG GYGYNYLKMY QNHLIMKHPT KYQQEGFSID
YVKGRNTDLT EMRLMYGSEK LASSAIVNGF RNIQNLVRKL KPTVKPGSTT GKGNALVARR
RARVAGGITK ASSPAGSDES ADASKCDYVE IMACPNGCIN GGGQINPPED VSEKDWLSAS
LEKYNSIPLL DLAAMENVDT VAEIMQWSCL FREEFGVSEN RLLKTWFNEV EKPTDSASIL
LGARW