Gene PICST_43145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_43145 
SymbolNTA1 
ID4837831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp887916 
End bp888989 
Gene Length1074 bp 
Protein Length357 aa 
Translation table12 
GC content42% 
IMG OID640389146 
ProductCarbon-nitrogen hydrolase 
Protein accessionXP_001383452 
Protein GI126133855 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.186344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGT TCGCCAGGCT CAGGGTGGCA TGTGTTCAGC TTAACCCGAG AATAGGTGAG 
GTTGAAGCCA ATATTCGCAA GGTACACACA ATACTAGTCA ATGTACCGAA AGTAGATTTG
GTAGTACTTC CGGAATTGGC TATAACGGGC TATAACTTCC CCAACCGTAG AGCAATTGAG
CCATATCTAG AGAATTTGGC CCTAGAGAAT GGACCCTCTA TCAAGCTAGC AAAGGAAATC
TCAAACAAGT ACAAATGCTT TACTTTGATT GGATACCCTG AAATTTCGAA TTCAACAATT
TATAACTCTG CCGTATTGGT AGGACCCAAC GGCTCAATAC TACATAACTA TAGAAAGACA
TTTCTCTATG AAACAGACGA GGTATGGGGA GCAAGTGAGA ATCCAGAAAA AGGGTTTTCG
TCTCTAAAGC TTGTACTTGA TAAGGAATAC TATTTGGACA AGCAGGCAAA CAAAACATAT
CCAACTGTAA CTACAAACAT CGGCATTTGC ATGGATGTAA ATCCCTATCA ATTCAAGGCT
CCGTTCAATG CGTTTGAATT TTCAGGCTCG GCATTCCACC AGAGAGCCAA GCTCCTCTTG
TTTCCCATGG CATGGCTATC GCCCCAATCA CCTTCAACTA AGGAAGACTT GACCAAGAGT
GAGAAGTTGA ACAAGGGCAA GATATTCAAT GAAAGGTACT TCTCCACAGA ACATAAACCA
ACGGTAAATG ACAATAACGT AGCCCCAAAG TTGGAGTCTA ATACTTTATT CGTGCCTACA
ACTCCAGAAG GTAGCACAGT AAACTACTGG CTTCTCCGTT TTTTTCCATT CATGAAGCAT
CCCAACAGTT ACCAGTCCAA ATACTATGAG ACTGCCACGC TTATAGCCTG TAATCGTGTA
GGGGTGGAAG ACGATATATT GTATACTGGA TCGTCATCAA TAATACAATT CTCTGGAACT
TCATCTTCGG CTCCTCAAAT TGATAGTGCC AACCCCAGCG TTAATGTGCT TGGGAGTTTG
GGCCAAGGCG ACGAGGGAGT TTTAGTAAGG GATATAGATA TCGAATTTGA CTAA
 
Protein sequence
MNKFARLRVA CVQLNPRIGE VEANIRKVHT ILVNVPKVDL VVLPELAITG YNFPNRRAIE 
PYLENLALEN GPSIKLAKEI SNKYKCFTLI GYPEISNSTI YNSAVLVGPN GSILHNYRKT
FLYETDEVWG ASENPEKGFS SLKLVLDKEY YLDKQANKTY PTVTTNIGIC MDVNPYQFKA
PFNAFEFSGS AFHQRAKLLL FPMAWLSPQS PSTKEDLTKS EKLNKGKIFN ERYFSTEHKP
TVNDNNVAPK LESNTLFVPT TPEGSTVNYW LLRFFPFMKH PNSYQSKYYE TATLIACNRV
GVEDDILYTG SSSIIQFSGT SSSAPQIDSA NPSVNVLGSL GQGDEGVLVR DIDIEFD