Gene PICST_16494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_16494 
SymbolARN2 
ID4840969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp747342 
End bp748994 
Gene Length1653 bp 
Protein Length551 aa 
Translation table12 
GC content40% 
IMG OID640392284 
ProductSiderophore Iron Transport 
Protein accessionXP_001386550 
Protein GI126140056 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000381637 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.145167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGTACTAGAA AGGCAGAATT GTTGAACGAG CAATACCAAT CTCCACACCT CAAGGTCTGT 
TTGTTCGTTT CTATTTTCTT TGTTGCTTAC ACCTATGGGA TCGAATCTAC TTTGAGAGGA
AATATTCAAG CATATGCCAC TAGTTCATAC ACGCAACACT CTTTGTTGTC CACTGTTAAC
GTTATCAAAT CAGTTGTTGC TGCTGCCTCT CAACCAATGT ATGCTAGGTT GTCAGATAAG
TTCGGCAGAT TGGAGTTGAT GTTGGTCTCG ATTGTTTTCT ACATTGTTGG AACTGTGATT
CAATCTCAAG CATTCGATAT TAACAGATTT GCTGGAGGTT CTGTGTTGTA CCAAGTTGGA
TTTTCTGGAG TCATGATAAT GTTACAAATT ATATTGGCCG ACTTCTCAAA CTTGAATTGG
AGATTAGTTT GTTCATTTGT ACCTGCTCTA CCTTTCATTA TCAACACATG GGTTGCAGCA
GAAGTGCAAG CAAGTTTATT AGCCAACCAT TCTTGGAATT TTGCAATTGG AATCTGGGCT
TTCATTTTCC CACTCTCATG CGTTCCTTTA CTTTTGTGCT TTATTCATAT GATATGGAAG
GCACGTAAGA CAGATGAATG GCAACGGTTG AAGGAAGAAA GAACAAAGAC ACCATTCATC
CAGAAGGCAG TCGAATTATT CTGGGAATTG GATGTAGTGG GCATTGTTCT CCTTGTTTGT
GTTTTTGGGT TTATTTTGGT TCCTTTTACA ATTGCAGGAG GAGTCACAGA CAAATGGAAA
GAAGCTTCTA CCTTGGCCCC TTTGATTATC GGATTTGCTC TTCTTCCTGT TTTCGTATGG
TGGGAATACA AATATGCCAA GTTTCCTATT TCTCCTTTCC CGTTATTGAA AGATCGCGGA
GTTTGGTCAG CTCTTATTAT TGCTATCCTA ATTGATTGGG TGTGGTACAT GCCAAATGAT
TTTATGTACA CTGTCCTTAT TGTTGGTATG AGAGCTAGTG TCAAAGCTGC TACTAGAATT
TCTTCCTTGT ATTCATTCGT CTCTGTCATT GTTGGCCCTC TATTAGGTCT CTTGGTCGTA
AGGGTTAGAA GGTTAAAGGG CTTTATTATA TTTGGCACAA TTTGCTGGAT TATTTCCTTG
GGGTTATTGG TACATTTCAG GGGTTCAAAT GATGGTCTTG AAAGTGAAAA GTACTTGGAT
GGAGTTATTG GGTCTTTGTG TCTCTTAGGT TTTGGTGCTG GGTTCTTCAC TTATTCAACT
CAAGTATCAA TTGAAACCGT TACCAACCAT GAATACATGA GTATTGTACT TTCACTTTAT
TTATCCAGTT ACAATATCGG TGCTGCTATT GGTGCTTCTG TCAGTGGTGC CGTTTGGACA
AATGAAATGT ACAAAGCTAT TGCAGCCAAT TTCGAAGAGG CAGGTTTTGA TAGTGAACTT
GCGGCCCTCG CTTATGGATC CCCATTTGAA TTCATTAAAG AATATACATG GGGAACACCA
GAAAGAATTG CTGTGGTCTT GGCTTATGCC AAAGTTCAGA GATATTTATG TATTTCTGGT
CTCGTGTTGT GTTTCCCATT GCTTATGGCA ACATTTTTCT TGAGAGACCA CAGATTAGAC
TCTGTTCAAT CTCTAGAATT GGACAATGAT CAC
 
Protein sequence
GTRKAELLNE QYQSPHLKVC LFVSIFFVAY TYGIESTLRG NIQAYATSSY TQHSLLSTVN 
VIKSVVAAAS QPMYARLSDK FGRLELMLVS IVFYIVGTVI QSQAFDINRF AGGSVLYQVG
FSGVMIMLQI ILADFSNLNW RLVCSFVPAL PFIINTWVAA EVQASLLANH SWNFAIGIWA
FIFPLSCVPL LLCFIHMIWK ARKTDEWQRL KEERTKTPFI QKAVELFWEL DVVGIVLLVC
VFGFILVPFT IAGGVTDKWK EASTLAPLII GFALLPVFVW WEYKYAKFPI SPFPLLKDRG
VWSALIIAIL IDWVWYMPND FMYTVLIVGM RASVKAATRI SSLYSFVSVI VGPLLGLLVV
RVRRLKGFII FGTICWIISL GLLVHFRGSN DGLESEKYLD GVIGSLCLLG FGAGFFTYST
QVSIETVTNH EYMSIVLSLY LSSYNIGAAI GASVSGAVWT NEMYKAIAAN FEEAGFDSEL
AALAYGSPFE FIKEYTWGTP ERIAVVLAYA KVQRYLCISG LVLCFPLLMA TFFLRDHRLD
SVQSLELDND H