Gene Information |
Locus tag | ANIA_10215 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001307 |
Strand | - |
Start bp | 992454 |
End bp | 994480 |
Gene Length | 2027 bp |
Protein Length | 655 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | peroxisomal targeting signal (PTS1) receptor protein peroxin 5 (Eurofung) |
Protein accession | CBF85028 |
Protein GI | 259486842 |
COG category | |
COG ID | |
TIGRFAM ID | |
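
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; the record itself does not state either convention. The function names `span_length` and `coding_length` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases implied by a protein length, counting one stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the Gene Information table.
gene_len = span_length(992454, 994480)
cds_len = coding_length(655)

print(gene_len)            # 2027, matches the Gene Length field
print(cds_len)             # 1968
print(gene_len - cds_len)  # 59
```

Under these assumptions, the 59 bp that do not code for protein are consistent with the "Is gene spliced | Yes" entry.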
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000612411 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.654696 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTTC TTGGTGGCGC TGAGTGTTCC ACCGCGGGAA ACCCCCTCAC GCAGTTTACG AAACACGTAC AAGATGACAA GTCCCTCCAG CGTGACCGGC TTGTCGGGCG TGCCCCTGGA ATGCAAGAGG GTATGCGCTC GCAGGGAATG ATGGGGGGTC ACGACCAGGT AGATGCGCTC TACTACTAAT TCGTTCTTGG ATAGCTGCTT AATAATATGG TATTTAGATG ATGGACGAGT TCGCTCAGCA ATCAGCTCAA CTCCCCGGCG GCCCGCAGCA GCATATGAGG ATGGAGATGG AGCAAGTGAG ACAGCAACTA GAGCAGATGC ACACAACGCC GCGAACGGGT TCGCCGGGGT GGGCTGCGGA GTTTGACCCT GGAGAACAAG CCCGCATGGA GGCTGCCTTT GCCGGGCCAA AGGGTCCCAT GATGAACAAC GGATCGGGGT TCACTCCGGC CGAGTTTGCC CGCTTTCAGC AGCAGAGCAC CATGAGCGTT CCGCAATCGG CTAGCCCAGT TACCGCCGGC CAGTCACCAA TGATGGGTGG TTACCAGCGA TCTATGGGCA TGGGGTATAT GGGATACGGA GGGATGGGGA TGATGCAGCC TGGATTCGGA CCCATGGGCA TGCAACACCA GCAGCCGGCT GAGGCTTCAA CGCAGGATAA AGGAAAGGGG CGGATGATTG AGCTCGACGA TGAGAACTGG GAAGCTCAGT TCAAAGAGAT CGAGACGGCA GATCAGGGGA AGCTGGATGA CGAGGCCAAT GCAGCCATAG AGGCGGAGTT GAATGATCTT GACAGGTCAG TCCCCACCAC CAGTAGCACC GAAGACCTCA GTCATTTTGA GCGCGTATGG GAAAGAGTCC AAGCTGAGAC GGCAACAAAT AGAAAATTGG CGGAGGATTC TGAGTACAAT ATTGATGATA ATCTCCATAT GGGTGATATG GCCGAATGGG ATGGGTTCGA CAATCTGAAT ACTCGGTTCA GGGAGCCTCG ATTGGGTGAT TACTCGTTCG AGCAGGAGAA TGTCTTCCGC GACATCGCTA ACCCCTTTGA GGAGGGTATG AAGATTATGC AAGAAGGCGG CAATCTGTCC CTGGCAGCTC TGGCTTTTGA GGCGGCGGTG CAGAAAGACC CTCAGCATGT GAAGGCTTGG ACTATGCTGG GTACTGCCCA GGCTCAAAAT GAGAAGGAAC TCCCTGCTAT TCGGGCACTG GAGCAAGCTC TCAAGGTGGA TCCGAACAAT CTTGACGCCC TCATGGGGCT TGCTGTGTCA TACACGAACG AAGGGTACGA CTCAACAGCT TACCGTACCC TCGAGCGCTG GCTTTCCGTT AAATATCCTC AGATCATTAG TCGAGATGAT CTATCCTCGG ACGCCGACCT AGGATTCACT GACCGCCAGA TTCTTCATGA ACGTGTTACC GATCTTTTCA TACAGGCGGC TCAATTGTCG CCGTCGGGAG CGCAAATGGA TCCTGATGTT CAGGTTGGTC TGGGCGTCCT CTTCTATTGC GCAGAAGAGT ACGAGAAGGC CGTGGACTGC TTTACGACAG CATTGGCGAG CACTGAATCC GGAACCACGA ACCAGAGGGA GCAACTTCAT CTCCTTTGGA ACCGCCTGGG AGCTACGCTC GCCAACTCTG GTCGATCAGA AGAAGCTATT GAGGCCTACG AACAAGCTCT GAACATCAAT CCTAACTTTG TCCGAGCGCG CTACAACCTA GGCGTATCGT GTATCAATAT TGGTTGTTAC CCGGAAGCAG CGCAACATCT TCTGGGAGCC TTGTCCATGC 
ATCGTGTGGT CGAGGAGGAA GGAAAGGAAC GAGCTAGGGA GATTGTAGGC GGCAATGATG GTCGCATCAA CGAGGCCGAA CTCAACCGCA TGATTACTGC AAACCAAAGT ACCAACCTTT ACGACACGTT GCGACGGGTA TTTAGCCAAA TGGGTCGACG AGACCTGGCC GATTTGGTGG AAGCGGGAAT GGATGTCAAT ATATTCCGGA AGGAATTTGA GTTTTGA
|
Protein sequence | MSFLGGAECS TAGNPLTQFT KHVQDDKSLQ RDRLVGRAPG MQEGMRSQGM MGGHDQMMDE FAQQSAQLPG GPQQHMRMEM EQVRQQLEQM HTTPRTGSPG WAAEFDPGEQ ARMEAAFAGP KGPMMNNGSG FTPAEFARFQ QQSTMSVPQS ASPVTAGQSP MMGGYQRSMG MGYMGYGGMG MMQPGFGPMG MQHQQPAEAS TQDKGKGRMI ELDDENWEAQ FKEIETADQG KLDDEANAAI EAELNDLDRS VPTTSSTEDL SHFERVWERV QAETATNRKL AEDSEYNIDD NLHMGDMAEW DGFDNLNTRF REPRLGDYSF EQENVFRDIA NPFEEGMKIM QEGGNLSLAA LAFEAAVQKD PQHVKAWTML GTAQAQNEKE LPAIRALEQA LKVDPNNLDA LMGLAVSYTN EGYDSTAYRT LERWLSVKYP QIISRDDLSS DADLGFTDRQ ILHERVTDLF IQAAQLSPSG AQMDPDVQVG LGVLFYCAEE YEKAVDCFTT ALASTESGTT NQREQLHLLW NRLGATLANS GRSEEAIEAY EQALNINPNF VRARYNLGVS CINIGCYPEA AQHLLGALSM HRVVEEEGKE RAREIVGGND GRINEAELNR MITANQSTNL YDTLRRVFSQ MGRRDLADLV EAGMDVNIFR KEFEF
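
The sequences above are displayed in space-separated blocks of ten characters. The sketch below, which is an illustration and not part of the original record, shows one way to strip that formatting and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical. The demonstration uses only the first two blocks of the gene sequence, so its result is not expected to reproduce the 54% GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGTCATTTC TTGGTGGCGC"
seq = clean_sequence(first_blocks)

print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applying the same two helpers to the full gene sequence would give a value comparable to the GC content field.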