Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_02954 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001306 |
Strand | + |
Start bp | 2221742 |
End bp | 2224583 |
Gene Length | 2842 bp |
Protein Length | 912 aa |
Translation table | |
GC content | 61% |
IMG OID | |
Product | extracellular serine-rich protein, putative (AFU_orthologue; AFUA_3G07870) |
Protein accession | CBF83680 |
Protein GI | 259486104 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0326959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCTC CCACGCTGGT GCTGCAGATC GGCTGTTTCT TTGGGATATT TACCTGCGAC ACGCCCCGGG GAGTCTGGCC TGAAGGACGC CCGACTGTTC TTTCCCCGTG GCCTCCCAGT CCCAGCTCGA CGAGCTCAAC TACTTCGGTG CCCCCGGCTG CTTCGTCTGC ATCTACGATC AGTGTCCCGA CAGTCACCGG CCCGACTGAG CCTACAGTGA CGGTGACTGC AACGTCCTCT GCATCTACTA CTCTGACGAC GCTCACGACT CTGCCCTCGC CCACAGACTC TGCTACTGCC ACTTCTGCTT CTACTTCTGC TGATTCCACT ACTATCTCCA GCACATCCAG TGCTTCTGAC CCGGCTGGCT CTTCCACTCT CGTTCCCACC TCAGAGCCTA GCGCCTCTGA GCCTAGCACT TCAGAGCCCA GTTCATCTAC CGGTACTCCA GGGACTTCGA CCAGTGATCC TGATCCCACG AGTGCCACTG AGCCGTCCAG TACTTCGACC TCGACTCCCG AGCCTTCAGG TACTGTGACT TCAAGTACTT CGACTTCGAG TCCTAGTACT AGCCCCCCCA GCACTTCCAG CACTTCCAGC ACTAGCCCGT CTAGTACCTT GACCTCTACC TCATCCCCCT CTACGTCTTC TACCTTAACG CCCAGTGCCA GCCCTACCCC GTCTACTACA GTCACCCCGT CCCCCAGCAT GTCTGTTACT TCCACTAGCG TGGCACCCTC GGCCACCGGG TCCCTGGCCA ACAACATCCT TGTCATCGCC CGCGACTCTA CGCAGGCCAG CGTTGCCTCG TCCGGCCTGA ATGGCTACGG AATCCCCTTC ACGACGCTCC TGGTCCCTCA GGCCGGCGTC GAGCTGCCGG CCCTGAACTC CTCCTCAGGG GGCAACTTTG GCGGTATCGT CGTTGCCGGC GAAGTCAGCT ACGACTACGG TAACGATAAC TGGCGCTCCG CCCTGACGGA CGACCAGTGG AACCAGCTGT ACGCGTACCA GCTCGCCTAC GGCGTGCGCA TGGTCCAGTA CGACGTCTTC CCCGGTCCCA ACTTCGGCGC CTCGTCCATC GGCAACGGCG GCTGCTGCGC CGACGGTGTC GAGCAGCTCG TCTACTTCAC GGATACCAGC GACTTCCCCA CGGCCGGCCT CAAGACCGGC TCCAGCGCGG GCGTCTCGAC CTCGGGTCTC TGGCACTACA CGGCTTCCGT CACCGACACC AACACCACCA AGGCGATTGC GTCGTTTGCC TCCGGCGGCG GGGTTGACGG CGAGTCTGTG GCCGCGGTCA TCAACAACTT CGATGGCCGC CAGCAGATGG CCTTCTTCAT CGGCTTCGAC ACTGTCTGGT CCCAGACCTC CAACTACCTG CAGCACGCCT GGATCACCTG GATCACGCGC GGCCTGCACG CCGGGTACCG CCGCGTGAAC CTGAACACCC AGATTGACGA CATGTTCCTT GAGACCGATA TCTACCAGCC CAGCGGAACC ATCTTCCGCA TCACGACCGA TGATATGGAC GGCATCACCA ACTGGCTGCC CAGCATCCGC GCCAAACTCA ACGCTGGAAG CACCTACTTT GTCGAGATCG GCCACAACGG CAACGGCAAC ATCGAAGCCG GCACCACGGC GGCCGGTGAG AGCACCTGCT CCGGCGGCGC CATCGAGTAC GACTCGCCTC CGGACACGGC CCTCGAATTT GTCAAGCCTA TCGGCACCGG CACTGACGTC TGGCCAACCT CGCCGACCAA CTTCACCTGG ACAACGACCT GCATGAACGC GGACAGCCTG CTCATATGGT TCCAGAACCA CCTGGACGAC TACGCCTACA TCAGCCACAC CTTCACCCAC TTGGAGCAGA ACAACGCCAC CTACAGCGAC ATCTACAAGG AGATCTCCTT CAACCAGGTC TGGCTCGAGC GTGCTGGCTT CTCGGCCGCG TCCAAGTTCA CCTCCAACGG TATCATCCCG CCCGCCATCA CCGGCCTGCA CAACGGTGAC GCTCTCCGCG CCTGGTGGGA CAATGGCATC ACCAACTGCG TCGGCGATAA CACGCGTCCT GTCCTCCTCA ACAGCGAGAA CGAGATGTGG CCCTACTTCA CCACTGAGGC CGCCGACGGG TTCGCGGGTA TGCAGGTCAA CCCGCGCTTC GCGACTCGAA TCTACTACAA CTGCGACACC CCTGCCTGCA CGACACAGGA ATGGATCGAT ACCTCTGCCG GCGCCGGAGA CTTCAATGAC CTGCTGGACA CAGAGCGCGC CGAGGTCCTG CGTCACTTGT TCGGCCTCCA TCGCGACCCG TACATGTTCC ACCAGGCCAA CCTGCGCAAT GTCGGGATCG ACCCCATCAC CGTCGGCTCT GAGACCGGCC AGTTCTCCAT CTTCCAGGCG TGGGTTGAGA CCATCGTTGC CGAGTTCACC CGGCTGGTGG ACTGGCCCAT TGTTACGATT ACGCATCAAG AGGTACACTT CCCTCTGCTT TCCCTCTGCT TCCCCTGCTT TTCTACCCTC TGCTTTAATT CTTGCGCTTG CTTTCCTCAT AGTTACAGCT TGCTAACTAA AGTAGATGTC CGCCGAATTC CTCGCCCGCT ACACCCGCGA CCACTGCGAC TACGGGCTCA ACTACATCCT CGACAACGGC GCTATCACCG GCGTCACCGT GACTGCGAAC GGCAACACCT GCGACGCGAA CATCCCCGTT ACCTTCCCGA CCGCGCCTAC GGACACCCTA GGCTTCGCGA CGGAGCAGCT GGGCAGCGAC CCCTTCACCG TATGGGCGCA GCTCTCCGGC TCGCCCGTGA CCTTCTCCCT TGCCACGCCA ATTGCACTCT AG
|
Protein sequence | MVAPTLVLQI GCFFGIFTCD TPRGVWPEGR PTVLSPWPPS PSSTSSTTSV PPAASSASTI SVPTVTGPTE PTVTVTATSS ASTTLTTLTT LPSPTDSATA TSASTSADST TISSTSSASD PAGSSTLVPT SEPSASEPST SEPSSSTGTP GTSTSDPDPT SATEPSSTST STPEPSGTVT SSTSTSSPST SPPSTSSTSS TSPSSTLTST SSPSTSSTLT PSASPTPSTT VTPSPSMSVT STSVAPSATG SLANNILVIA RDSTQASVAS SGLNGYGIPF TTLLVPQAGV ELPALNSSSG GNFGGIVVAG EVSYDYGNDN WRSALTDDQW NQLYAYQLAY GVRMVQYDVF PGPNFGASSI GNGGCCADGV EQLVYFTDTS DFPTAGLKTG SSAGVSTSGL WHYTASVTDT NTTKAIASFA SGGGVDGESV AAVINNFDGR QQMAFFIGFD TVWSQTSNYL QHAWITWITR GLHAGYRRVN LNTQIDDMFL ETDIYQPSGT IFRITTDDMD GITNWLPSIR AKLNAGSTYF VEIGHNGNGN IEAGTTAAGE STCSGGAIEY DSPPDTALEF VKPIGTGTDV WPTSPTNFTW TTTCMNADSL LIWFQNHLDD YAYISHTFTH LEQNNATYSD IYKEISFNQV WLERAGFSAA SKFTSNGIIP PAITGLHNGD ALRAWWDNGI TNCVGDNTRP VLLNSENEMW PYFTTEAADG FAGMQVNPRF ATRIYYNCDT PACTTQEWID TSAGAGDFND LLDTERAEVL RHLFGLHRDP YMFHQANLRN VGIDPITVGS ETGQFSIFQA WVETIVAEFT RLVDWPIVTI THQEMSAEFL ARYTRDHCDY GLNYILDNGA ITGVTVTANG NTCDANIPVT FPTAPTDTLG FATEQLGSDP FTVWAQLSGS PVTFSLATPI AL
|
| |