Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_05254 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001305 |
Strand | + |
Start bp | 3109256 |
End bp | 3112473 |
Gene Length | 3218 bp |
Protein Length | 1010 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | hypothetical protein |
Protein accession | CBF82237 |
Protein GI | 259485318 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCA CTGCGCCCCG ACCCCCGTTC CTACCGAAGG ACCCCACTGA ATTCGTGCAG CATGTAACCA GTCATTCTGC TGAATGGTTC GAATACTGCA GCCAAGCAGA TCAATATATC GCCGCGGCCG AGACGACCCT TCTTTCGTGG GAGACGGGCA AGGAAGCCCT CCAGATCCAG GCTCTACAAC AAGAGAACGA GCACCTCCAT GACAAGTGCG CCCGTCTGCG CGACGTGATA TCCCGCCGGG ATGCAGTTAT ACAGTACCAG AAGGAGCAAG CCAAGGAAAA AGACATTGAG TTCTTGAAAC TAGCCAAAGA GAAACCCCAG GAACCCCAGC CAGCAATGCC TATAACTGGT ATATCAGATG GACAACCCAA ACCTGGCTCA CCCACACAAA CTCAGGTGTT TCACCAGCTC TCCGAGCGCC TGCCTGACCC GGATTGGTTT GAGGGAGACC GGAAGGACCT CCGCCGCTTT ATCTCCCAGA TCCATGAGAA GATGAATATA AACCGTGACT GTTTCCCGAC CCCACAGAGT AGGATGACAT ATGTCAACAA TCGTCTAAAA GGAGCCCCGT ATGCCCAAAT CTTGCCCTAT GTCAAGAAAG GAATCTGCCA GCTGAAGGAC TACGAGGACA TCCTGGATAT ACTAGATCGG GCCTTTGGAG ACCCAAACCG TGTTAACAAT GCCCGCAACG AGCTGTTCCG CTTCCGGCAG AATAATAAAG AGTTTGGCCT GTTCTTCGCC GAATTCCAAC GTCTTGCCCT AGAGGGAGAG ATGCCTGAGG AGACCCTATC TATACTTCTG GAACAATCAA TAAATCGAGA GCTTAAAGGG ATGCTTATGC ATAATCAACC ACCTACCCAA GATTACCATG ATGAATTTGA TTTGCTAAGT TCTTACAGGA ACTTGAGAAC CGCCGCCGGC ATTATGAAAT CAACCTGCAA TCAGCCAGCA GAAACTACCC TGCAATTACT AGAACTGCTA CTAGTCAGCT GTCAAGAACA AACTACACTA CCCTGCCTAG GACTATAGAG AACCCAGCCC CACAACGCAC ACAGCCTGAT GTACATAATG ATGCCATGGA TCTGTCATCT ATCCGCCAAC ATAACCCTAC ACGTCGCGAG CAGGGAGAAT GCTTCCACTG TGGATCCCCA GAACATATGG TCAGGAACTG CCCACACCCT GATAACCGCC CTCTTAGCAA TCCGCTCTGC CTACCCAGCA TCCAACATAA CCCCATCAAT TAAATCTGAG TCTACCGCTG TCTCCGAAGG CTCCCGCTCT CCATCACCTG GATTCTCGGA AAAAGGGGTA AGCCTGGCCT AAGTCGCGAC CAGGCGCCAC TACCCAAGCG CGTTGTTCAC CTCTCTGCAA GTGCTATTAA AGGAATGTCT GTTGAAGAAG AAACTGCCCG CGCCGACCTG ACTGTCCTGC CTGTTATCCT GACCCAGCAA GAGAAGAGCC TGTCCAGCTA CGCAATGCTA GATACTGGAG CTGACGGGAA GAGGTTTATT GACCAAGAAT GGGCAGAAGA CAACCACCTT GAGCTGCTGC CCCTGAAAAA CCCAATCCAC TTGGAAAGCT TTGACGGGAG AGAATCCGAA GGAGGGCCGA TAACCCACTA TGTTAGAATA AACCTGACAA TCCATGACCA TCATGAAAAG AAGGCTTGTT TCTTGGCTAC ACAACTAGCC CATTACCCAA TAATCCTTGG AATGCCATGG TTAGAGACTC ATGACCCCCG CTGGGGGTTT GCAGAGCACA CCTTAATATT TGACAGTGCC TATTGTCGAC AGAATTGCAA TATACCTGCC CAACCAGCCA AGATCAAGGC CCTGCGTGAC GTGCCTGCCC GAAGCTGCCA GAAGAACCTG ACTTCCTGTC CCAAAGGATT GGAGAAACAA GATATTGCCC TAGTCTCCCT CCGCGCCTGC TCAGCTTACG CCCGTAGGGG CCATGCCCTG TTTACAGCCA CTATTGGGAA TATTGACGAG GTATTGGCTA AGAGGTCAGG GGATGGTAAC CCTGAAGACC TACTACTACC AGAATACAAA GACTATGCAG ATATCTTCTC CCCTAAGGAA GCTGATAAGC TGCCCCCACA TCAGCCATAT GACTATTTAA TAACTCTAAT AGATGGAAAG ACCCCACCAT TTGGCCCATT ATATGGAATG TCCCGGGATG AACTAGTTGC ACTACAGGAG TGGATTATGG AGAATCTGAG GAAAGGCTTT ATTCGCCCAA GCTCGTCGCC AACAGCCTCA CCTGTCCTAT TTGTTAAAAA ACCCGGCGGA GGTCTATGCT TCTGCGTGGA CTACCAAGCT CTGAACGTGA TTTTGGTTAA GGACCAATAC CCTCTGCCAC TTGTCAAGGA GACCCTGAAT AATCTAAAAG GGATGAGGTA CTTTACTAAG ATTGACATTA TTTCCGCATT TAATAACATA CGGATCAAGA AGGGACAGGA ATATCTGACC GCGTTCCGCA CCTGCCTGGG GCTGTATGAA TCCTTAGTTA TGCCCTTTGG CCTTACCGGC GCTCCAGCAA CATTCCAGCA CTATATAAAT GACACCCTGC GAGACTATCT GGACATCTTC TGTACTGCTT ACCTTGACAA CATCCTGATC TACAGTCAAA CCAGGTCTGA ACATATTCAG CATGTCCGAA AAGTCCTCCA AAAGCTTAGA GAAGCTGGCT TGTTTGCAAA GCTAGTGAAG TATGAATTTA CTGTTCATGA GACCAAGTTC CTGGGCCTGA TAGTGGCTAG AGATAGAATC AAGATAGACC CTGAGAAGGT TCAAACCATT GCAGCCTGGG CCACGCCAAC CTGCATTACA GATATACAGG CATTTATTAG GTTCGCCAAC TTCTACCGGA GATTCATTAA GGATTTCTCA AAGATCATTG CCCCGCTAGT TAACCTGACT AAGAAGGATG TCGAGTTTCA ATGGACCCCA ACCTGCCAGC TGGCCATGGA CGCCCTGAAG AAGGCCTTTA CCAGCGCCCC TGTTTTGAAG CCATTCGACT GGACCCAAGA TATCATTCTT GAAACTGATG CTTCTGACTT TGTCTCTGCT GGTGTCCTGT CCCAGTATGA TGATAATGGC GTCTTACACC CTGTGGCCTT CTTCTCTAAG AAGCACTCTG CCACAGAATG CAACTATGAA ATCTATGACA AGGAACTTCT AGCAATTATC CGCTGCTTTG AGGAATAG
|
Protein sequence | MPATAPRPPF LPKDPTEFVQ HVTSHSAEWF EYCSQADQYI AAAETTLLSW ETGKEALQIQ ALQQENEHLH DKCARLRDVI SRRDAVIQYQ KEQAKEKDIE FLKLAKEKPQ EPQPAMPITG ISDGQPKPGS PTQTQVFHQL SERLPDPDWF EGDRKDLRRF ISQIHEKMNI NRDCFPTPQS RMTYVNNRLK GAPYAQILPY VKKGICQLKD YEDILDILDR AFGDPNRVNN ARNELFRFRQ NNKEFGLFFA EFQRLALEGE MPEETLSILL EQSINRELKG MLMHNQPPTQ DYHDEFDLLS SYRNLRTAAG IMKSTCNQPA ETTLQLLELL LRTQPHNAHS LMYIMMPWIC HLSANITLHV ASRENASTVD PQNIWSGTAH TLITALLAIR SAYPASNITP SIKSESTAVS EGSRSPSPGF SEKGQEKSLS SYAMLDTGAD GKRFIDQEWA EDNHLELLPL KNPIHLESFD GRESEGGPIT HYVRINLTIH DHHEKKACFL ATQLAHYPII LGMPWLETHD PRWGFAEHTL IFDSAYCRQN CNIPAQPAKI KALRDVPARS CQKNLTSCPK GLEKQDIALV SLRACSAYAR RGHALFTATI GNIDEVLAKR SGDGNPEDLL LPEYKDYADI FSPKEADKLP PHQPYDYLIT LIDGKTPPFG PLYGMSRDEL VALQEWIMEN LRKGFIRPSS SPTASPVLFV KKPGGGLCFC VDYQALNVIL VKDQYPLPLV KETLNNLKGM RYFTKIDIIS AFNNIRIKKG QEYLTAFRTC LGLYESLVMP FGLTGAPATF QHYINDTLRD YLDIFCTAYL DNILIYSQTR SEHIQHVRKV LQKLREAGLF AKLVKYEFTV HETKFLGLIV ARDRIKIDPE KVQTIAAWAT PTCITDIQAF IRFANFYRRF IKDFSKIIAP LVNLTKKDVE FQWTPTCQLA MDALKKAFTS APVLKPFDWT QDIILETDAS DFVSAGVLSQ YDDNGVLHPV AFFSKKHSAT ECNYEIYDKE LLAIIRCFEE
|
| |