Gene Information |
Locus tag | ANIA_03044 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001306 |
Strand | + |
Start bp | 1970031 |
End bp | 1971687 |
Gene Length | 1657 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | Endo-arabinanase [Source:UniProtKB/TrEMBL;Acc:Q1HFU2] |
Protein accession | CBF83510 |
Protein GI | 259486009 |
COG category | |
COG ID | |
TIGRFAM ID | |
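The Gene Length field is consistent with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. That convention is an assumption about this record, not something it states. A minimal sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bp needed for a protein: three bases per residue plus one stop codon."""
    return protein_aa * 3 + 3


# Values copied from the record above.
span = gene_length(1970031, 1971687)
cds = coding_length(386)
print(span, cds, span - cds)
```

The coding length implied by the 386 aa protein is shorter than the gene span, which agrees with the record marking the gene as spliced. The difference is only a rough upper bound on non-coding sequence within the span.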
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CCTATAGAAC CCTCATACTG CCAATGGTCC ATATCACCCT GCCTGGGCTG CTTCTCTGCC TCTGCCTCTA TCTATCTGTC GCTCCCGCCA ATCCGCTAAA CGCCCACGCC CGCGTTTCTG ACACGACCGC CTTCCCGCTC CCCAACGAGG GCCATGTCGT TGCCCACGAT CCCAGCATCG TCCGCCACCA CGAGCACTTT TATCTCTTCA AGGGCGGGAT TCATATTCCC GTATTCCGTG CTTCGAATCT GTCAGGGCCA TGGGAACGAC TCGGCACAGT GCTGAACGGG CCCAGCCTGG TCCAGAAACA GAACCAGCGG CGGCCATGGG CGCCCATGGT AACGCAGTGG AAGAACCGGT TTTACTGCTT CTACTCGATC AGCCAGAATG GGAAGCGCAA TAGCGCCATC GGAGTCGCAT CGAGCGATTC CGTAGAACCG GGCGGCTGGA CCGATCACGG GCCGCTGATT AATACGGGAC ATGGGCCGGG GTCGGGGGTA TACCCGTTCA ATGTATCGAA TGCGATCGAT CCGGCGTTCT TCGCCGATCC CATCACGGGA CAGCCATATC TGCAGTATGG GAGCTATTGG AAGGGGATTT TCCAGGTGCC GCTAGCGGAG GACTTGCTGT CGGTTGAGAA TGCCACGCAT CCGAACACGG ACCATCTTGT TTTTCTGCCG AAGAAGAAGC CGAAGCCGAA CGAGGGGGTT TTTATGAGCT ACCGGGCTCC GTATTATTAT GCTTGGTTTA GTCACGGGCA GTGCTGCCAT TTCAAGACGC AGGGGTTCCC GAAGGAGGGC AATGAGTACG TACTCGGGCT CGCGCTCGCG CTCGGTGTTG GAAGAAATTG GCTCGGCTTG GCTAACTGGC TGGACTACGT GCACAGATAT AGCATTCGTG TCGGGAGGTC AACGAGCGTT CATGGGCCGT TCGTTGATCG TGACAACAAA GATCTACTTA ACGGAGGCGG CTCTGTAGTA TACGGATCGA ACCACGGGAA GGTCTATGCT CCCGGTGGGC TGGGCGTCCT GCCTGGGGCC AACGGTGAGC CTGACGTGCT GTACTATCAT TATCGTAGGT TGCCTCTTCC CGGCTGGCAA GGCAGCGCTG TTGACATACT CTTATAGACA ATGCCAGTAT CGGCTTCGCC CAGGGTGTAC GTAACCCCGA ACTTCTTACT CACAATTGTC ATGTCTACTT CTTTTTTTTT TTCTTCCCTT CCTTTTTCTG ACAGTCATCG CAGGATGCAC GATTAGGCTG GAATTATCTT GACTATGTCG ACGGATGGCC TGTCCCGAGG TACGTCCGCA CTATACCGTA TCACCAGATC AGAGAATCGA TTTCAGCTGA CGCTTTATAC ACCAGAGCCC CTTCAAATCC CGGCAACAGC CTCCAGCCCC CTTCGTCTGT CTCTCTCCAG ATCGTCGCAT TCCTGTGTAT GTGTTGCCTA GCCATTATAC TTGCGCTCGC CTATAAGGGA GTAATGACTA TTGGCCGCAA GACAATGGTG CCCCTTGTTG CAGTTTTTTT GATTGCGTCG GCTCTTTGGA CCCTACATCG AACCCGCACC CAAACCTACG CCCAGTACTG ACTCGACGTT GTGAGCGTTG GCTTCTAACA TCGAACAACA GGTTTGGTGA TTCTGTTCAC TTTATGA
Protein sequence | MVHITLPGLL LCLCLYLSVA PANPLNAHAR VSDTTAFPLP NEGHVVAHDP SIVRHHEHFY LFKGGIHIPV FRASNLSGPW ERLGTVLNGP SLVQKQNQRR PWAPMVTQWK NRFYCFYSIS QNGKRNSAIG VASSDSVEPG GWTDHGPLIN TGHGPGSGVY PFNVSNAIDP AFFADPITGQ PYLQYGSYWK GIFQVPLAED LLSVENATHP NTDHLVFLPK KKPKPNEGVF MSYRAPYYYA WFSHGQCCHF KTQGFPKEGN EYSIRVGRST SVHGPFVDRD NKDLLNGGGS VVYGSNHGKV YAPGGLGVLP GANGEPDVLY YHYHNASIGF AQGSSQDARL GWNYLDYVDG WPVPRAPSNP GNSLQPPSSV SLQIVAFLCL VILFTL
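The sequence fields above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before a sequence is measured or compared with the GC content field. The following is a minimal sketch under that assumption; the function names are hypothetical and the example string is just the first three blocks of the Gene sequence field:

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace between display blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three display blocks of the Gene sequence field.
example = "CCTATAGAAC CCTCATACTG CCAATGGTCC"
print(len(clean_sequence(example)))
print(gc_percent(example))
```

Applied to the full Gene sequence field, `gc_percent` gives a value that can be compared with the reported GC content, bearing in mind that the record rounds to a whole percent.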