Gene Information |
Locus tag | ANIA_02804 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001306 |
Strand | - |
Start bp | 2711765 |
End bp | 2713465 |
Gene Length | 1701 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | beta-galactosidase, putative (AFU_orthologue; AFUA_5G00670) |
Protein accession | CBF83980 |
Protein GI | 259486273 |
COG category | |
COG ID | |
TIGRFAM ID | |
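The coordinate and length fields above are related by simple arithmetic. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the record does not say so, but this matches the reported Gene Length) and that the gene span runs from the start codon through the stop codon, the numbers can be cross-checked as follows. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption here)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the Gene Information table.
gene_len = span_length(2711765, 2713465)   # Start bp, End bp
cds_len = coding_length(466)               # Protein Length in aa
non_coding = gene_len - cds_len            # remainder inside the gene span

print(gene_len, cds_len, non_coding)       # prints: 1701 1401 300
```

Under these assumptions the remaining 300 bp would be intronic, which fits the "Is gene spliced | Yes" entry, though the record itself does not list intron positions.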
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.409324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.343793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTGC CTCACCTCGC TCAAACCCCT AATGGGTTCC AGCTTGTCGT CAAAGACAAG CCGGTCCTCC TCCTTCCGGG CGAACTACAC AACTCCTCGC TCTCCTCTGC CAGATACATG TCCACTGTTT GGAATTACAT GAAAGAACAA TGCATCAACA CCCTTCTCGG CGCCGTTACC TGGGAAATGG TCGAGCCCGT TGAGGGTCAA TTTGACTTTA CCGAACTGGA TTCGGTCATT ACTGACGCAC GGAAACACGG GCTATATCTA GTATTACTCT GGTTCGGAAG TTATAAGAAC GGAGTCTCGA CGTATGTACC ACCCTGGGTG AAGAGAGATA TTGTAAGGTT TCCCAGGGTG CAAGTTTGGG ATGCGGACAA AGGTCGGAAG AGGACAATCG AGATGATCAG TCCGTTCACC GAGGAAGGGT GGAAAGCTGA TGCGCGCGCG TTCGGGAAGC TAATGGAGCA CCTGAAGGAA TTTGACAGCC AACATAGTAC AGTCGTCATG GTGCAGGTTG AGAATGAGAC GGGCCTCCTG GGAGACTCGA GGGATCGGTC GACTGTGGCG GAGAGGGTAT TTGCAAAAGG TTTTCCTCCT GAGCTTCTTC GTCATCTCTC TACATCCAAG AGTCTACATC CTCGGTTTAT CGAACGCTTT GGAAACAGAC TTCCACCCTC GGGTGAGATC AACGATGGTC AGACCTATGC CTGGGAAAGC ATCTTTGGCC CTGGCACCGC AGCAGACGAA GCCTTCATGG CCCACTATAT TTCAGAGTAC GTAGGCCACG TCGCTGATGC GGGAAAGAAA GCATACCCGA TCCCTCTCTA CACAAACACA TGGCTAAACT TCGATGACCC CTCGGTTTTG GATCTCCATG GGTATCCTAA CGTTGTTGGC GGAGGCGCGC GGCCCGGAAT ATACCCCAGT GGAGGGCCCT GCCCGCACGT CTCCGATATA TGGCGCTTCA ACGCACCGGC ACTCGACTTC CTCGCGCCAG ACCTATATTT CCATGACTAT GAACGGGTAT GTCAGGATTA CACAGTCCCC ACAACGAATC CACTGTTTAT ACCTGAACAA CGGCGCGATG ACCATGGAGC AAGAAGGGTA TGGCTGGCGT ACGCAAGTTA TGGGGCACTA GGCACGAGCC CTTTTGGTGT TGACACAGAA GCTACAAAAA TCGGAAAAGA ATATAGGCTT CTTTCACAGA CAGCAGGGTA TCTGCTCAAC TCCCCCCCAA GGCAGCGGAT GGGATTTTTC TTCGATGAGC TGCCGGAGAC CGGTTCACCA AAAGGAAAGC AGAAGTGGAC AAAGGTGTTC GGAAACATCG AAGTGATTAT TGAACGCGCC TTTGTTTTCG GAAAACCTGG TCCTGGTGGG GGCATGATAA TCCAGCTGTA TGACGAAACG TCATACAGGT TTCTTGTGGT TGGACGCGGA TTCCAGGTCC GGTTTCGTGG GCTAGACGAC ACCGTTACAT TCACAGGGAT CTTGGAAGCG CAGGAGAAGG AGGTCGATAG TGAAACAGGC GAGCTTAGGA CGTTGAGAGT GTTGAATGGC GATGAGACAA GAAGCGGCGA GTTTCTTATC ATGCCGAACG AGGACCCAGA CTATGGAGGG TTCCCTATTG CTGTCACCAT CCCGGCGAAA ACATATATTG CGGAGGTAGA GGCCTATACT ATTTGCGAGA AAAAAATCTA G
Protein sequence | MSLPHLAQTP NGFQLVVKDK PVLLLPGELH NSSLSSARYM STVWNYMKEQ CINTLLGAVT WEMVEPVEGQ FDFTELDSVI TDARKHGLYL VLLWFGSYKN GVSTVQVWDA DKGRKRTIEM ISPFTEEGWK ADARAFGKLM EHLKEFDSQH STVVMVQVEN ETGLLGDSRD RSTVAERVFA KGFPPELLRH LSTSKSLHPR FIERFGNRLP PSGEINDGQT YAWESIFGPG TAADEAFMAH YISEYDYTVP TTNPLFIPEQ RRDDHGARRV WLAYASYGAL GTSPFGVDTE ATKIGKEYRL LSQTAGYLLN SPPRQRMGFF FDELPETGSP KGKQKWTKVF GNIEVIIERA FVFGKPGPGG GMIIQLYDET SYRFLVVGRG FQVRFRGLDD TVTFTGILEA QEKEVDSETG ELRTLRVLNG DETRSGEFLI MPNEDPDYGG FPIAVTIPAK TYIAEVEAYT ICEKKI
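The sequences above are printed in space-separated blocks of ten residues. A small sketch, assuming the blocks only need their whitespace removed, shows how such a string could be joined into a contiguous sequence and how a GC fraction like the one in the GC content field might be computed. The helper names are hypothetical, and the example input is just the first two blocks of the gene sequence.

```python
def normalize_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used as a short illustration.
example = "ATGTCCTTGC CTCACCTCGC"
seq = normalize_sequence(example)

print(len(seq), gc_fraction(seq))  # prints: 20 0.6
```

Applying the same two helpers to the full gene sequence would give a value to compare against the reported GC content; that comparison is not carried out here.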