Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_09409 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | - |
Start bp | 549169 |
End bp | 550439 |
Gene Length | 1271 bp |
Protein Length | 307 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | DNA-binding protein HGH1, putative (AFU_orthologue; AFUA_3G04260) |
Protein accession | CBF87556 |
Protein GI | 259488251 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.518792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000000106987 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAACAC TCAGATCAGA CAGATTGGTA AATGCAGCTT AGAAACACAG TCTGAATTGA TGATAACACT ATCCAGCCTG CGCAACATTA GTTGGATATT CGGTGTCGAG GCCGGAAATC TTCAAACGTC ACCAGCTTTT GCCTATCCAA GACTTAAAAC TTCTTGTTCG AGACTATACA GTCAGTGAGC ATGCCCCTTT TTGAATTGAA CATATGGGAT TTCATTCTGC CCCTTCTTCA TGGAACTGAC CTGCTGGATA GCCTATTGCG AGCGATGCGT TAACAATTCT CGTCAACCTT TCTGGTGATA AGGAGATCCT AGATAAACTT GCTACTGATG ATGCTTTTAT GGAAACACTC CTCAACAAAG TGACTGTAGG TTTCTGCAAC ATTTGGTTCG ACCAAGGCTA TCCCATTCAC TCTCTCTGTT ACCTCGGTTA TTTTGATCTT TACTCACTGC ATCGGTCCAC TAACAAGCCA GTTGGCAGAA TAACAAAGAG GGAAATGCCG ATGGGATCTG TATGCTGTTT GCCAATCTTG GAAAATCCGA GAATATAAAA GAGCTATTGA CGCTTAAACG TCGAACGGCC AATCCTGTCT CAAACTCCGA GTATGCAATT GACCAGTTGA TGGATTGTTT CGTGAAAGGC GCCGACGGCG CACTCAACCA ACACGCGAAC TACGACTATC TATCTTATCT ATTTGCCGAC TTGTCGAAGC TAGAGGAGGG CCGCAAATAT TTCACAACGA GACAGGATTA TGATGGAGTC GTGCCTGTGA CAAAACTCAC CGTTTTCACG GAGCATGAGA GCACGGTCCG GAGGAGGGGT GTTGCATCGA CCATAAAGAA TGTTGCATTT GAAATACCAT TTCATCCGAC CCTCTTCTCT GAAGACGAGG CGAATCTTCT GCCTTACATA CTTCTGCCAA TTATGGGGCC GGAAGAGTAT AGTGAAGAGG ATACTGCAAA TATGCTCCCA GACCTCCAGT TGCTGCCCCC CGACAAGAAG AGAGAAAGCG ATAATGGAAT TATCGTTACT CATCTTGAGA CGCTTTTATT GTTGACAACT ACCCGGGAAG GACGCGATAA ACTAAGGGCA GTGAACGTCT ATCCCGTTAT CCGAGAATGT CACCTCCGTG TTGACGACGA CGGCGTTCGT GAGGCCTGTG ATAGATGGGT CCAGGTCATT ATGCGAGACG AAGAGGATGA AGGAAACTCA TCGGCAGGGC AGAATCAGGA TGATGATAGA AAGGTCGTGG AATTGTTCTA G
|
Protein sequence | METLRSDRLP IASDALTILV NLSGDKEILD KLATDDAFME TLLNKVTNNK EGNADGICML FANLGKSENI KELLTLKRRT ANPVSNSEYA IDQLMDCFVK GADGALNQHA NYDYLSYLFA DLSKLEEGRK YFTTRQDYDG VVPVTKLTVF TEHESTVRRR GVASTIKNVA FEIPFHPTLF SEDEANLLPY ILLPIMGPEE YSEEDTANML PDLQLLPPDK KRESDNGIIV THLETLLLLT TTREGRDKLR AVNVYPVIRE CHLRVDDDGV REACDRWVQV IMRDEEDEGN SSAGQNQDDD RKVVELF
|
| |