Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_04314 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001303 |
Strand | + |
Start bp | 2418838 |
End bp | 2422012 |
Gene Length | 3175 bp |
Protein Length | 998 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | CBF77785 |
Protein GI | 259482883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00374749 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.893249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAC GGTGAGTTGC CTGCTTTGCA TTCTTCAGAA CTGTGTTCAA AACCTCAATG GGCTGCGCCC TCCCCGCCCG TTTCAGTTCT CGCCTGCTTT GCACTGGTTT AACTCGCAAA TCACTATTTT TACTGTGCAT ATTTCAAGTC GCTGGGTCGC TGATTTCACG TCTAGCTCTG TGCGATTAGC ACTGTCGGTA TCGCTGCCAC CGGCTTTGCG CTCGGTCGCG TCGAACGGAC ACCACCCCAA GAGCCGGCAA ATACACAAAG CGCTCCGTTA CTAGATTCCG ATACAGTCCC CTCATTCGAT GCAGGACTCT CGAGGACATT TGCGACAACG GAAACACAAT CTCCCGTTAA CCCTAGCTTT CCAGAAACTC GAGAGAATGC TGCTTCCCGC AAAGACCAAG TCGAAGGCGA AAATATACCC AGGAAGACAT CATTTGCAGG TATAAGACTA CGTCGGCGTG GTCAGACCTT GACCATCCCA CCGATACAAA CAAAAGAAAA TCTGCCGAGT CCGTCGCATG TCCGACGACC GTCCTCTTCC TGGCTTCGGC GACTGTCGTT CCAACCTGAT AAGCGGTTCT CACTCCAGAG TCCGGACACC CCATCGTTTC CGGAACCAGC CAGTCCAGCT TTCCCACGTC CGTCCAGTCA ACGCAGGGTA CCCAATAAAC TTGTCAAACG ACCAAAATCC CAGCATTCCA ATGCGAGCCC TTTCTTCGCA CACACCATCT CTTCACCAAC ATCTAGCGCT CTACGCAGAC CTGTAACAAG TTACCAGCGA TCTGAGACCT TCAGTCACAA GGCAACCCAT AGCTTGAGTT TCGAACCCAG TTTGGCGCTA GGTCCTCTTG CAGAGCCTCC TTGCCACAAT GCTACATCGG AAAACGACCC CTGGCAACCG TACTTGGTAC CGAAACCTGA CACATCGTCC GAGCGACTGG ATCGTAGGTT TTCAACATCA ACAAAGCCTC CAGAGACAGC AGTGCGACGA ATCCTTCCAC AATCTGATGC TGCTCCGGCC CTTATCCTAG CAACATCGAT CATAAAAAAG GATGGTGCGG TGAAAGCTGT ACCGGAAGGG CCTGCAACTC AACCTGTTGA GTTTCGCAAT CCGTTCGAAC AAGCGCCATT TGATCTAAAG ACACACCTAG CTGAAACAAA TGCTTGTCAA TCGCCCTCTG AGGAGGTGCG TCCAAAGTCA CCTGGAGAAT CTGCAGGGCC ATTAGATATC AGGGTGTTAC GCAATGGCTC ATTTACGGGC CCGAAAAGAA GGGCAGCATC GACCCCTTTA CCTGAACAAT CAAATGTGGA AGGAGCTATC TGGGTCTCAC CACGGGCACC CGAAAGGCGT AACATTACCG ACCCAAATGT CTTCAGGCGA CCGTCAACCA ATCCCGCAAA CGGTGGGCTT TCATCTTTGT ACACGTCAGG GATTATGGGG CCGCAGTCGC GGATTCTCTC GTCGTACCGC AAAGAATACC GGGATATCGG ACAGAGCGCC TCGGCAGACG GGTCAGCCCT ATCTTCACTG TACAGTTCGA TTCGTCCACG CTCTAAGCGC CATTCTATCG CTGCTTCAGA CCCAGCATCA ACCGTCATAG GTTCCGATGA TACTCGGGTC TTCACTTCTG GTGAGGAAGA CGAAACTGAC TTTATGAGCG ATACTGCTTT CGACTCAATA CGTACTCATT TTACCACCGA CAGTTTTTGT TTCCAATCAC CTCGAATCGA GACCATCTTT GACAGAAAAA TTCCTCCTGG GAACACGATT GATCCATCCG CTGGTTCGAA CGATATGAGT CCATCTAGCG TTCTCCTCTC TGCACCAATA AAGTCACTTA GTTCTGAAGA GGAAAGAAGA ACAATTGTGA CCTCCTTCAC GCCGGCAACA CAGATCATCA AGAATGCTCA TGAAGCAGAT AGCGAAGTGT CCTTCCCATC TGATTTGAGC GACGATGATG ACTCAAAAAG TATGGTCGCC TCGTTACCCG GAGATAAAAC CATCCGCCCT GTTAGGCCTC ACGTTGGCGC TCTATCCTTG GTTACCTCCG GCGGCCAATA TCAGCGAGAC ATTCATGGGT CAGGCGGAGT GACCGGAGAA TCCCATGAAA CATTGCCCAA GATGAACATA TTCGACTGGT CCGAACATCC AAGGCCTGAT CGTGAGGGGT CCGGACCTGA CGGTCGCCCG CGCACCGTTC ACGGAAAGCA TGGCCCAGGC ATGCGTGGCA GTCGAGCAAC CGGACGGAAA CCGCCCAGTA CCCTTCATTA CCGTAGCCAA AGTGTTCCTG TTGCAAGGGA GCCCGCTATG CCAAACGAGT CAAGGCAATC TTCCGGAAAA TTTGGGACCT GGGGCTTGGG TAGTAAAGGT GTCAGTGAAG ACTGGGACAG TGACTTCGAG TTCGAAGACA AGGACAAGGA CGAGAATGCG ATGAGTGAGA ACATCAACCC AAACAAGAAT GTTAGTCGTC GGAGCGTGAT AGTACCTCAA GCGATCATGG AACGTCAAGC CAGTCTCCAC GGCCAATATG GTCAGGTTCA AGAGCTAACT CTGTTGGTGG AAGAATTGAA ACGCTTGAGG CATCAAGCCA GCTTTCTGGA CATTGTTCGG GGTCCGTCAA ATGAACTTTG GAAAGAAGCT GAAGGGATCG TTGATCTAGC AACCCTCGAC GATGACGACC ACAATGAATC CCCTCCCAGG TCGCCATCAT CACTAACTTT CAGCTTTGAT GAGTCTGAAG GCGAATCCTC CCAGATAAAT GACCCTTGGA AACGCGTTAG TGGAGATTCA TGGAGGGCCT CGCTTTCAGA AAACTCTAGC CTCCGTCCGA CAACGTCTCC GGGACCGGAG CAGACAGTAT TCTCCACAAA AGCAAACTCC GTGCTTGATC TGATCTATCA GCAACGCCTT ACCCAGAATC CCACAAGTAT CGACACCCAC CTCCCAAGGT CAAAGAAACT TCCATTCGAT ACGCAGTCTC TCCGTGACCT TGTTGTCCGA GCCGGCGTGG TAACCCGAGC GTTGAAGGAA GTGGTTCGAA AGGCAGAAGG AGTTGCAAAC GGTTCAGAAG AAAACACGCA CCCATCACAT CCACCATTCA GTCGCATTTT CGAAACTCCT TCGCATGATG ATATCTCGAA CTTTGAGACC TCCTGCATTA GCTGA
|
Protein sequence | MYQRTVGIAA TGFALGRVER TPPQEPANTQ SAPLLDSDTV PSFDAGLSRT FATTETQSPV NPSFPETREN AASRKDQVEG ENIPRKTSFA GIRLRRRGQT LTIPPIQTKE NLPSPSHVRR PSSSWLRRLS FQPDKRFSLQ SPDTPSFPEP ASPAFPRPSS QRRVPNKLVK RPKSQHSNAS PFFAHTISSP TSSALRRPVT SYQRSETFSH KATHSLSFEP SLALGPLAEP PCHNATSEND PWQPYLVPKP DTSSERLDRR FSTSTKPPET AVRRILPQSD AAPALILATS IIKKDGAVKA VPEGPATQPV EFRNPFEQAP FDLKTHLAET NACQSPSEEV RPKSPGESAG PLDIRVLRNG SFTGPKRRAA STPLPEQSNV EGAIWVSPRA PERRNITDPN VFRRPSTNPA NGGLSSLYTS GIMGPQSRIL SSYRKEYRDI GQSASADGSA LSSLYSSIRP RSKRHSIAAS DPASTVIGSD DTRVFTSGEE DETDFMSDTA FDSIRTHFTT DSFCFQSPRI ETIFDRKIPP GNTIDPSAGS NDMSPSSVLL SAPIKSLSSE EERRTIVTSF TPATQIIKNA HEADSEVSFP SDLSDDDDSK SMVASLPGDK TIRPVRPHVG ALSLVTSGGQ YQRDIHGSGG VTGESHETLP KMNIFDWSEH PRPDREGSGP DGRPRTVHGK HGPGMRGSRA TGRKPPSTLH YRSQSVPVAR EPAMPNESRQ SSGKFGTWGL GSKGVSEDWD SDFEFEDKDK DENAMSENIN PNKNVSRRSV IVPQAIMERQ ASLHGQYGQV QELTLLVEEL KRLRHQASFL DIVRGPSNEL WKEAEGIVDL ATLDDDDHNE SPPRSPSSLT FSFDESEGES SQINDPWKRV SGDSWRASLS ENSSLRPTTS PGPEQTVFST KANSVLDLIY QQRLTQNPTS IDTHLPRSKK LPFDTQSLRD LVVRAGVVTR ALKEVVRKAE GVANGSEENT HPSHPPFSRI FETPSHDDIS NFETSCIS
|
| |