Gene Information |
Locus tag | ANIA_03615 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | - |
Start bp | 3531246 |
End bp | 3534444 |
Gene Length | 3199 bp |
Protein Length | 1049 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | CBF75772 |
Protein GI | 259481858 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
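The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene span includes one stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def noncoding_length(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases in the gene span that are not accounted for by codons.

    Assumes one stop codon is counted in the gene span, so the coding
    portion is (protein length + 1) * 3 bases.
    """
    return gene_len_bp - (protein_len_aa + 1) * 3


gene_len = span_length(3531246, 3534444)
print(gene_len)                          # 3199
print(noncoding_length(gene_len, 1049))  # 49
```

Under these assumptions the 49 bp remainder is consistent with the gene being flagged as spliced: that many bases within the span are not translated.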
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.391595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0454206 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTAA GTACGCTTTC CTCGATTCTA CAGCCCTAAC AGCTTCTTCT CCCAGCAGAG CGCCATGGCT ACCACGGAGC CCACCCACGG ACAGGCCGAC AGTGTTGCCA GTGTCGAGAG CCCGGCCCAG GATAACCCGG GCCCCAAAGA CCATTCTGAC TACACTCGGG CACTCGAGAA GCGACTCAGC GAGTTGGAGT CACGGCTGTT GAATGTCGAG CTACACAGCA AAGAATCGGT GAGCAAGCGC GTAATCAGCG GCGACAACCC TGAAAGCGAA CTCGAGAATG CGGCAAAAAG TCCTTCAGAC GACGACGCAG GGCCTGAACC AGAACAGCTC CCTGTCGTGC GAGAAATACG CAGGTTGAAC TGGATCAACT TTGTAAACCG TTTTCCGGAC CAGAAGGATG CAGCCCTCAT TGAACTCCTC ATGGCCCCGC CGTCGTTAGA AGATGAAGAA AAGAAGGACA GCCTCTTCTG CGCAAAGCTG CAACTGGTCG GCGCCGAGGA AGCGCAGCAG CTTATGGCAA CAATGCGAGA GCGTAATGTC TTTCGATCGG ATGCGTACCT TCAATCTGTC CGCATTTCAT CCATACCACT GGCCCGCGAG CTTGTGGATA TCTTGGGCTC GGACGATGAT ATCACAGCCC CCCTAATCTT TAGGCGACCA TTTGCACCAT TGATCTATCA TATCGATGAC TTCAAGAAGA AGCTGGCCAA GTTGGAAGCA GAGCTCGAGA AAACGACGGA AGAGGATGTT TTGAGCATAT TTCCCTCGAG CGTGGTGCCT ACGCAGACTG CGATGACTGC CTCCGAAAAG GCACGCGCTG ACCGGACAAC TCTTGCTGGT GATTTTCGAT ACCTGATTCA GTTCCTGGAG CGGGAAATAC TGCCGTTCGC AAATCTTTTT GACAGCTCCA GTAAGCCTGA GACTTCGAGC AGGCACAGAA AAACATGTTT CCGCGATTTG TGGCATCTGT TTAGGGTTGG CGAGTATATC TACAACCCGG CTGCAACTTT CTCTTTCCAA TCGAACCCCA AAGCAGTTGT TAACGAAAGC GGTGACCAGA AGCTATGGAA GCTGTATCGG AAACGAACAG TTGGCAATGA TTTTGAACTG AAATGTTACC GCATTGACCA TAATGGCGAA GCCTATGTCT GCATTCCGAC TACATTCACT ATCAGTTATT TCAAAGGGGA ACAGGACATT GTCAACCTGA CGGTCTACCC GCTTCGGTTT GCAGAGGACC ACAAAGCCCT CCTACAGGTA TACAAGGAGT CGGGCCAAAA GTGCAAAGAG TGCATCGAGG CCAAGTTCCT CTTGCACACC GGATGGGCTC TGACTCCCGA GTCCCAGAGC TCGTTGCAGT ACATTGCCAG CGATGTTATT CTCGACGCTG GTGAGGCGAT GAAACTCCAT TTCAACTGGA AGTTTGATTC AAAATACCCC AGTACAAAGG AGTGGGATAG AGGCTACCCG TATTATTTGT ATCATGTTTT CTACTGGAGG TTGAAGGACC AAAAAGCTGT CTCCTCGAAC ATCCAAAATG CCTGCTGGGT GGACGACTGG GTGGGCCGCA AGGAGAAGAT TGACTATTGC GAGCGTGTGG ACTCTTTCCT TTCACACGGC ATCAACGGCC GGGAGAAGGA ATATCAACTC AACGACGATG ACATCGTGCT ATTGCCACGT CGGCTTTTCG CCTATGTCCT CCAAGAGCGA CGGTTTGTGG CCGTGGAGAC TCGAAACCTC TCAAGCGTGA AGGACTCTAC AAGTGGCACA TTTGAGAACT TGGTGATCAA 
TCCGGACCAT ATGAGCTTGC TTCAGTCCCT CGTGCATTCC CATTATATGC GCAAGCAGAT ACAAGATTCC GGTCGATACA GTGTCAACCA GGACATTGTC CACAACAAAG GACGAGGCCT GGTTATCCTC CTTCATGGCG TCCCGGGTGT TGGCAAGACC TCGACAGCAG AAACCATTGC CCATCAATGG AAGAAGCCGC TATTGCCGAT AACCTACGGG GACCTCGGAC TATCGCCATC GAACGTTGAG AGCAAGTTGA AGGACGTTTT TCGCCTTGCT CAGCTTTGGG GCTGCATCCT TCTCTTGGAC GAGGCAGATG TCTTTCTCTC CGAGAGAAAA GCAACGGATC TCGAGAGAAA TGCTCTTGTT TCCGGTGGGT TTCCTTCACA TGGTGCAGAA TCATCACTAA CAGTCAGCGC TTTCACAGTC TTCCTGCGGG TCCTGGAATA TTACATGGGA ATCCTCTTTC TCACCACCAA CCGCGTAGGC ACTATTGACG AGGCCTTCAG GTCCCGCATT CACATTAGCC TCTACTACCC TGACCTTGGT AAACGCGAGA CCAGGAAAAT CTGGAAGTTG AATCTCGACC GCTTAAGAGC CATCGAGGAA GAGAGGGCTG GCACAACTGG CAAACCGGCT CTGACAATTG ACGTCGATGG TATAAAGAAC TTCGCTCTTG AACACTACAA GTCAAGTCAA CAAGGTAAAG GCAGGTGGAA TGGAAGGCAA ATCAGGAACG CATTTCTCGT AGCGTCAGCG CTCGCCCGCT ACGAGAAAGA GCACCCGGAC TCAAAGTCCC AGCCCTCGAT CAACACGTCG CCGTACAACA CGTCGTCTTA CGACATTTCA GCAAGACACT TCAAGGTTGT GGCAGAAGCT GGGGTGGGCT TTGACAAGTA CCTCTACGAA ATCAAACGCA AGACTCCGGG AGAGCAGGCC CTCTTGCATG GTTATCGCAT TGACTCGGTC ACCCATAAGT CGCCTCAAGA GCCTGGCAGC CAGTTTACTG CAGGCCAGGG GGGTATACCT CCAAGCAACC AGGGTCTCTT TCCCGGTCAG TCGAGTCCAT CCCTACAGGG GCATTATGAT GGCCTCCAAG CCCATAATCC TGGATCCCGT CAGTCTCAAG CTCAATTTTC TCAGTATGGC TATACAGCGG ACGGCTTACG CAGCCAGCCT TTCGTACCGC ACGGCCATGA TCCTACTTTC AATCGTTCTC CGCAAACGGG ATACGGCCAG GAGTTTGGAA TGGGGATGGG AATGAGGAAT GAATATAATC TTCAGCCACC TCCAGTGACT TCAAGCAAGC ATTCTTTTAA AGGTTCTCCC AGGGGCACTC CAAGTGGATT TTCCGGCACC GCTGGGCATG GAGATGATGA TTCTGATTCT GATGATTGA
|
Protein sequence | MSQSAMATTE PTHGQADSVA SVESPAQDNP GPKDHSDYTR ALEKRLSELE SRLLNVELHS KESVSKRVIS GDNPESELEN AAKSPSDDDA GPEPEQLPVV REIRRLNWIN FVNRFPDQKD AALIELLMAP PSLEDEEKKD SLFCAKLQLV GAEEAQQLMA TMRERNVFRS DAYLQSVRIS SIPLARELVD ILGSDDDITA PLIFRRPFAP LIYHIDDFKK KLAKLEAELE KTTEEDVLSI FPSSVVPTQT AMTASEKARA DRTTLAGDFR YLIQFLEREI LPFANLFDSS SKPETSSRHR KTCFRDLWHL FRVGEYIYNP AATFSFQSNP KAVVNESGDQ KLWKLYRKRT VGNDFELKCY RIDHNGEAYV CIPTTFTISY FKGEQDIVNL TVYPLRFAED HKALLQVYKE SGQKCKECIE AKFLLHTGWA LTPESQSSLQ YIASDVILDA GEAMKLHFNW KFDSKYPSTK EWDRGYPYYL YHVFYWRLKD QKAVSSNIQN ACWVDDWVGR KEKIDYCERV DSFLSHGING REKEYQLNDD DIVLLPRRLF AYVLQERRFV AVETRNLSSV KDSTSGTFEN LVINPDHMSL LQSLVHSHYM RKQIQDSGRY SVNQDIVHNK GRGLVILLHG VPGVGKTSTA ETIAHQWKKP LLPITYGDLG LSPSNVESKL KDVFRLAQLW GCILLLDEAD VFLSERKATD LERNALVSGG FPSHGAESSL TVSAFTVFLR VLEYYMGILF LTTNRVGTID EAFRSRIHIS LYYPDLGKRE TRKIWKLNLD RLRAIEEERA GTTGKPALTI DVDGIKNFAL EHYKSSQQGK GRWNGRQIRN AFLVASALAR YEKEHPDSKS QPSINTSPYN TSSYDISARH FKVVAEAGVG FDKYLYEIKR KTPGEQALLH GYRIDSVTHK SPQEPGSQFT AGQGGIPPSN QGLFPGQSSP SLQGHYDGLQ AHNPGSRQSQ AQFSQYGYTA DGLRSQPFVP HGHDPTFNRS PQTGYGQEFG MGMGMRNEYN LQPPPVTSSK HSFKGSPRGT PSGFSGTAGH GDDDSDSDD
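The sequences above are printed in space-separated blocks of ten, so the whitespace has to be stripped before any downstream use. The following is a minimal sketch, with hypothetical helper names `clean_sequence` and `gc_percent`, of how such a field might be normalised and its GC content recomputed; the short input string is just the first three blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = clean_sequence("ATGTCAGTAA GTACGCTTTC CTCGATTCTA")
print(len(first_blocks))                   # 30
print(round(gc_percent(first_blocks), 1))  # 40.0
```

Run on the full gene sequence, the result can be compared against the GC content reported in the Gene Information table, and `len(clean_sequence(...))` on the protein field can be compared against the stated protein length.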