Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_04435 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001303 |
Strand | - |
Start bp | 2055406 |
End bp | 2058535 |
Gene Length | 3130 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | cell morphogenesis protein Sog2, putative (AFU_orthologue; AFUA_4G07260) |
Protein accession | CBF77530 |
Protein GI | 259482751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTCTA CTTTGGTTCG ACCGGAGGAT AGTCTAAGAA TGCCTCGCTC ATATCGAGAC ACTGAAGAGG AAGACGCGAA CCGACACAGT CCCGAGCCAG CGGCGCCGTT ATCCTCGTCA TCCTCGACCA CCGACGGGTC GAATAAGAGC GAAGATGAAG CCACGTTGGT GAACGGGTCT ACCGGCGATA CCAGCAGCAG GCCGACAAAG TCGCGTGAAT CGCTTACGCC GGAGGAGACA ATCCAACTTG CGCGCCGAGC AGTAGAGAAC GGGATCCAGG AAACGAAACG GTCTTTGGCA GGAAGCGAAG CGGTCGGTGA TGTGGTCAAG CCGAAGCTGA CCATTGATTT GGGCCACTCA AATATCAGTC GTATACCGGA GCCGGTTGTG GATATTATCA AGGATGAAGT TGAACGGTGT GTAATAACAA AGCTCCTGGG ATTCATCGTG CACGAAGCGA ATGAAAGTTG GTCCACATGA TGCTGACTAT GTCGTAGGCT CTCGTTATGG AACAATCAAC TGGTCCATAT TCCTTACCGA TTCGCAGAGT GCTCTCATCT TCGATATCTC AACGTCAGAT CTAACAACTT TCGTGAATTT CCTAGAGGAG TACGTTTCGT CTGATTGATT TCTTGTCGTA GCTTTCTGAC TGGTATGCTA GGTGATTAAA CTGCCATTAC TGGAGATTTT GGATTTGAGC CGCAACAAAA TTAGCCAGCT TCCGGAGGAA ATAAAGAAAT TAACATCTCT CCGAGTTCTG TCGGTGATGC AAAACCGCCT TGACGATTTA CCACTTGGTG TGTCGGATAT GAACAAGCTT CAGATCCTCA AAGTAGCTGG AAATCCACTG CGGTACCCGC TCCGCAAAGT AATAGAAACA TCTGAGGCCG AGATTACCTC CTCAATGATG AGCGATAACG AAAAAGAAGT TGCTTTAACG GCTGAGCTTA AAAGGTATCT TAAGGCGCGG CAACCTATTA ATTCTAATGA TTTTGAATCA AACAACGAAA GGTATGTTTC TGAAAGCCGG CTGCGTTGGG TTTGAGCTAA TTCTCGGCGA CGTAGTGATG GCATCTTGGA CACTCCAAAA CCCGTCAAAC GGGGAGTGAG CAGTCGCTTC CCAGTGATTC CAAGCACAGG CGACGGCGCT GACCCTAAAT CGCCGTCATT ATCACGGCCA CCTCCCATCC CACTTAAATC ACATTATCGC ATTGCTTCTG GACATGGCGG TGCTCTGCAA ATCCTCCAGC GTCCCGGTCC CATACCTGGG GCTAATGAGC GGAATCGAAG TAATAGCGAG GGTATCATTC AAGCCTCTTT TGCAACACGG TCCAAGCGCA TGGGGGTCAT CAGTCGAAAG AACACAGACT TGGGCACACT GGACGAAATG CGGCCATACC GCAATAGTCA CCTCCGTGGT CTCAGTTATG GATCTATTCT TCGGACAAGG CCATCTATCA GCAACTCATC TAGTCCGAGC AGCCCAAGAG AGCGCAGGCG GCCCAGAGAT GGATTTGTGA ATCGGATGTC GAGTCTACCA GAGCACAAGG GCGAGCGGGA AACTGAAAGC CCCATAATCG AAAGCGCCAA GGGAATTCTT TTTGCCCTCT TTCAAGTCCA GTCTCATGTC TATGCCCTTA TAAATGTCAT CAAGCGTGAT GATTACCGGC GAAATAGTCT CGAAATTGTC TTTTACAACG CTTCCACTCA TGTGGACCGA CTAAATGAGG CGCTTGAAAA TGCAGAAAAC TCACGAGCAG ATGATGCGGA GCCAATGCGA GCCTCCTACG AGGCTGTCAA GAGGGAATGC GAGACGTGTA TCATGGCGTA CTCTCACGTC GGAACCCAGT TACGCAACAG CCTTGACAGA ATCGTTGCCA ATGGCGATTC GCGCTACATT CGCTCGCTGA TGCTTATGAT ATTTGGTAGC GTGGTTGAGC TCCGCAACGC ATGTGCATGC TTGGAGGTAC CTGTTGGAAA CCGACCGAGA CCTACTGGAA GGCCACCAGT ACCTGAGATC AGCAGAGAAT CTGCGGATTC AGATAGATAC CACTGCACAA CAGTCACCCC AACGCGGGGA AGAGAACCGT CCTTCTCCAC TCGCCGGCTC CGTAGCGATA CAACAATCTT GCATCCACAA ACTAATATGC ATGGGCCCCT GCCAGCCTCT GCCACATTTC AGTCCGCCGT TAGCTCTCCA GGCTTCGCTC CCACTCCATA CAGCTATGGA GCGAGGAGTC GGTCGAGCAG CAGGTCGAAT CATGTCAACA CATCCGTACC CTCGTCCCTT GCTACTCCTC GGTCAGGCGA GTCTTTCCCT CCGATGCCTA CTTCGGTGAT ACCTAAAATA AACCCCTTGA CCGGCTTGGA CGAAATAGAA GAGGAACGGA TCTTTGAGAA GATTTTCTAC CAACTTACCG CTGCTTACAC CGCTGCTCTC CAAGCGCTAC CTGTTGCCCG CCGTCATTTT GCGAGGAACC TGGAGGTCGC GGAACAGAAT CGGGAATTCG AAGATATCCA GATGCTATGG AACAACCTCA TTCACCGCTG CCGGGTTTGT CTGGAAGTAT CAGAAGCCCT CGGGCTTCGT TTGTCTAATA TGAAAGTGAA GGAACCAGGG GGTGGAATGC GGAACCAACG AGAGTTCTGG CAACTATGCA AAGCCTTTAT GCAGTCTTTT GTCGAACTGG TGACTGACAT GCGCGAAGTA CGGAGCATGC ACTTACTCCC ATCTGAAGTA ATCGTGATTC TTCGGCCTGT GCAGAAGGCC AGCAGGGAAG CAGGTCGCTT GATCGAGGCT TCTCCATGGT CATATCTTGC AGACATGGCT CCCGGGAACG GCCCCCCGGC CATATACGGA CCACCCTTGC CGTCCCAAAC TTCACATCAG CAGCAGCAGC ATCACCATCA CCATCAACAA CATCACCCTC AGCTCAACAC AAGCATGTCA CCTTCTGTTG CGCTTCCCGC AACCCCCCTT AGCGCTGCTC TGGGCCCAGC GGCGCAAGCA ACAATACCTT CGACACCTGC AAGTGCCTAC AGCGATAAGT TCTTCGAGGG TGACGTCTTT CAACGAGCCG ACTCCTTGCT TTCAATGCCA AATCAGGCGC CATTCTTCTC GCGACGGTGA
|
Protein sequence | MISTLVRPED SLRMPRSYRD TEEEDANRHS PEPAAPLSSS SSTTDGSNKS EDEATLVNGS TGDTSSRPTK SRESLTPEET IQLARRAVEN GIQETKRSLA GSEAVGDVVK PKLTIDLGHS NISRIPEPVV DIIKDEVERL SLWNNQLVHI PYRFAECSHL RYLNVRSNNF REFPRGVIKL PLLEILDLSR NKISQLPEEI KKLTSLRVLS VMQNRLDDLP LGVSDMNKLQ ILKVAGNPLR YPLRKVIETS EAEITSSMMS DNEKEVALTA ELKRYLKARQ PINSNDFESN NESDGILDTP KPVKRGVSSR FPVIPSTGDG ADPKSPSLSR PPPIPLKSHY RIASGHGGAL QILQRPGPIP GANERNRSNS EGIIQASFAT RSKRMGVISR KNTDLGTLDE MRPYRNSHLR GLSYGSILRT RPSISNSSSP SSPRERRRPR DGFVNRMSSL PEHKGERETE SPIIESAKGI LFALFQVQSH VYALINVIKR DDYRRNSLEI VFYNASTHVD RLNEALENAE NSRADDAEPM RASYEAVKRE CETCIMAYSH VGTQLRNSLD RIVANGDSRY IRSLMLMIFG SVVELRNACA CLEVPVGNRP RPTGRPPVPE ISRESADSDR YHCTTVTPTR GREPSFSTRR LRSDTTILHP QTNMHGPLPA SATFQSAVSS PGFAPTPYSY GARSRSSSRS NHVNTSVPSS LATPRSGESF PPMPTSVIPK INPLTGLDEI EEERIFEKIF YQLTAAYTAA LQALPVARRH FARNLEVAEQ NREFEDIQML WNNLIHRCRV CLEVSEALGL RLSNMKVKEP GGGMRNQREF WQLCKAFMQS FVELVTDMRE VRSMHLLPSE VIVILRPVQK ASREAGRLIE ASPWSYLADM APGNGPPAIY GPPLPSQTSH QQQQHHHHHQ QHHPQLNTSM SPSVALPATP LSAALGPAAQ ATIPSTPASA YSDKFFEGDV FQRADSLLSM PNQAPFFSRR
|
| |