Gene Information |
Locus tag | ANIA_03665 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 3386591 |
End bp | 3389954 |
Gene Length | 3364 bp |
Protein Length | 1053 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | sulfate transporter family protein (AFU_orthologue; AFUA_4G12440) |
Protein accession | CBF75657 |
Protein GI | 259481799 |
COG category | |
COG ID | |
TIGRFAM ID | |
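The coordinate and length fields above can be cross-checked with simple arithmetic. The following Python sketch is illustrative only and is not part of the database record. It assumes 1-based inclusive coordinates (which agrees with the listed gene length) and assumes the gene span runs from the start codon through the stop codon, so that whatever remains after the coding bases would be intron sequence. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases for a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
START_BP = 3386591
END_BP = 3389954
PROTEIN_AA = 1053

span = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_AA)
non_coding = span - cds  # intron bases, under the assumptions stated above

print(f"gene span: {span} bp")
print(f"coding bases (with stop codon): {cds} bp")
print(f"remaining non-coding bases: {non_coding} bp")
```

A nonzero remainder is consistent with the "Is gene spliced | Yes" field, although the exact intron boundaries cannot be recovered from these numbers alone.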
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTTC TGGGGCGCCG GCAGCGCGCA GACTCGCAAG CCTCCCACCC TCCACCAATA TCAAATGACA ATGTTGCTCC CGACACAATC GATACCCTCA CTGGGCCTAG TTGCTTTACT CGCGTTCCGG AGCCGGAACC CCAAGTACCT GGAGGGACAG GGACGAGTTA CAGGACACCG AGCCGGTCAT TTTATCACCG CTCTTTTCAT AATGCTTCCG GTACGCGCTT ATTGAGAACC TCTGGTCATG TTTGGTAGCT AGCCGTTACT GACCGAGCCT AGACCCGGCT CACTACTCCT CTCACGGGCT TCGAGAACAG ACTGCGGAGC TGGCTTCCCT TGCTTTGACC CGGGAACCGA GGGCTCTACC ACAAGCTATG GAGATATTTC GACCTGTGGA TGGATCCGCC GAGTCGGTTG CGCCGTTGTC GCTGTCAGGG CCAGATAGGA CGCAGGAGAT GGGACATACG CCGACGGTCC ATTCTGGGGG CTCGTCGGTT TTGACGGCTC TGATCCGTGA TCCGTCCTCG AGTGTCAGCA GAGAAGGCCA AGAGAGAACC GGGGACCGAA ACGAGGAGGG TGAGCGTGCC CATGCAGTTG GGCCGTCTGC TCCGGGCAAT GCCGATGAAG AAGAGGACAC GAGCACGGAG TGGACATCTT TGCTGGCCAA GACGCCTTCG AGACCATCTA GGAACTACGG GGGAGCGGGT GACGTTGAGA GTCAAGTGTA TGCTCCAAAG GAGACTCAAA GGACTCTTCC GAGTGCTCTT ACGACAGTAT CGACGTGCTT CCAGGTACTT GCTCATCCGA AATCGTGGGA TCGCCGAACG GTATGGAGGC AAGGCGTTGT CCGTCCGTTC AGTCTGCTGC CCTCGGTATT TCTTGGTCTA CTCCTTAACA TCCTGGATGC TCTCTCATAC GGGATGATCC TCTTTCCGTT GGGCGAGCCG ATCTTCTCTC ATCTAGGATC AGATGGTATA TCGATGTTCT ACGTGAGTAC GATCATTGCT CAGCTTGTCT TCTCCTGCGG GGGCTCGATA TTTCGCGGGG GAATAGGAAG CGAGATGATT GAGGTCGTCC CATTCTTCCA CCAGATGGCG TTTACAATAC TAAACCGCGT TGGGCAAGAT AATCCGAAAT CGGTGATAGC CACCACGATC CTAGCATTCT CGGTCAGCTC CGTCCTTACA GGCCTGGTAT TTTTCCTCAT GGGGACGTGC AAGATAGGCT CGCTTATTGG GTTCTTCCCC CGGCATATCC TTATCGGATG CATTGGGGGT GTAGGCTTCT TCCTTATTCT GACTGGTCTC GAAGTCTCAG CTCGATTGCC TGGGTCTTTT GAGTTTGATA TACCTACCAT GCAGAAGCTT TTTAACCTGG CTGCACTGGC CCTCTGGGTA ACGCCGCTGT TGCTCGCGAT TGGTCTTCTT GTCCTCAAGC GTTTCGTCAG ATCTAACTAT CTAGTTGGTG CTTATTTCAT TGTCGTCGCC TTATGTTTCT ATATCGTCAA GTTCATTGCG CATATTCCCA TGGATTCTTT GCGAAACAGC GGCTGGGTGT TTGATGCCCC CTCATCATCA AATCCCTGGT ACCACTTTTA TACCCTCTAT GGTAAGTCGA CTGCCACTGC TAAGTCCGAT CCTGTCTGAC TAGTTTAGAC TTCTCAGCTG TCCACTGGCC CGCGTTTGTC GACACCATCC CAGCAATGTT TGCGCTCACA TTTTTCGGAA TTCTGCATGT TCCAATCAAC GTGCCGGCGC TGGGGATCTC CACCGGCGAA GATAACCTGA ATGTCGATAG 
GGAGTTGATA GCTCACGGTG TGACAAATGC TCTTTCAGGG TTTGCCGGTA GTATCCAGGT AATATCGCCG TCAAATTTCT GAGGGTCTTA GCCTAACATG TTCCAGAATT ACCTCGTATA TACCAACAGC TTGCTTTTCA TCGACAGTGG TGGAAACTCC CGCTTAGCTG GCGTTATGCT TGCAGGAGCT ACTGCGGGAA TTATGCTGGT TGGACCTGTG ATAGTGGGGT TCATCCCTGT AATGGTCGTA GGGGCTTTGA TTTTCCTACT GGGGATTGAA TTGATGGAAG AGGCGCTGGT CGATACATGG GGGAAGTTAC ATCGACACGA GTATTTGACG GTGAGTTAGA GTCCGCAGTT TATGTGGTTC CAGCTGATTT TCTAGGTTGT CATTATTGTT GCGACTATGG GCGTTTGGGA TTTCGTTGCG GGTATTTTGG TAGGCATTAT TCTGGCGTGT TTGAGCTTCG TTGTTCAAAC ATCGCGCAAA TCTGCCATTC GTGCCACGTA CTCGGGCAAA GTTGCTGGGT CCACAGTTCG TCGGCCCCCG ATCCAACAGC GATATCTGAA AGAGGCTGGG CAGCAGACCT TGATCCTCAA GCTCGGTGGT TATCTCTTTT TCGGAACGAT TGTGGATGTG GAGAACACGA TGCGAGGACT GATTGAAGAT GAAGCATTCA GTAGACTTCC AATCAGGTTC ATCATATTGG ACCTTTGTCG GGTATATGGC GTTGACTTCT CAGCGGCGGA GGCGTTTACG CGTATCAATC GAATCCTGAA GAAGAGGAAC GTGCGGATGA TGATCTCCGG GCTCGATGTG GGGGGCGATG TCGGAAAGAG CCTCCAGAAT GTGGGACTGT TCGAGCCAGA GCTCACTGTG CGGATATTCG AGGACTTGAA TTCGGCCCTG GAGTACTGCG AGAATGAGTA CCTCAATGTC TTTTACAGCC ATCGAGAAGC ATTGCTGAGG AGGAAGGCCG CTCCTCAGAA CCTCGAGGTT CCAGCAATCC AGCACCGTTC TCAGTCAGCC GAGGGCTTTG TAGGCTCCCC ACGCCATCAA TATCTCCAGC GCGCAGCCAC AACGACACTC AGCGAAGACG AAAGCGCGAT CCTGCCCCCA GCAGCATGGT CGGCGATGCG CCAACCCCTC CCTCTCCTCC TTCAGACCTT CCAAGGCCTA ACCTCGCGCA ACGAAGATTT CTGGTTCGCT GCGTGCCCAT ACTTCGTCCG CACGACCTAC GCAATGGGTA CAACCCTATT TCGAGAAGGC GACGTCCCCA ACGCATTCTA TCTTCTCGAG TCTGGAATGC TGCGCGCAGA GTACGATCTC CCGCAGGGCC GATACTTTGA GCTCATTGTC GCCGGACGGC CGTGCGGCGA GCTGCCATTC TTCAGCGAGA CGCGGCGGAC GGCGACGGTG AAGGCAGAGC AGGATTGCGT GACGTGGAGC CTTGATGCAG AGAATTGGAA GGCCCTGAAA GAGGAGGAGC CGGATATCGC GCGCGAGTTG CTGACGGTCA GTCTAAAGCT GACGACGGAG CGGATGGATA GTATTACTTC GTGA
|
Protein sequence | MGVLGRRQRA DSQASHPPPI SNDNVAPDTI DTLTGPSCFT RVPEPEPQVP GGTGTSYRTP SRSFYHRSFH NASDPAHYSS HGLREQTAEL ASLALTREPR ALPQAMEIFR PVDGSAESVA PLSLSGPDRT QEMGHTPTVH SGGSSVLTAL IRDPSSSVSR EGQERTGDRN EEGERAHAVG PSAPGNADEE EDTSTEWTSL LAKTPSRPSR NYGGAGDVES QVYAPKETQR TLPSALTTVS TCFQVLAHPK SWDRRTVWRQ GVVRPFSLLP SVFLGLLLNI LDALSYGMIL FPLGEPIFSH LGSDGISMFY VSTIIAQLVF SCGGSIFRGG IGSEMIEVVP FFHQMAFTIL NRVGQDNPKS VIATTILAFS VSSVLTGLVF FLMGTCKIGS LIGFFPRHIL IGCIGGVGFF LILTGLEVSA RLPGSFEFDI PTMQKLFNLA ALALWVTPLL LAIGLLVLKR FVRSNYLVGA YFIVVALCFY IVKFIAHIPM DSLRNSGWVF DAPSSSNPWY HFYTLYDFSA VHWPAFVDTI PAMFALTFFG ILHVPINVPA LGISTGEDNL NVDRELIAHG VTNALSGFAG SIQNYLVYTN SLLFIDSGGN SRLAGVMLAG ATAGIMLVGP VIVGFIPVMV VGALIFLLGI ELMEEALVDT WGKLHRHEYL TVVIIVATMG VWDFVAGILV GIILACLSFV VQTSRKSAIR ATYSGKVAGS TVRRPPIQQR YLKEAGQQTL ILKLGGYLFF GTIVDVENTM RGLIEDEAFS RLPIRFIILD LCRVYGVDFS AAEAFTRINR ILKKRNVRMM ISGLDVGGDV GKSLQNVGLF EPELTVRIFE DLNSALEYCE NEYLNVFYSH REALLRRKAA PQNLEVPAIQ HRSQSAEGFV GSPRHQYLQR AATTTLSEDE SAILPPAAWS AMRQPLPLLL QTFQGLTSRN EDFWFAACPY FVRTTYAMGT TLFREGDVPN AFYLLESGML RAEYDLPQGR YFELIVAGRP CGELPFFSET RRTATVKAEQ DCVTWSLDAE NWKALKEEEP DIARELLTVS LKLTTERMDS ITS
|
| |
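The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check such as the GC content field. The sketch below is a minimal illustration and does not reproduce how the database computes its values. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence shown above (not the full gene).
example_blocks = "ATGGGCGTTC TGGGGCGCCG"

seq = clean_sequence(example_blocks)
print(len(seq), "bases")
print(f"GC content of this fragment: {gc_percent(seq):.1f}%")
```

Applying the same two helpers to the complete gene sequence would give values that could be compared with the "Gene Length" and "GC content" fields, assuming the database counts bases the same way.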