Gene Information |
Locus tag | ANIA_08198 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | - |
Start bp | 1102396 |
End bp | 1104826 |
Gene Length | 2431 bp |
Protein Length | 509 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | choline transporter, putative (Eurofung) |
Protein accession | CBF74091 |
Protein GI | 259480975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
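The coordinates listed above are consistent with an inclusive, 1-based interval: End bp minus Start bp plus one gives the stated Gene Length. The arithmetic can be sketched in Python as below. The function names are hypothetical and not part of any database API, and the non-coding estimate assumes the gene span runs exactly from the start codon to the stop codon with no untranslated regions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based interval (assumed coordinate convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
span = gene_length(1102396, 1104826)   # genomic span of the gene
cds = coding_length(509)               # 509 aa plus one stop codon
noncoding = span - cds                 # assumed to be intron sequence

print(span, cds, noncoding)
```

Because the record marks the gene as spliced, the genomic span is longer than the coding length; under the stated assumption, the difference approximates the total intron length.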
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.273035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.278378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTAGAT CTGAAGCGTA CAACTTTAAC AAACTGAACA GTAAAAAGCA ACCTTCCTAT CTTGGAGCAG CCATGACAAC GAGTGTCCTA GAGAGAGAAG AAGAAAGCAA GCATGACTCA TCCAGCGGAA CTGACCCAGA TATTATCACC GGCATCCTCC CAGACACAGA TACTACCAAC GCAGATGACC TCATCCTCCA AGCAAACGGC CACAAGCCCC AGCTCCGCCG GCAGTTTAAC TGGCTGTCAG CCCTGGGCCT CGGCTTCTCC ATCACAAACT CATGGGTGGG GTACTTGGTA GGTTGCCCTC CTTCACTTCC ACTCTGTTCG CTATTAACTG CGATACGTAG AGCAACTTTG GCCAGAACAT GAAATATGGT GGTCCGCGCC TTGTCATCGT CGGGCTGTTA CTCGCCTTTA TCGCACAGTC AATTATCAGC ATCGGCCTTG CGGAGATTGG GTCTGCGTTT CCGTCGTCCG GCGGCCAGTA TCATTTTTGC TTCTTGCTGG CCCCGCGCAG ATCTAGGAGG TTTGCGGCGT ATGTCATCGG GTGGATGTCC GTTGTTGCGT GGTGGGTTAC GACGTATGTC TGCCTGCGGC TTGGTCTTAT TTTCTGGTGA GCGGAGGAAC GGTGCGTAGA TATTGATGGA GGCTAGGTCA TCTGGCATCT CCCTTACTGC AACGACTTTA GCCGGGATAA TACACTTCTT TGACCCGAAT TTCAAGGCAA CGCAGTGGCA GATCTATCTG TTTTTCGTTG CGATGGCGAC AATCACAAGC AAATATCCCC TCCCCAAACC CTACCTTTAG TCAAATGAGC TACTAATGCT GCCGGCAGTA ATCCCTGTCT TCGTCGCATC CAAGAAGATC TCAGTTATCT GCCAGATCAC CTTGCTTCTG TCTATCGTCG GCGTAGCAAT GACGCTTTTC GTTCCTACCG GTATGCATGA GCAAGTTCAG CAGCCTTCGT TTCTCGTCAG CCCATCCGCC GGCAAGACAG GCTGGGGATC TGGAGTATCT TGGATGCTGG GCATATGCAA CGCCATGTAC GCGTTTGGGG GAACAGATGG AGGTGAAGAG CTTTGCACAT TCCTGAAGCG TCAAAAGGAC TGATGATGGC CAAATAGCCC TCCACATCGC CGAAGAAATG CAGCACCCCG GCCGCCGTGT GCCGCAGATT ATTATCACGA CGCTGATGAT CGGTCTGGCA ACGACTTTGC CGCTTTTCAT AGCACTACTG CTGTTTTCAA GTGATACAGT GGAGATCATG GACTCCCCTC TTCCAAGCGC AGAGCTTATC CATCAGGCGT GAGTCGCTAG TAATGATCAC TAGTGGTTGA TATAGGCCGC ATCGCTGAAT ATTCATAGGA CTGGCAGTAG GACAGGCACA ATGTTTCTCA TTATATGGAT TCTAATCGTC TATATCTGTA TGTTCTCATA TACTCTTCAT CTCTATCAAG CTTACTGAGA AATGCGCACA GCATGCCTAC CTTCCCAATG GGTAACCAGC GGCCGACTTG CCTGGGCTTT TGCACGAGAT GTAAGTCTGC TCGGCCTTCA CCTTTTCCAC CTCGCTCTTT GATATGGTTT AAGTAATATC TGACCAAAGG ACAGAACGGC ACCCCCTTTC CTCGCTACTT CAGTGCCATC AGTCCTACAT TTCAATTCCC CGTCCGCACT ACGACAGCCG CATTTGTCTT CGTCCTGCTC TATGGGCTGC TCTACCTAGC TTCCACAACG GCCTTCAACT CAATAATCAC CTCTGCAGTG CTGTTTCTGA ATATCACGTA TGCGGTACCG 
CAGGGAATTC TTTTGCTCCA GCGAGCACGA TCACTGCTCA CCAGCAGCTC AAAGACGAAT ACGGATATGA GCATTCTTCC TCCCAGGTAC CTGAGTCTTG GGCCCCTTCT CGGTTCACTC TGCAACGCCT TCTCAATCCT GTGGATTATT GTCCTGGGTG TATTCGTTTG CCTCCCGCCT GAGATTCCGG TGAACCTTGC GTCGGCAAAT TATACGCCTG CTGTTGCAGT CGGAATCTTT GGCCTTATTT TGCTGTTTTG GGGGCTTGGT GGGAGGAAGA TGTTTGAAGG GCCGCAGGTT GATTGGGAGG GGTTGGAGTT AGGGTTGAGG GTGAGGGTGA GAGGAGAAGC CTAGCAACAT GAAAACATGC AGTGCAGGTG GGCAAGCTGG TAAGTGAGAT AACTGCAAAG CTAGATTGAG CGTTTAGGGT ATAGGGCTTG ATATTATACT CGTTTTGCTT GCCATGTCAT CCTACTATGC ATCTTCAAGA GTAAGGGCCT GACCGACTTG GAAGGGAGAA TTAGGTCCAT CACGCAGGCG CAATCTGGCC TTTATCAGTC ATATAGCTGG CTACACCTTT ATAAAGGCAT GTTTGTCCTT TCCTCGATTT GTTTCTGGCG GTCATGTTTG A
|
Protein sequence | MCRSEAYNFN KLNSKKQPSY LGAAMTTSVL EREEESKHDS SSGTDPDIIT GILPDTDTTN ADDLILQANG HKPQLRRQFN WLSALGLGFS ITNSWVGYLS NFGQNMKYGG PRLVIVGLLL AFIAQSIISI GLAEIGSAFP SSGGQYHFCF LLAPRRSRRF AAYVIGWMSV VAWWVTTNPC LRRIQEDLSY LPDHLASVYR RRSNDAFRSY RTDDGQIALH IAEEMQHPGR RVPQIIITTL MIGLATTLPL FIALLLFSSD TVEIMDSPLP SAELIHQATG SRTGTMFLII WILIVYISCL PSQWVTSGRL AWAFARDNGT PFPRYFSAIS PTFQFPVRTT TAAFVFVLLY GLLYLASTTA FNSIITSAVL FLNITYAVPQ GILLLQRARS LLTSSSKTNT DMSILPPRYL SLGPLLGSLC NAFSILWIIV LGVFVCLPPE IPVNLASANY TPAVAVGIFG LILLFWGLGG RKMFEGPQVD WEGLELGLRV RVGKLACLSF PRFVSGGHV
|
| |
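The sequences above are printed in 10-character blocks separated by spaces, so the grouping has to be stripped before the values can be compared with the Gene Length and GC content fields. A minimal Python sketch follows. The function names are hypothetical, and the sample string is only the first two blocks of the Gene sequence field, used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the Gene sequence field.
sample = clean_sequence("ATGTGTAGAT CTGAAGCGTA")
print(len(sample), gc_fraction(sample))
```

Applied to the full Gene sequence field, the cleaned length would be expected to match the Gene Length field and the GC fraction to be close to the listed GC content, provided the field was copied intact.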