Gene ANIA_04236 details


Gene Information

Locus tag: ANIA_04236
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001302
Strand:
Start bp: 1604544
End bp: 1606312
Gene Length: 1769 bp
Protein Length: 465 aa
Translation table:
GC content: 51%
IMG OID:
Product: hypothetical protein similar to TAT-binding protein 1 (Broad)
Protein accession: CBF74421
Protein GI: 259481153
COG category:
COG ID:
TIGRFAM ID:
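
The listed gene length can be checked against the start and end coordinates of this record. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; the helper name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start and end coordinates taken from this record.
length = gene_length(1604544, 1606312)
print(length)  # 1769, matching the listed gene length
```

Under that assumption the result agrees with the 1769 bp reported for the gene.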


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.110869
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GATTTACCAG CAACCCAATA CCCCTTTTTC TTCTTGTCTC TACCTCTCTC TAACTATCTG 
TGCTCCTATA TTCTCTAATT TATCGTAGTT CTCCGCAGAC TGTTATCATG TCGACATTGG
AGGATCTCGA CGATCTTGAG CGCGAGGAGA GAGACAAGAA GAAGGAGCAA GGCGATGGCG
GCGATGGCAA GCAACCTGGT GGTGATGGAG ATGCCGAAAT GAAGGATCCG GATGCGAAGA
AGAAAGATGA GGATGACGAT CTTCTAGACG AGGAAATCCT GAATTCAAGC ACAGCGGACA
TTATCAAGCG GCGGCGAATG CTGGAGAACG AGCTTCGCAT AATGAAGAGC GAATACCAGC
GGCTGACGCA CGAACAAAAT ACGATGAAGG AGAAGGTCAA GGACAATCAA GAGAAGATTG
AGAACAATAG GTGAGACTTC TAGCTCTTCT AGATGGCGTG GCGATTTGGA ACTCCACCGG
AAACCAGGAA CTGGGCGCAG AGTTTCCTGC GAATAGGGGC TAGTAGTCCT CGTTCATTTC
TAACATTCAT ATCATAGGCA ACTACCGTAT CTCGTCGGAA ATGTTGTTGA GCTGCTAGAT
TTGGACGTCG AAGCTGAAGC TGCCGAGGAG GGCGCCAACA TCGATCTAGA CGCCACCCGA
GTAGGCAAAT CCGCTGTCAT CAAAACGTCG ACTCGTCAGA CCATCTACCT TCCTCTTATC
GGCTTAGTTG ATCATGAGAA GCTTAAGCCT GGTGACCTTA TTGGTGTCAA CAAGGATTCA
TACCTCATTC TCGATACCCT GCCGGCAGAA TACGACAACC GGGTGAAAGC AATGGAGGTC
GACGAGAAGC CTACAGAGAA GTACACAGAT ATTGGTGGTC TGGATAAGCA GATTGAGGAG
ATCGTCGAGG CTATTGTATG GCCCATGAAG GAAGCAGAGA GATTCAAGAA GCTTGGCATC
AAGGCGCCGA AAGGTACTTA TCAAACAGTA TCTGGTGTAA ACAGACTCAC AACTAATACT
GCTCATAGGT GCTCTGATGT ACGGGCCTCC CGGCACAGGA AAGACTCTTC TCGCCCGAGC
CTGTGCAGCA GAAACTAACG CAACCTTCCT AAAACTCGCC GGCCCCCAGC TCGTGCAAAT
GTTCATCGGT GACGGCGCGA AGCTCGTCCG GGACTGCTTC GCCCTTGCTA AAGAGAAGGC
TCCCTCGATC ATTTTCATTG ATGAGCTTGA CGCTGTGGGC ACTAAGCGTT TCGACTCTGA
GAAATCTGGT GATCGTGAAG TCCAACGAAC CATGCTTGAA CTCCTTAACC AGCTCGACGG
ATTTGCCTCG GACGACCGCA TCAAGGTTCT CGCCGCCACC AACCGCGTCG ATGTCCTCGA
CCCCGCCCTC CTCCGTTCCG GCCGTCTAGA CCGCAAGATC GAATTCCCTC TCCCCAATGA
GGAAGCCCGC GCCAACATCC TCCAGATTCA CTCGCGCAAG ATGACTGTTG AGGACTCCGT
TAACTGGGCT GAGTTGGCAC GCAGCACGGA TGAGTTTGGT GGCGCGCAGT TGAAGGCTGT
CTGTGTGGAG GCTGGTATGA TTGCGCTGCG AAAGGGGCAC AGCAAGATCG GGCATGAGAA
CTATGTGGAT GCCATTGCTG AAGTCCAGGC AAAGAAGAAG GATACGAACA TGGGTATCTA
TGTTTGAACA AATTCTTGTA TCCTTTTGTA AGCCTAGATT TGGTTTAACT CTCCTTCCGC
ATACAATGCG ATGTGTATCA ACTTACGTG
 
Protein sequence
MSTLEDLDDL EREERDKKKE QGDGGDGKQP GGDGDAEMKD PDAKKKDEDD DLLDEEILNS 
STADIIKRRR MLENELRIMK SEYQRLTHEQ NTMKEKVKDN QEKIENNRQL PYLVGNVVEL
LDLDVEAEAA EEGANIDLDA TRVGKSAVIK TSTRQTIYLP LIGLVDHEKL KPGDLIGVNK
DSYLILDTLP AEYDNRVKAM EVDEKPTEKY TDIGGLDKQI EEIVEAIVWP MKEAERFKKL
GIKAPKGALM YGPPGTGKTL LARACAAETN ATFLKLAGPQ LVQMFIGDGA KLVRDCFALA
KEKAPSIIFI DELDAVGTKR FDSEKSGDRE VQRTMLELLN QLDGFASDDR IKVLAATNRV
DVLDPALLRS GRLDRKIEFP LPNEEARANI LQIHSRKMTV EDSVNWAELA RSTDEFGGAQ
LKAVCVEAGM IALRKGHSKI GHENYVDAIA EVQAKKKDTN MGIYV
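
Both sequences are printed in groups of 10 characters, 60 per line, so the spaces and line breaks have to be removed before counting residues or computing GC content. The following is a minimal sketch of that clean-up; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character groups."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence and last line of the protein sequence, from this record.
first_dna_line = "GATTTACCAG CAACCCAATA CCCCTTTTTC TTCTTGTCTC TACCTCTCTC TAACTATCTG"
last_protein_line = "LKAVCVEAGM IALRKGHSKI GHENYVDAIA EVQAKKKDTN MGIYV"

print(len(clean_sequence(first_dna_line)))     # 60
print(len(clean_sequence(last_protein_line)))  # 45
print(round(gc_fraction(clean_sequence(first_dna_line)), 2))
```

Applied to the complete blocks, the cleaned gene and protein sequences should have the listed lengths of 1769 bp and 465 aa.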