Gene ANIA_04111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_04111 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001302 
Strand
Start bp2001569 
End bp2002829 
Gene Length1261 bp 
Protein Length381 aa 
Translation table 
GC content55% 
IMG OID 
Productalpha-ketoglutarate-dependent taurine dioxygenase (AFU_orthologue; AFUA_3G01010) 
Protein accessionCBF74685 
Protein GI259481298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCAG CCCCAATCGA CCCGAGCATC ATTAATGTTG CCGAGCCGCG AAAGGACACC 
CTCGGTCTTC CGGCCACGGC CCGCGAGAGA CTCGAAAAGG GCACAGTTGA CCTCTCGAAT
GGATATCCCT ATCGCCCTTC TCGTCCTCTC TACTTAGACG ATGTTTACCG GATCCGCGAC
TATGACCGCC AACACATTGA TCCCGGTACT CGTGCCGATC CGGAGAAGAA AGCCCTTCTT
TCCGCTGCGA AGGAAGTGGT CCACTTGACA AAGCACATAG GGACCGAGAT CGTAGGGCTG
CAGCTGAAAG ACTTGACTGA CCAGCAGAAA GATGAGCTGG GCCTGCTGAT TGCCGAACGC
AGCGTCGTGT TCTTCCGCGA TCAGGACATC TCACCGCAGG AGCAGAAGAA ACTCGGCGAG
TGGTACGGCG AGATTGAAGT TCATGTATCA TTCCCATTAC GAAAAAGCAA AGAAGTTACT
GACTACAACA GCCTCAAGCC GCTCAAGTCC CCGGTGTTCC TGGTGTGACG GTCATGTGGC
CTGCTCTGCA GGCAACCGAA ATACCAGCCA GTTTTCGTCG TCCGGGAGGC GCCTCTCGAT
GGCACACAGA TCTTGTCCAT GAGCGGCAGC CAGCCGGCGT GACACACCTT CATAATGATA
CGGTTCCATC CATTGGGGGT GATACCCTCT GGGCCAGCGG TTACGCTGCT TACGAGAAGC
TCTCCCCTGC GTTCCGCAAG ATCATCGATG GGCGGACAGC CGTCTACCGC TCCGCTCATC
CGTACCTTGA CCGTAATGAC CCTGAAGCTG GGCCTAAGTA TGTTGAGCGT GAACATCCAC
TTGTGCGCGT CCATCCAGCA ACTGGTTGGA AGGCACTGTG GGTTAATCGA GCCATGACGG
TTCGCATCGT TGGGCTCGAC AAGGCTGAAA GCGACCTAAT TCTGGGATAT TTATATGATG
TGTTTGAGAA GAATGTTGAT ATCCAGGTGA GGTTCAAGTG GACTCCTCGT AGTAGTGCAC
TATGGGATAA TCGGTATGTT TTGGCTGGCT CCACCTGCAC TGGCATTGCT GACTTCCTAG
GATCACTATA CACAACGCTA GCTGGGATTA TGAAGGTTCT GAGCCACGGC ATGGCACGCG
GGTTACTGCA CTAGCTGAGA AGCCGTTTTT TGACCCCAAG GCTAAGAGCC GCAGAGAGGC
GTTGGGTCTG CTGGGGAAGG AGGAGATTGA GGAGCTGGAG CGATTAAAGC TTGAGCAGTA
G
 
Protein sequence
MAPAPIDPSI INVAEPRKDT LGLPATARER LEKGTVDLSN GYPYRPSRPL YLDDVYRIRD 
YDRQHIDPGT RADPEKKALL SAAKEVVHLT KHIGTEIVGL QLKDLTDQQK DELGLLIAER
SVVFFRDQDI SPQEQKKLGE WYGEIEVHPQ AAQVPGVPGV TVMWPALQAT EIPASFRRPG
GASRWHTDLV HERQPAGVTH LHNDTVPSIG GDTLWASGYA AYEKLSPAFR KIIDGRTAVY
RSAHPYLDRN DPEAGPKYVE REHPLVRVHP ATGWKALWVN RAMTVRIVGL DKAESDLILG
YLYDVFEKNV DIQVRFKWTP RSSALWDNRW DYEGSEPRHG TRVTALAEKP FFDPKAKSRR
EALGLLGKEE IEELERLKLE Q