Gene ANIA_00803 details

Gene Information

Locus tag: ANIA_00803
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001308
Strand:
Start bp: 2378732
End bp: 2382042
Gene Length: 3311 bp
Protein Length: 1066 aa
Translation table:
GC content: 53%
IMG OID:
Product: Fibronectin type III domain protein (AFU_orthologue; AFUA_1G14620)
Protein accession: CBF88752
Protein GI: 259488915
COG category:
COG ID:
TIGRFAM ID:
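
The listed gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `span_length` is hypothetical and used only for illustration.

```python
# Coordinates copied from the gene record.
# Assumption: both positions are 1-based and inclusive.
START_BP = 2378732
END_BP = 2382042


def span_length(start: int, end: int) -> int:
    """Return the length in bp of the closed interval [start, end]."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)  # 3311, which agrees with the listed gene length
```

Under this assumption the computed span agrees with the 3311 bp given in the record.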


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.154563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTCT TCTTGGCAGC CAGTATCCTA TGGGCTCTGA GCTGGTAAGT GTTGCCGTGT 
TCCTTTGTTA CTTCCCATGG TATTGACTTT CTTGAAAGGT TACTATACCG TGCATGGCAA
GTTTGTCAGA CTCCCAATGA GGTATTGATT GAGAAGCTTG GTTTGGATAT CCCTCCCCCG
CCAGAAGTTA CGCTGGAAGA GATAACGGCG CGAGGGATAC GGATCGCCTG GAAGCAGCCG
GAATTCCACA ACTCTATCCA CAAACATATC ATTCAAGTAA ATGGCGCTAA AGGTGGGTTG
TACCGCACCT ACGCCTACTC GCAGCTGACC TGGTCTGTTT CATGACAGTT GGCGAATCCA
AACGAGCGGA AACCGCAGTC GAAATCCTGA ACCTGACTCC TGGGAGCATA TACCACATTT
GTGTCTTGTC CATTAGCGCA GCAAATTTTC AGACCCCCAG TGCAGTTATT CATGTGCGCA
CTAAGTCACT TCCGGCATCC CAAGCCGGAG ACAATGCCTC AGCAACCGGT CCTACGATTC
GAGCGTCAAT TCCCCGGTCA ACAGCCGGTC TCCCTGCTCC TTCGGCGCCG TTAATGGCCC
GCGAACATAG TGGCGGCCCA CTACAGAAGC GTTCATCTGT TGGGCGCCGA CAGTCCCCTG
CGGCCGGTGC AGCAGACATT TCGCAAAATC ATCCGGACAA TACACTGAAC AGCGCTGCAT
CCTACGATCA GAGCGAAGAT ATATCCCTGC TAGCGGACCG ATTGAAGAAT CTGCAACACG
ACAATGATGC TGTAGAGAAG CAGACTCTCG AGGAGGAGGA AGAACATATC GCATTATTGA
AGGATCTGGA GAAGCAGCGA GATGACTTGA GGAAACGGGT TAAGGAGAAA GACGAGGCTA
GCGGCGACCT CAAGAAGCAT GTCAACAAGC TGGAAAGTGT TAATCGGACG GTACAAAGCG
AAAAGGCAAA ACGTGAGAGG GTTCTCCAGC AAAAGGAGGC TGATCGCAAG AAACGGAGGA
ACGATCTTTT ACGCTGGCAA GAACAAATGC CGCAGATTTC TGCGGATACG GCGCAGGCTA
GGCAAGAGAA GGAGCGCCTC GAAGAGGAGG GAAAGAAACG CGCGGACGAA GTTAGAGAAA
AGATTGCGAA GGAGCAGGCC GAGATGAGGG CAATCGATGA GGAAATTCAA GATAAGGGTG
GACGGGTCAA GAAACTTGAG GACGAAAGGA AAGGGTACAC GGATGAAGAT GGAGAGGATG
GTAATGAGCT GGATCGCATT GACAATGAAC GAGCCCGGCA GTGGGAAATC AAGCTCAGCC
ATTTGCAAGC ACGCTATGCT ACGCTTGTAA ACCTCCATAC GCAGGCTCAA CAACAATACC
AAGAGGCACA GGAACGCTTG AAATGGTTGA CCTCGCAGCG AGCCGGTACG GCGCCGTTCT
CTCTGCCACC TATGGATCTA GAACTACCCT CCGGGGCAAC TATCCGCCCA CCCCGGCGCC
ATCGCAGCTC GCTCAACAGC AATGTCTCGT CGCCAATGAA TTTCCCTTCT CTAGAACCAT
TTCCTACTAG CGTGACATAC AACCCTCCGA CCACGGGCTC GCCTACATTC GCACCAGCCT
CGTCTTTCTT TAACATTAAT AATGGAATGA CCCTCCCTGG ATTGACCGGT GAGCCTGAAA
GTTACCGTAG AGGTTCCGAT TTCTCAATCA CGAATCCTCA GATGAGCCCT CGAGCGGACG
CTCTGCTTCC GTCTGATCTA CTTGGCGATG AAGAATCACC CGAATTGCCG CGGCCTGTCC
TCAGAAGTCG ATTCTCGAAT ATTGAGCAGG ATACGACTAA GCGTGACAGT TTCCCCGCTG
ATCCATCTTC ACCAGAATCC TCTGGCAGCA AACCCGCCAG TCTCTTGGGT AGTCCTGAGG
AAGTACAGAA AGGTCCTCAG GAAGCTAATT CCCAGACTGC ACCCTCGGGA GATGCAGAAG
AAGCACCGAA AAGTGCCTCA AGACGCTTGT CTGGTCTTTT CAACTTCAAT CGTCCGCGTG
GTAAAACGCT CGCAGAAGAC CCGCCTCTTC TTGGAACATT GAAACAAGGG CAAAGCCAAT
CATTTCCAAG AGAAATGGAT GAAATAGAGC CAATTGGGCC CAGGCGTCGT CGCTTGAGCT
ATACCACCAG CTGGGCTAAT CCAATGTCTT TGCTGCCAAG AACCCACACA GCCGGAGCCA
CTCCAGATAG CTCATCTGAT CATCTGCCGT CCAGACGTAC CGCTATCTCC AGCATATTCT
CACCAAGCAG ATTCGGATTT GGAAGTGGCA GTGGTTTGTC CAAGGGCGAG GGCGCTGATT
CTGTTTCAGG GTACAACCAG TTCAGTCCTA GGCACGATCC GATTGACCCG TCGTCGATCT
TAGGTACGGT TCGGAGGAGG GGTTCATTGT CTCCCAGGCC ATCTTCCACA TTCTCATTTG
ACAACTTACT CCCGCATCCC TCAACAGATA ACCGTCACTT TGGTTGGCCT TCTGCTGATA
AGCCTGGTCA CCGAAGCCCT CTCGGTTTCG ACTGGACATC ACCGTCCACG TGGTCGAGAA
CTCAGTCTCG GCGCCCTTCT ACGACTCAAT ATGGATCCTC GGGCCATCTT CCTCTGGGTT
TTACAGCTGA ACCTGACTTC CTTGATGACT CTTTCGAGAG GCAGGGTCGG CCACTGCAAG
CCCCGATTGG TACGCGGCCT TCGTCTTCGC ATCGACCAAT TACTCCAAAA TTGAATCCGG
CAGCGCCTAC CTTCAAAACC ATATTCAGGA GCAAGGGCTC AGAGAAGAAG GAGGATAAAG
AGAAGGGCCA GACTACTGAG GAGGCGGACA CGTCTTTTGA CATGTCCTTC GACCATGGCT
CTCCCTCAGA ATCCCGCGCA TCGAGAGACT CTCGATCCCT GTCTAGCTTC CCCGGAGACT
CATACGAATC CCTTGAGCGC ATGCCATCTG CAACGTCTGC AGAAAACGCG AGCTCGAAAG
AATCCTTTAT CCGAAAGATC ACCCGGAAAG GCAGCTCCAG CAAGTTTGGC TCTTGGAAAG
ACCGCTCCGG TTTGTTCTCC AGGAAGAGCG ATGCTTCGCA AGGCGACGTC GATGAAGAGG
GCGATAGCGA GGCACAACTG GCTAAGAGCA TCGACAGCAC TGTGTCCAGC GCGCCTAGCA
GGAGCAGCCT TAGTTTTTTC AGCCGCAAGT CGAAGAAATC CGACAAGGCA GCTAGTGAGA
CGAGCGAGCG GCCTAGCGAG CGGATTAGCG AGTATGGCGA CGATGAGATA CCGGAAGAAG
AAACTATCTA G
 
Protein sequence
MALFLAASIL WALSWLLYRA WQVCQTPNEV LIEKLGLDIP PPPEVTLEEI TARGIRIAWK 
QPEFHNSIHK HIIQVNGAKV GESKRAETAV EILNLTPGSI YHICVLSISA ANFQTPSAVI
HVRTKSLPAS QAGDNASATG PTIRASIPRS TAGLPAPSAP LMAREHSGGP LQKRSSVGRR
QSPAAGAADI SQNHPDNTLN SAASYDQSED ISLLADRLKN LQHDNDAVEK QTLEEEEEHI
ALLKDLEKQR DDLRKRVKEK DEASGDLKKH VNKLESVNRT VQSEKAKRER VLQQKEADRK
KRRNDLLRWQ EQMPQISADT AQARQEKERL EEEGKKRADE VREKIAKEQA EMRAIDEEIQ
DKGGRVKKLE DERKGYTDED GEDGNELDRI DNERARQWEI KLSHLQARYA TLVNLHTQAQ
QQYQEAQERL KWLTSQRAGT APFSLPPMDL ELPSGATIRP PRRHRSSLNS NVSSPMNFPS
LEPFPTSVTY NPPTTGSPTF APASSFFNIN NGMTLPGLTG EPESYRRGSD FSITNPQMSP
RADALLPSDL LGDEESPELP RPVLRSRFSN IEQDTTKRDS FPADPSSPES SGSKPASLLG
SPEEVQKGPQ EANSQTAPSG DAEEAPKSAS RRLSGLFNFN RPRGKTLAED PPLLGTLKQG
QSQSFPREMD EIEPIGPRRR RLSYTTSWAN PMSLLPRTHT AGATPDSSSD HLPSRRTAIS
SIFSPSRFGF GSGSGLSKGE GADSVSGYNQ FSPRHDPIDP SSILGTVRRR GSLSPRPSST
FSFDNLLPHP STDNRHFGWP SADKPGHRSP LGFDWTSPST WSRTQSRRPS TTQYGSSGHL
PLGFTAEPDF LDDSFERQGR PLQAPIGTRP SSSHRPITPK LNPAAPTFKT IFRSKGSEKK
EDKEKGQTTE EADTSFDMSF DHGSPSESRA SRDSRSLSSF PGDSYESLER MPSATSAENA
SSKESFIRKI TRKGSSSKFG SWKDRSGLFS RKSDASQGDV DEEGDSEAQL AKSIDSTVSS
APSRSSLSFF SRKSKKSDKA ASETSERPSE RISEYGDDEI PEEETI
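
Both sequences are printed in blocks of 10 residues, with up to 60 per line. A minimal sketch of how such a block could be normalised before further analysis is shown below. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the sample is only the first line of the gene sequence, so its GC fraction is not expected to reproduce the 53% reported for the whole gene.

```python
def strip_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
sample = "ATGGCGCTCT TCTTGGCAGC CAGTATCCTA TGGGCTCTGA GCTGGTAAGT GTTGCCGTGT"

clean = strip_blocks(sample)
print(len(clean))          # number of bases in the sample line
print(gc_fraction(clean))  # GC fraction of the sample line only
```

To check the whole record, the full gene sequence would be passed through `strip_blocks` in the same way, and the resulting length compared with the listed gene length.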