Gene PHATRDRAFT_18241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_18241 
SymbolAP3mu 
ID7197467 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp900013 
End bp901410 
Gene Length1398 bp 
Protein Length416 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178028 
Protein GI219112553 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATCCC TGTTCATTCT TTCACCCACG GGAGAAGTAT TGATTGAACG TCACTTTCGT 
GGCGTTGTGA CTTCTCGATC TGTTTGCGAA ACCTTTTGGG AGCGAGCGTC CGAGTCGGTC
AATCACCACG GCGGGCTTTC CTCCGCGACG AGCCTGCTGA CGTCGCTGCA CTACGATAGC
GTACCTCCCG TAATGGAGGT TCCAGAATCT GACCAAGGAA CACTCTACGT TATATCCATT
CTACGCGAAG GCCTCAGCTA TTTGGCCGTC TGTCCAGCCG AAGTCAGTCC GCTTCTTATT
ATTGAATTCT TGCAACGAAT CGCCAATATC TTTGTCGAGT ACTTTGGACC TCCGGCGGAC
GAATCCGCCA TAAAAGACAA TTTTTCTACC GTTTATCAGC TAATCGAAGA GATGGTTGAC
TTTGGATGGC CGTTAACAAC GGAACCCAAC GCGCTCAAGG CCATGATTCG TCCACCCACG
GTGATGAGCA AACTGTTGCA ATCATCGACG ACCGTCAGTG ACGAATTGCC GTCGGGAACG
ATTAGTAACA TTCCCTGGCG CGCCGCAAAC GTACACTACA CACAAAACGA AATTTATATG
GACATTGTGG AGGAGGTTGA CGCGATTGTA AACGCTTCTG GCGCGGTTGT GTCGTCGGAC
GTTAGCGGGT CGATTCAATG TCAATCACAC CTGTCCGGTG TTCCGGATCT GTTGCTAACG
TTCAAAGAGC CGGATCTGAT TGACGACTGC AGCTTTCATC CTTGTGTACG CTACGCTCGA
TTCGAAAACG ACAAAGTGGT TTCCTTCGTC CCGCCGGACG GTAATTTCGA GCTCATGCGA
TACCGCATAC ATCCGGAGCG AGCACGCAAT TTTAGTCCTC CGGTATACTG CCATCCGCAA
TGGTCATATA GCTCCTCAAC GGATGCGTCA CAAAGCATAA CATCTGAGCG ACCTACCAAA
AACGGCCGTA TAGCGCTACA AGTTGGTGTC ACAACTTTGA GCAGTTTGGT GTTTTCGGCG
TCAAGAAAGG GCCCCCTGCA GGTTGAAGAA GTGGCTGTAC TGATTCCGTT TCCTAAACAG
ACACGAACGA CTGCTGGGTT TCAGGTCAAT ATTGGTTCGG TCATGTATGA TGAAGCCGCC
AAAGTTGCCC GCTGGACGCT CGGCAAGATG GATGCGTCTA GAAAAGCGAC CTTGTCGTGT
ACTTTTACAG CCCTGACAAG CAACGACGAA GAAATCACAT CCTCCATACC CAATGTATCG
CTCACTTGGA AGATTCCGCT AGCATCCGTA TCGGGATTGT CCGTCAGTGG TCTCTCCGTC
ACTGGAGAGT CCTACAGACC ATACAAAGGT GTACGGAACG TTACCAAGTC GGGCCTATTT
CAAGTGCGGT GTAGCTGA
 
Protein sequence
MQSLFILSPT GEVLIERHFR GVVTSRSVCE TFWERAVPPV MEVPESDQGT LYVISILREG 
LSYLAVCPAE VSPLLIIEFL QRIANIFVEY FGPPADESAI KDNFSTVYQL IEEMVDFGWP
LTTEPNALKA MIRPPTVMSK LLQSSTTVSD ELPSGTISNI PWRAANVHYT QNEIYMDIVE
EVDAIVNASG AVVSSDVSGS IQCQSHLSGV PDLLLTFKEP DLIDDCSFHP CVRYARFEND
KVVSFVPPDG NFELMRYRIH PERARNFSPP VYCHPQWSYS SSTDASLVFS ASRKGPLQVE
EVAVLIPFPK QTRTTAGFQV NIGSVMYDEA AKVARWTLGK MDASRKATLS CTFTALTSND
EEITSSIPNV SLTWKIPLAS VSGLSVSGLS VTGESYRPYK GVRNVTKSGL FQVRCS