Gene PHATRDRAFT_8773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_8773 
SymbolMARK3 
ID7196841 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1876123 
End bp1878387 
Gene Length2265 bp 
Protein Length511 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176859 
Protein GI219110215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.527877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAGG CTCCCGTACA AATAGGCCAG TTTATACTGG GCAAGAATTT AGGAATCGGT 
GCCTTTGGAA AGGTAAGTTG TACGGAACAC ACGAACGATA TTCCCATGGT GACACTCCAG
AAGATCCTTG TCGTCGGACA GCTACATTTT TCAAGATGTC ATGTACTTTG ATTTAATTTA
TTGATTTTGA CAATCTAATT GTCGCTACGA CATATTATTT TGGGGATGGT CGAAGAGACC
AGTTGCGTTC GTATCGTTTA GGGACCATGG GATTGCCCCA ACGATACTGC CCTCTCACGT
GTATACTTCT ATCACCTTTT CTATCAGGTA AAATTGGCAA CACATGCCGT AACCGGACAC
AAGGTAGCGG TGAAAATCCT GAACAAGAAC AAAATCAAGC AGCTGGGTAT GGAGGAAAAA
GTTCATCGGG AAATCAATAT TCTGCATTTG TGCACGCACC CACACATTAT TCGCCTGTAT
GAAGTTATTG ACACCCCAAC TGATATATTT CTTGTGAATG AGTACGTCTC TGGCGGCGAG
CTTTTTGACT ACATTGTTTC CAAAGGCCGA TTGTCCGCGG ACGAGGCCCG TAATTTCTTT
CATCAAATTA TTTCTGGAGT GGAATATTGT CATTTTCAAA AGATTGTTCA TCGTGATCTC
AAGCCGGAGA ACCTTCTCTT GGACGCCAAT CTGAATATCA AGATTGCCGA TTTTGGATTG
TCGAATCTCA TGAGAGACGG TGACTTCTTG CGTACGTCAT GTGGATCCCC AAATTACGCC
GCACCAGAAG TTATCAGTGG CCATCTGTAC GCCGGACCAG AAGTCGATGT CTGGTCCTGT
GGTGTCATTC TTTACGCTCT TTTGTGTGGA TCTCTTCCGT TCGATGATGA ATCGATTCCG
AACTTGTTCA AAAAGATCAA AAGTGGAATG TACAGCTTAC CAACACATCT TTCCCAGTTG
GCGAAGAACT TGATTCCGCG CATGTTGGAA GTTGATCCAA TGAAAAGAAT TACTATTGCC
GAAATTCGTT TGCATCCGTG GTTCCAGCAT AAGCTTCCTC CTTATTTGAG GCACCCTCCA
GAGCTGATGG AGAAGCAGGA GCGAATCGTC GATCAAGAAG TTATTGATGA AGTGATGAAG
CTACCGTTTC ACAAGGCTTA TGGCAACACG AAAGGTCTTG CGAACGGTAC TCTCAATGTG
CCGCAACATC AGTTCCTCAC TAATATTGTC ACAAGGGAAC TAGTTGAGAC AGCAGCGGCC
TTGGAAGACA GTCGCGACTC CGACGCGAAA AAATTGCTGA AGGATTTGCG ATGCGCGTAC
GAGCTAATCC TCGATCACAA GCACACTCGT CTTCGCGTTA TGGAAGTCGC CCGCGCCATT
CAAGAGGCGG CGAGCGCGAC GCCACCAGCG TTCTCCCCTG GAGGATCTCG AGGGACAACT
CCCGGTGGTC ACTATGGAAC CGGCGGTAGT CGATACGGTG GTAGTGTCGG CAACAGCTAT
GATGGTGGGC GAACATATTC TGCAAGTGTG TCTTCCAATT CCCATTCTCC GACTGCTTCG
CAATCATCAC CCGCACAACA ATCAAGACTT GCAGAAGAAG CGACTCGTGC ATTAATGCAA
CCTGGTAGTA CGACCAGCAG CCACAGCTCA TCGCATACTC CCCCGGGATC CGTATCGGCA
ATGTCTGGTC ATGGAATTGT GCAAATGACT TCGTCAATTC CAGGAAATAC GGGTATGATT
GCCCAACATC AACACGGACG GCGAACTCGC CGATGGTATC TTGGTATTCA ATCAAAGAAG
GATCCTGCAC ACGTCATGAC GGAAGTCTAT AAAGCGCTCA TGTCGCTTGG TTGTGAATGG
TTACAGCTAT CATCCTACCG AATCAAGTGC AAATGGCGTC CAAATACTGG AGGGAGCGGT
TCGAGCTCTA CAATTCCTTT GGCTGGGGGT GAATCCCCAC AGGCAGCGTG GATTTCGAAT
CCTTCTCGAG GATTGAGCGA CGCATCGATG GATGTGGACA TCGATGGCAA GGAGAGACAT
ACCAGCCCGA CGTTTGGACT CAATGCAATG CAAGTAATTG CTGGTGAAGA TGGGCATAGT
TTGCGTGTAC CGAATCTCTC AACCTCAGAA TATTCAATCA AAATTGGTCT AACTCTCTAC
AAGGTGCAAC AAAATATTTA TCTTCTCGAT TTTCAGAAAA TGACCGGCGA CGCCTTTTCT
TTCATGACTC TGTGTGCGAA TATCATAACG GAGTTGAAGA GTCTG
 
Protein sequence
MDQAPVQIGQ FILGKNLGIG AFGKVKLATH AVTGHKVAVK ILNKNKIKQL GMEEKVHREI 
NILHLCTHPH IIRLYEVIDT PTDIFLVNEY VSGGELFDYI VSKGRLSADE ARNFFHQIIS
GVEYCHFQKI VHRDLKPENL LLDANLNIKI ADFGLSNLMR DGDFLRTSCG SPNYAAPEVI
SGHLYAGPEV DVWSCGVILY ALLCGSLPFD DESIPNLFKK IKSGMYSLPT HLSQLAKNLI
PRMLEVDPMK RITIAEIRLH PWFQHKLPPY LRHPPELMEK QERIVDQEVI DEVMKLPFHK
AYGNTKGLAN GTLNDLRCAY ELILDHKHTR LRVMEVARAI QEAASATPPA FSPGGSRGTT
PGGHYGTGGS RYGNTGMIAQ HQHGRRTRRW YLGIQSKKDP AHVMTEVYKA LMSLGCEWLQ
LSSYRIKCKW RPNTGGSGSS STIPLAGVIA GEDGHSLRVP NLSTSEYSIK IGLTLYKVQQ
NIYLLDFQKM TGDAFSFMTL CANIITELKS L