Gene PHATRDRAFT_47724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47724 
Symbol 
ID7202902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp641412 
End bp643378 
Gene Length1967 bp 
Protein Length646 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181946 
Protein GI219123260 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.642125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAACGG CTCTGGGAAA GGAAGACGAG GGTCAAATCC CACGTCAACT GGATGCCGTT 
ACCGAGTCGG AGCAGACCGA TCAACGAAAG AGCAATGCGG TCGCGGCAGA AAGCGAGGAG
TTTCATCGAG CACCGCAACG CCAGACGCTA CATATGCGGA AGGAATCCAA AGAGTTTCTG
AACACTATGC ATTCCTACTT GGATTGCAAA CGAAAATTGG AAGCCTTGGC AGGCAAAGTG
GACCCCTGCC GAACACTGCA CGGCATGACG CTATATACGC ACTCGGGCGG AATACGCGAG
GCGGACGGAA CCTACAAAGC GACTTATCGG CAGCAAAACC AATCTGGAAA CCCGGTTCGA
GACTCGGAGG AACTCCTGCA AGCCGCCACG GTGACACGAC CTCTCTTTTT GGCGCTCATG
CAACAACTAG CATCCCAAAT AATGCAAAAA CGACAAGATC TGCGCCGGGA CGGTGCGCAA
TTCAAGGCAC TGGTCGAACT TCCCCTCAAA GCCTGGTCGA GGATTGTGGA AAAGTCACGG
GACGACTACG CGAAACGGAC ACCGGGTCCA CCGGAAGCAT GGCTCTACGA TCTCAACCGC
GCTTCCGTTC TTTGCCCTGA CGTTGCCACC ATGGACGCCG TCGTTACGTG GCTTTACAAA
AATACATATA TTGTTACGGC CAAGAATCGC TTTCATCGGC CCACTATCAG CGGATACCGT
GACTTGCTCT TTGTTTTACG GATACCCGTG AGACCCACGG ATCAACAAGG CGGTACCGGT
TCACCCGCCA ATCCTGTTCT ATTTTTTCAC ATGTGCGAAC TGCAAGTGCA TCACGAACAA
ATTTGGCAAC TTAGCAAAAC ACTGGAGACG CACAGGCTAT ATCGATACTT TCGCTCCTAC
TTTTGCGGCA ACGACGAAGC CAGTGTCGGG AAGCTGATGA CAGAACTGGT TGAAATTGAA
GATTCTGGCA AACTCGATAC GATGGTTGTA CAGCGAGTGC TGGAATCGGA CGACGTCCCG
CGGATACGCA AGCTCACCGA GCTCTTTCAA CATCTGCCAG AGCACGATTT TGCCCTACTC
TTGTCCCAAC GCGCTCTCGA CATTCACGTG GCGGCCGAAC CGGATGCTCC TAGCCATAGC
AAGCATGAAA CCCACCACAG TCATCGTATC AACGTTGCCG AATCGTACAA CCATATTGGC
GACATACTTT TGGCTAAGGG AGAGTATCTT AGTGCCTTGA CGCACTATCG ACAGGCACTC
GATATTTACC TACAGACGCT TGGCCCACAG CACGTCCAGA CGGCAGCGAC ACACAACGCG
ATTGGTCTAG TGCTATCCAA TCAAGCATCG TACGACGAGG CCTTGATTCA TTTCGAAAAA
GCACTCGCAA TTTTTTCAGC GCCCGGGGGA GACAGCAGCA GCAACAATAC CGACCAGAGT
ATCGCCAAAA ATGCCCATCC CCAGGCCGCC ACCGCTTGGA TTTACATTGG TCACATTGCC
CAGGCTCGTG GAAATCTGGA GCAAGCTCGG GAGAATTACG AAAAAGCGCG TGCAATCGCT
TCGGCGCTGG AGGAACAGCT ATACGGAACG AAACAGACTC TACTCGCCAC GACAGAAACG
AATTTGGGAA TTGTAAGCTA CCATCAGGGG GACCTGGACG ATGCCTTGGT GCACATGCAA
CAGGCATTGG CTACCCGTCA AGCTGTTCTG GGACGTAAGC ATCGTCAAAC TGGTCTAGTC
CACGAGGCAT TGGGAACAAT TTGGCGTGAT CTGGGAAATT TGGAAACCGC CGCTGAACAC
TTCTGTCAAG CGAACAGCAT TGATCAAGGA ACAACATGTG ACTGGGAACG CTCGCTATTG
AAGAAACGGT TGCATTGGAG CACAGGGGGA CTCTCTATCG ATGAGAAAGT GAGCCGGACT
TCTACAGAGG AGTTACCGTA GCTCTCAGGG ATAGGTGCAC GACTGAT
 
Protein sequence
MGTALGKEDE GQIPRQLDAV TESEQTDQRK SNAVAAESEE FHRAPQRQTL HMRKESKEFL 
NTMHSYLDCK RKLEALAGKV DPCRTLHGMT LYTHSGGIRE ADGTYKATYR QQNQSGNPVR
DSEELLQAAT VTRPLFLALM QQLASQIMQK RQDLRRDGAQ FKALVELPLK AWSRIVEKSR
DDYAKRTPGP PEAWLYDLNR ASVLCPDVAT MDAVVTWLYK NTYIVTAKNR FHRPTISGYR
DLLFVLRIPV RPTDQQGGTG SPANPVLFFH MCELQVHHEQ IWQLSKTLET HRLYRYFRSY
FCGNDEASVG KLMTELVEIE DSGKLDTMVV QRVLESDDVP RIRKLTELFQ HLPEHDFALL
LSQRALDIHV AAEPDAPSHS KHETHHSHRI NVAESYNHIG DILLAKGEYL SALTHYRQAL
DIYLQTLGPQ HVQTAATHNA IGLVLSNQAS YDEALIHFEK ALAIFSAPGG DSSSNNTDQS
IAKNAHPQAA TAWIYIGHIA QARGNLEQAR ENYEKARAIA SALEEQLYGT KQTLLATTET
NLGIVSYHQG DLDDALVHMQ QALATRQAVL GRKHRQTGLV HEALGTIWRD LGNLETAAEH
FCQANSIDQG TTCDWERSLL KKRLHWSTGG LSIDEKVSRT STEELP