Gene PHATRDRAFT_36628 details


Gene Information

Locus tag: PHATRDRAFT_36628
Symbol:
ID: 7201883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 921722
End bp: 923131
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002180924
Protein GI: 219120369
COG category:
COG ID:
TIGRFAM ID:
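The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(921722, 923131)
print(length)                           # 1410
print(expected_protein_length(length))  # 469
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.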


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0161158
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACCC ATACGACGCG ATTATTGCAC CAATTTCTCC TCATACTAAC GCTTGGGACC 
GTGTCCTTAA TTTTTGAAAT TTACTGGACT ACACATGCGA TCATTGAAGG CAATGCGAGC
ACACCGGAAA AAAACCTTTG GAAATACTTT TCTTCATTCG TGTCGACAAG AAATGCCCAA
ATTTATATCA GCAGTATTGG CTTCGATAGC TTCGCTGGCC GATTCACGTC GCCGAAACCA
TGGAATCACT CTCAGTTCTT TCCTGGAAGA AGGAAAATCG ATGCACGCGA CTCCGCCATT
TACAGTTTTG CCAACATATC GAGTGCTTCG GCTGCTATTC CACTCGTGCT GACGGTGAGC
TCTGAGAATG CTAGCAGCAT TACGACGACC AAAGCAAATA CCTCCGGCCG AAAGGAACAC
CCCGACAATA CATTTTCAGC ATGCATCCTC GTCATGGACG ACAACCATCG ACTCGTGGAG
TGGATTGCTT ACCACTACTA CGCCTTGAAT CTGCGGCACT TGGTAGTTAC GGTGGATCCT
CATTCCAGAA CGCGTCCTAC GGCGGTACTA GATCGCTGGA GAGATCGCAT GTATATCGAA
GAATGGAATG ATCGATCGTT TCTACCCTCT AATATTGGAC GAAGCGCGAA CGATACGGTC
GAACAACGAC ATTTAAAGCA TAGGTTTCGT CAAGCACAAT TTTACAAGGG CTGTATCCGA
AGACTAAGGG AATTCAACCG ATCTTGGACG ACTTTCATCG ACTCCGATGA GTACCTTACC
ATCAATAGCC GCATGGTGGA CAACACGGCG CTCCGGATGC AACAACCAGG ACACGTCGCC
GACTACTTTC ACGAGTTAAC ATGGCAAGCG CACAGCGGAC CAAACTACAC CTTCGCTGTG
AATTTCGGTC AATCCTGCGT TTTGCTATCA CGCGCCATGT ACGGATCAGT AGAAAGTACA
GACGAGGAGA TCCGTAGGGA CGTGCCAGAT TTTCTGGATC CAGCCCGCTT TGATACGCTG
CGATGGCGGC ATCGCTCAAC CGAGGATGAC CACGTGTTAG CCAAAAGTCT GATTGACGTC
TCTCAAGTAA AGCAACACCA CTTGGATGGC AAAGCCAATG CGCACAAAGC TCTTGTTGAA
ATGTGTAAGT CAAACTCGTG GATAGCGTAT ACACTGCCCA TTGGCATTCA TCACTACCTT
GGTAGTTGGG AGCAGTATAG CTACCGAGAC GACGCTCGCG ATGGTGGCGA TGCACACAGC
TACGAGACGT GGCAAAGGAA AGGCTCTGCG CTGGTCAGTA CAGATGACGA GATCCGCCCT
TGGATTCGCG GGTTTGTAAA GATGGTAGGC AACGGTACGG CACTGTCTTT GTTGGAAGGA
GCTGGGCTTC CCACAAACCG GACTGCGTAA
 
Protein sequence
MATHTTRLLH QFLLILTLGT VSLIFEIYWT THAIIEGNAS TPEKNLWKYF SSFVSTRNAQ 
IYISSIGFDS FAGRFTSPKP WNHSQFFPGR RKIDARDSAI YSFANISSAS AAIPLVLTVS
SENASSITTT KANTSGRKEH PDNTFSACIL VMDDNHRLVE WIAYHYYALN LRHLVVTVDP
HSRTRPTAVL DRWRDRMYIE EWNDRSFLPS NIGRSANDTV EQRHLKHRFR QAQFYKGCIR
RLREFNRSWT TFIDSDEYLT INSRMVDNTA LRMQQPGHVA DYFHELTWQA HSGPNYTFAV
NFGQSCVLLS RAMYGSVEST DEEIRRDVPD FLDPARFDTL RWRHRSTEDD HVLAKSLIDV
SQVKQHHLDG KANAHKALVE MCKSNSWIAY TLPIGIHHYL GSWEQYSYRD DARDGGDAHS
YETWQRKGSA LVSTDDEIRP WIRGFVKMVG NGTALSLLEG AGLPTNRTA
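
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that step together with a simple GC-fraction calculation; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example uses only the last printed line of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so 10-character blocks become one string."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the gene sequence as printed in this record.
last_line = "GCTGGGCTTC CCACAAACCG GACTGCGTAA"
seq = join_blocks(last_line)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.6
print(seq[-3:])          # TAA
```

Applied to the full gene sequence, the same helpers should give values comparable to the Gene Length and GC content fields, keeping in mind that the record reports GC content only as a whole percentage.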