Gene PHATRDRAFT_50561 details


Gene Information

Locus tag: PHATRDRAFT_50561
Symbol:
ID: 7199389
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 128961
End bp: 131228
Gene Length: 2268 bp
Protein Length: 575 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185488
Protein GI: 219130682
COG category:
COG ID:
TIGRFAM ID:
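
The listed gene length follows from the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, though it agrees with the 2268 bp given above. A minimal sketch of the arithmetic, with `span_length` as a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(128961, 131228))  # 2268
```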


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACACAAAAAG AGGTTCCTAC CGCCCATTGT ACGAATTACG TAGTTATGTA TACAATTGTC 
AGTCACCATG AATCTAACTG TAAGTTTACA TTGCGCAAGG CAAACGTGTA GCTTTGGACG
TGTGAAATGG AATCCTGAGA TAGATAGATA GCTACCTGCA ACGTCGGGGA AGACAGGTTT
GGTTTGTGGC CTTTCGTCGA GTCGTTTCTA CAGAAATATG TGGAAAACGA GTGCCATGTG
TATCGGGAAA GAGCCCCTTT CATAGGCCTT TTGGACCCTT CTCCAAAGCC TCGACTGTGA
GGGAATTGTG TTCGTCCGTC GTGTATCTCG TCGCTCTCTC TCTCTCTTGT ATGTACATTG
GATGGGACGG AATCAAAAAG GATACCCCCG ACCTGCCCGA GTTGTCCGTC TCGATCCTTG
TCTTTGTTGC GCTCTCTCTT TCTCACCCGG ACACGGATTC ATCTATCTCT CTATTCATCC
CTCCGTTTCG TTTCTGTTTC TTTTTCGATT GCTAAAACAG ACTAATCCTT CAGGGACTGC
ATACGACGCC ATGACCGCCG CCAGACCCGA GCCTCACTCG CAACACAATC CCCGCACCGG
TCGCAAAGGC GACCCGCGGA TGCACCGAGC TGTCGCAGCT CGATTGAACG ATCCTAACCT
CAGTCTTTTT GACGCCTTGA CAGCCGGAGG CTTTACCTAC CCGACGGACG AGGACGCCAA
CTACATGGAC AGTGAACAAG TCACGCTGGG ACAACGCAAG AATCAACTCT CCCGAAGACT
CCGACTGGCA CGCAAGCAGA CGACGCGTCA CTCACCCCAC CACGAGAACG ACGAGAACTC
CTCACACAGC TCGACGTCGC ACGGAGGCAA ACGCTCCCGA CCCTCCGACG AAAATGGACA
CCACGGGGTC TCTGTACGGA AAAGGAATGT AGATCCCAAC GACCCAGCCT CGCTACAGAA
TGATCAGCAG CAAATAATGC TGGAGGAATT GAGTGTGGCG ACGGCCGAAG CGGAAGAAGA
CCGTCCGCGT CTCATGGCGA AATTCCATCC CCAGTATCAT CCCATACTCG TGCCGAGGAT
TGGAAATCAT CACCATCCGA TGTGCTCGTC GGCGTCGAAT ACGAATAACC CCTCTACGTC
GTCACGGCAC AATCCTTCCG CGACCAACAC GCATTTGGGA AACAGCGACA TGCCTTTCGC
CACCAGCAAC AATCGATCCA CGTACGAGAT TCACTACCCT CCCGAACGCA ATAACGATCC
CACGGCACCA TCGGGCGTCG CCATTGCGAG TCTCAACTCC ACCGCCCTCA GCCTCGGTAT
CACACTCGAG CAACTCGCAA TGACGCTGAA TAGTACCAGT ACGCTGGCTA AAATCATCAA
TGGGGACAAC AACACCGCTC GGCAGAAAGA TCTAGCCCTG CATTTGTACC AGAACAAAAA
CAAGGTGTTG TACGAAAAGT CCATGGTCTT GGCGGGGTAC CCAATGTCGG ACGCCCGGGA
AGGCTCGGCC AAGCACCTCG AGTTCGCCTT TGACGCCTGG AAAATTGAAG GCCAGCGTCT
GCAAGCGCTC GCGGCGGATC GACAAGCTGA AGATCCGCCA ATGGAAGCCA AAACGGGATC
ACCGCAGCCC CATTCTGTCG CTGGCATATC CCGTTTGAGT CAATCGAGCC GAACCGAATT
TAAAAACCAC AGCAACATAG ACCCCGGTCG TACGGCGCAC AAGAATTCGT CGACGAATGG
TCACCAACAT CGTCACGCGA ACGAATCGGA ATGCGGACTC GATGGTAGGC ACATGCATCG
TCTGGAAGGC AAGTGCGGTC ACAAGGCCAT CCTGCATAAA ATGCCGGACG GGAGTGCTCA
CATTGACTTT GTTGTCGGTG ACAAGGTTGA ATGCTACCAT AGCGTGCAGC CGGCACGGAG
TGGCAACGGC GGGAATCGCC TTAGTGACAT GTGGCCGTCC AAATATTTGT GTGAAGATTT
AAATTGTCCA AAGCACTGTA ACAATCAAGC TTTACAGCAC CAACAGCACG GAAATGATTG
TCAAACTCTT ACCGCATCCA AGATTTTGGA TTTAGAATCC ATCGATTTAG ATGGGAAGGA
ATGGAGCACC GATTTTACGA ACGACGACAC CAACAGCTTA AGGGCATTGC TCAACCTCGG
CGATTCGGAT ATAGGTCGAT CGCGGACTAC ATCCGTGGAC GACGGCCCTA CTACGAACGA
GCTTTTGGAA ATTTAAATGA TAAGAATCAA TGCAATGTAT TTCTCAAC
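
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before its length or GC content can be checked against the record. The following is a rough sketch of one way to do that. `clean_sequence` and `gc_percent` are hypothetical helper names, and the rounding behind the page's own GC figure is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ACACAAAAAG AGGTTCCTAC CGCCCATTGT "
    "ACGAATTACG TAGTTATGTA TACAATTGTC"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions on the full sequence text gives values to compare with the Gene Length and GC content fields.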
 
Protein sequence
MNLTTNPSGT AYDAMTAARP EPHSQHNPRT GRKGDPRMHR AVAARLNDPN LSLFDALTAG 
GFTYPTDEDA NYMDSEQVTL GQRKNQLSRR LRLARKQTTR HSPHHENDEN SSHSSTSHGG
KRSRPSDENG HHGVSVRKRN VDPNDPASLQ NDQQQIMLEE LSVATAEAEE DRPRLMAKFH
PQYHPILVPR IGNHHHPMCS SASNTNNPST SSRHNPSATN THLGNSDMPF ATSNNRSTYE
IHYPPERNND PTAPSGVAIA SLNSTALSLG ITLEQLAMTL NSTSTLAKII NGDNNTARQK
DLALHLYQNK NKVLYEKSMV LAGYPMSDAR EGSAKHLEFA FDAWKIEGQR LQALAADRQA
EDPPMEAKTG SPQPHSVAGI SRLSQSSRTE FKNHSNIDPG RTAHKNSSTN GHQHRHANES
ECGLDGRHMH RLEGKCGHKA ILHKMPDGSA HIDFVVGDKV ECYHSVQPAR SGNGGNRLSD
MWPSKYLCED LNCPKHCNNQ ALQHQQHGND CQTLTASKIL DLESIDLDGK EWSTDFTNDD
TNSLRALLNL GDSDIGRSRT TSVDDGPTTN ELLEI