Gene PHATRDRAFT_47663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47663 
Symbol 
ID7202867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp453022 
End bp454920 
Gene Length1899 bp 
Protein Length495 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181910 
Protein GI219123185 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.55455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCATCGTCGC AATGGATTTT CACAGGCTTG TTTCGAACGT GGGGTCCGCA AGCTACTTCG 
TTCAAAACGG GTCAGATTAG AAGCGTGATG ATAGAGTACC AAGACTTGCT CGTAACTGAC
TAAGATCGAG ATCACATCCG ACCTCCCTCG CAATCGATTC GATTTGAAAG AAAGAAAGTC
ACTCGTTACA AAATGAGTGA CCCTTCCAGT AAGCAATGGA TTCACATTCC TGATTCATCT
AGCTTCTGGA CTAGGTGGCG TATGATTTAT TGCCTTTTTA GGTGTTTCGG TGTCACAGTG
TCTGCAGCTT TCCAGTTTAC TCCATTCAAT CGCGGAACAA AGTTGGCTGT AGTACCTATG
GCAAATTACG CTCCGATCTC TGAGGGAGTG AATGATGAAA CACGACCGCG CAGTATCTTT
TTCCGTTCCC CATTGCTGGA TTACGGTTAC CTTCCCGTCG TCGAGGAATA CGAAAGCGGT
AGTCTAGCGA GAAAGCCGCT GTTACTTTAT CTTCCAGGCT TTGATGGATC TTTTCTAAGC
GCGTTTCTGC AGTATCCGGA ACTTTCGACT GCTTTTGATG TTCGGTGCAT GAGTATTCCA
GCTTCGGATC GATCAACATT CAATGAACTG AAAAGATCAG TACTACAATA CCTGCGTATG
GAGATAGCGG AATCAATAGT TGGAGATTTG GATCAAAGGT CCAGTCGGAA CAAAACCCAA
CCTATTCTAA GCTCTAGTCC ATTCGATCAA ATATTCTCTT TTACCAAGGG CGCCTCCTCA
AAAGCGGTAT ACAAGAGGAG CAGCCGGTCA GTATACCTTG TCGGCGAATC ATTTGGTGGC
CTTCTAGCCA GTGAAATTGC CTTGTCGATT CTTGAGAGCG AGAAAAGCCA TGCGAATAGC
ACTATCGATT TGCAAGGACT CGTGCTCGTT AATCCAGCTA CATGCTATGA CCGGTCTCGC
TTAGCCGCCT TAGGACCGCC TGTGGCCAAC AGCGTACCAT GGATGTATCC AGCCAACTTG
GCAAAGCTCC TGCCCCTCTT TACCGACGAG TATTCTTTGG CTCAATTGAG ACTAATCGTA
CAAGCCAAAG CCTTGCCCTC TGTAATTGAT GATGCTCCCC GTGAAGCCTA CTTGGGACGT
GTGGCATTAT CATTGCCTTT CATCTTTCCC TCCATGCCTC AAGCCACTTT GTCGTGGCGG
CTGTCTCAAT GGTTGGAATT TGGATGTGCT AGTGCCGAGC AGAGGTTGAC GGGTCTGGCT
GCTTTCCCTA GCTTTCGTGT ATTGATTGTC GCGGGGGAAT TCGATGCCTG CTTACCATCA
ATCGACGAAG CCGAGCGTTT GGTTAGTGGC GTCTTGCCCA ATGCCAAGGT GCACGTTGTG
GAGGGTGCTG GGCACGCGAG TACCTGCGGT AGTCGGATGG ACTTGACAGC TGTTATGCGC
AACTGCTTTG TTGAACTACA ACAGAAAAAT GGACGCCGTT CAGTGACCTT GCGGACGGCC
ATGAAAAACG AAGCGGCATC AGGCATAGAA GAGTATTTCG GCATGCAACC GCGATACGAT
AACGCGACAA TTGGATTGAA TCCGTTACGC TACTGGAGTC CGGAATTATA CCTAAAGCAC
CGACCTAAAA CCGGCCCAGG TCAGCGGAAA ATTTCTCGTA CCACCAGGCA CAAAGGATAG
GTAGGTACAA TCGAGCGACT GGAATTTTAA ACCGATATAC TGTTATGGTA CCATTGGGGC
TTCTCTTCGG AAAAACATCT TACAGAACTG TGCCGAAAAA TAAAGGTGAG ATTCCTATAT
CCGTTAATAG TAAGGAGATA GACGTTCGAG GATGATCTGA GGCTCTGTCT GGAAATTTTG
AAATTGACAA CAGGTATGAC CTTGCTGTTA GCGTTTGGA
 
Protein sequence
MSDPSSKQWI HIPDSSSFWT RWRMIYCLFR CFGVTVSAAF QFTPFNRGTK LAVVPMANYA 
PISEGVNDET RPRSIFFRSP LLDYGYLPVV EEYESGSLAR KPLLLYLPGF DGSFLSAFLQ
YPELSTAFDV RCMSIPASDR STFNELKRSV LQYLRMEIAE SIVGDLDQRS SRNKTQPILS
SSPFDQIFSF TKGASSKAVY KRSSRSVYLV GESFGGLLAS EIALSILESE KSHANSTIDL
QGLVLVNPAT CYDRSRLAAL GPPVANSVPW MYPANLAKLL PLFTDEYSLA QLRLIVQAKA
LPSVIDDAPR EAYLGRVALS LPFIFPSMPQ ATLSWRLSQW LEFGCASAEQ RLTGLAAFPS
FRVLIVAGEF DACLPSIDEA ERLVSGVLPN AKVHVVEGAG HASTCGSRMD LTAVMRNCFV
ELQQKNGRRS VTLRTAMKNE AASGIEEYFG MQPRYDNATI GLNPLRYWSP ELYLKHRPKT
GPGQRKISRT TRHKG