Gene PHATRDRAFT_50430 details


Gene Information

Locus tag: PHATRDRAFT_50430
Symbol:
ID: 7199247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 34323
End bp: 37294
Gene Length: 2972 bp
Protein Length: 917 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185419
Protein GI: 219130536
COG category:
COG ID:
TIGRFAM ID:
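
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which agrees with the reported 2972 bp), and the helper names `span_length` and `cds_length` are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 34323
end_bp = 37294
protein_length_aa = 917


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def cds_length(protein_aa):
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(start_bp, end_bp)
coding_len = cds_length(protein_length_aa)

# Prints: 2972 2754 218
print(gene_len, coding_len, gene_len - coding_len)
```

Under these assumptions, 2754 of the 2972 bp are needed to encode the 917-residue protein and its stop codon, which leaves 218 bp of the annotated span outside the coding region.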


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCAAAAGCAA CACCACCAAG CGTACCTGTC AACTTTCGTC CTCCACACCG TGCTTTTCTT 
GTCATTGATT TGTTCTGTGA CTCTAGCGAT TGACACCGAG AAGCCGTTCT TTCGCAAAGC
ATCCCCGGAC AGCACGAGGA CGATTGCTGG ATCACTTTCG CGCCCACCAT CCTTCCGTTT
ACGAGAACAG ACGCATCCCC ACACCATACG CCAGTGTCAT GTCCTCCTCG TTGACTCCCG
CTTCACTCCG ATCCGACGCC TCAACGGAGT CGGGAACAGA CGAAGACACA CAGGCATACG
ACAAGCTGGC TCTCGACGAT TGTGTGAACA GTCCCGTCCC CTTGAATCCG GACGAATCAC
AAAACGCCGA AATCCGAGCG CTTGCGGCCA AACTCGCCCC TGATTTTGAA CAAGCGGACG
CTCTCCTGCA GGTTCCATTC GAATTTCAGT CGCATCCCTC CTTCTCTGTT TTGGAAGCAG
CCACTTCGGA AGACGATGAC ATGGATCGGG AAATGGAGAG TCTGCAATTC TCGGAAGCAC
TCTTGCGACA AGAATTGGAA CTCGCCCAGG ACTTTTCCAC CTTATTTCAA ACAGTTTCGC
ACAATATGTC AAACACTAGC GACGCAGTCG ACAAATTCAA TTTATCGGGA ACACCCTTTT
GCGAGGGGAT GAAACGACAG CTCTTTGATC CGCCGGATCA CCAAAGTGCC GACGGTTCAC
CTGCCCTCGC TCTGCAGTCG TCGTCCAGTC GAGAGGCCGA TATAGATCCC AGCGACGGTA
ACAAGCAGGA AGAATGCAGC TTGGAAATTG ACGACGACGA TTGTCCAGAC AAAGTTCTGT
CGCCAATCCG TATACCATCT CCAGTATCTT CGCCGAATGG GAAATCGACA GCCACGACTT
CTTCGTCTCC CGAAGTTGAC CAAGGACCCA GGGATAGTTC CATACTGCTA ACGGCAGGTA
CACCATCCCG TTCCGTAGCC AATCCTATTC AGCCCTACAC ACACAAGGAT CACTTTACCG
CTTTGCGCCT TTCGTTGGAA GAACACGGCG GTTGGTATTC GGTCGACTTG ACTTCCTTCG
TTGTACCACC CCGCGACACA ACTACCGGGA ATCTCATCGG TTTGAAAGAC TATTGTCTCG
CCATTCCCGA ATCCAAACTT AAACATTTAT ATGTGGGTTT ACCAGATGCA GCCAAACCCG
GACCTGATAC TACTGTTACT GGTTGTAGTC AATACACACT ACCCTTACCG GTTCGCACCT
TGGCAATCCC AGTCCGACCT GACGTTCTCT GTGGAGCCAT AATGGATGCC GTACATCAGG
TTCTGACTAG TACCGCCATA CATGCACGGA TTCTCAAACG ACAGGGCGGT CATTTACGAG
GAATCATATC CGGGTGTGTC GTCCCACGAG ATCCGGATCG ACTTGTGGAA GAATCCTTTC
ACAGTAAATC AAGTGCGGGC GAGTTGGAGG GTGTACCCAG TACATACCCA CCCTTTCTCA
TCGATGCTCA ATTATGCACG GCCAAATCTG ATTCTTGCGA GCGAGTCTTG TTGCTACGCG
TGTATCACTG TTGCAGCGAG TCTATTCAAC CCAGTGACGA TGTGGACAAT ATGAACACAT
CTTGGGTCAC GGCACTGTCA CAGCAAAATC CGGCCGGCGA TCACGTATTA GGAACAAATC
ACTTTGCGTT GGTAGAACAC CTTGATATTG AGGCATCCAC TCGTTTGCGC GAATGTTGTG
CATTGGTGCA GCGGGTAGAA GCTCCCGAAT TGTCAAAAAG GATCCGCTCC CCGGGCAAAC
GCTTCGACAA CCGTGAGTCC ATGCAGACTC TTGTCTCCAG TCATCTATTG GAGCATTACC
GAGCGTGTCC GTCCGTCCGC GAAGGAAGCA TCACCTTACC GTCTCTGAAT TCGGACGACT
GGCCAGTGAT TCAGTCTTCG TGGCGTTTCG TCCAAGCGAC GTGGGAAGAG CTAGAAACAC
GGGACTTGAC TTACACGACA TTGACCACGG CGCGCTTTGG GGCTTTTCCG GCTTTGCCTA
CCTTGGATGT ACACTACTGC TCGCAGATTC GACGGTTTTC ACGGGAAGTC ATGATAATGC
AGCTACTGAA GAGCGCGAGC GAATTAGAAG AGTACGCGCG CGAGGCCGAG TACGCTTGTG
CCAACATGAT TTCGCTACTA CAGCCAACAT TTGACGCATA CGGTATGGAA GCCCCATTGT
TGCCCAAACC TGTGCCGCTC AACGAATATC CCTTGGACTT TACACCACCC CAACAGGCAT
GTCCACCTTG GGGGCTGAGA GTGATGGAAG CTTTAAACGA GACTCAAGCT CTTACAAGTG
ACGCCGGGCG AGACGAACCG ATTTTGTCGC CGACAACTCT GTATGCTATT GATGCGTCAG
AGTCATTGGC CATGGCACGA CGTGCGGTCT CTCTTATCCT GAACGCATTT CAAATACAAG
ACGACGAGGA GAAGGGTGCT CGGCTAGGCC GTAAGAACCT ACAAGTAATG GATAGGTTGG
CCAAGATGCA AGCACATCAG CGCACTTTGA TTCAATCCTT ACAGAACGGT ATCGCATTGT
CCGAGAAGGC AGCCAAAGCT GCAGATGATT TTCACAGGAA AGCGGGTGTC ATGGAAGTAC
CTTTGTTAAA ATGGAATATT GTTGTTGGGG GGGCTTCCGG CACCTGCTCT GTAACGGCAA
AACACCTTTT GTTTATCACT CAGCTCATTC CGGTGATTGG TGGCAGCCGG ACGGCCATCT
TCCGGATAAG CGAAGTGGAC TTTGATGTGC AAGAATCAAC TCCCTCTATT CTTAATCCTT
TACCAACAGT AGTAAGCGTG CGAAAAGACG GCCAGCAAAT ATACAGCTTT CGACCTTCAG
CGGGTGGCAA GAGACTGAAG AGCGTTTTGG AAACAATCAA GGCAACGGCT CTGGACCAAG
ATGCGCTTCC AGAATCACCC TCATCAGCAT AA
 
Protein sequence
MSSSLTPASL RSDASTESGT DEDTQAYDKL ALDDCVNSPV PLNPDESQNA EIRALAAKLA 
PDFEQADALL QVPFEFQSHP SFSVLEAATS EDDDMDREME SLQFSEALLR QELELAQDFS
TLFQTVSHNM SNTSDAVDKF NLSGTPFCEG MKRQLFDPPD HQSADGSPAL ALQSSSSREA
DIDPSDGNKQ EECSLEIDDD DCPDKVLSPI RIPSPVSSPN GKSTATTSSS PEVDQGPRDS
SILLTAGTPS RSVANPIQPY THKDHFTALR LSLEEHGGWY SVDLTSFVVP PRDTTTGNLI
GLKDYCLAIP ESKLKHLYVG LPDAAKPGPD TTVTGCSQYT LPLPVRTLAI PVRPDVLCGA
IMDAVHQVLT STAIHARILK RQGGHLRGII SGCVVPRDPD RLVEESFHSK SSAGELEGVP
STYPPFLIDA QLCTAKSDSC ERVLLLRVYH CCSESIQPSD DVDNMNTSWV TALSQQNPAG
DHVLGTNHFA LVEHLDIEAS TRLRECCALV QRVEAPELSK RIRSPGKRFD NRESMQTLVS
SHLLEHYRAC PSVREGSITL PSLNSDDWPV IQSSWRFVQA TWEELETRDL TYTTLTTARF
GAFPALPTLD VHYCSQIRRF SREVMIMQLL KSASELEEYA REAEYACANM ISLLQPTFDA
YGMEAPLLPK PVPLNEYPLD FTPPQQACPP WGLRVMEALN ETQALTSDAG RDEPILSPTT
LYAIDASESL AMARRAVSLI LNAFQIQDDE EKGARLGRKN LQVMDRLAKM QAHQRTLIQS
LQNGIALSEK AAKAADDFHR KAGVMEVPLL KWNIVVGGAS GTCSVTAKHL LFITQLIPVI
GGSRTAIFRI SEVDFDVQES TPSILNPLPT VVSVRKDGQQ IYSFRPSAGG KRLKSVLETI
KATALDQDAL PESPSSA
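
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is not the annotation pipeline's method. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table field blank, and it picks the first ATG in a two-row excerpt as the start codon purely for illustration. The names `clean_sequence` and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the standard genetic code (NCBI translation table 1),
# ordered TTT, TTC, TTA, TTG, TCT, ... GGG. Using table 1 is an assumption:
# the record does not state a translation table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Rows 4 and 5 of the gene sequence listing (bases 181-300).
excerpt = """
ACGAGAACAG ACGCATCCCC ACACCATACG CCAGTGTCAT GTCCTCCTCG TTGACTCCCG
CTTCACTCCG ATCCGACGCC TCAACGGAGT CGGGAACAGA CGAAGACACA CAGGCATACG
"""

dna = clean_sequence(excerpt)
orf_start = dna.find("ATG")  # first ATG in the excerpt (illustrative choice)
peptide = translate(dna[orf_start:])

# Prints: 38 MSSSLTPASLRSDASTESGTDEDTQAY
print(orf_start, peptide)
```

The translated peptide matches the first 27 residues of the protein sequence. Offset 38 within the excerpt corresponds to base 219 of the full listing, so under these assumptions the coding region is preceded by 218 bases.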