Gene PHATRDRAFT_47576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47576 
Symbol 
ID7202637 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp158326 
End bp160147 
Gene Length1822 bp 
Protein Length574 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181856 
Protein GI219123073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.124789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TACCCTCCAA CGACAACGAT CAACGAAAAA GTGTACACGG AACGACGGCC ACGCATGACG 
AGTCGGCGCG TGACGGCGAC CTTACGGCAA CAACTGCGAC AAGCCGCTCG CGGTGACACT
CAATCCACAC CCAACGGAAG TCCCCCGCGG CGACTCTGGA AATGCTTTCC ACTCACCGCG
CAACACGTCA ACGTTCCCAC GAAACGTCAC CCCGACGACA CTGACGACGA ACGAAACACT
TCGGCTCATC CACGCTGGTT GCGGACTCCC AGTGAGTTTC ATCAAACTCT CTGCGATCGC
ATCGAACGTG CCCGACGTCG TGTGCACTTG GCCTCACTCT ACATTGGACC CGCCGTGGAT
CCGTTCAAGT ACGACAAGGA AGCAACCTTT GCCCAAGTGT TGTCACGCAT CGATCCCCGA
GTCGACGTCC GTATCTTGTT GGATCAACAC CGAGCGCTCC GTCCCGTGCC GGTGCCTCCG
CAACGGGCAC CCGCACCGTT CGTCCGCCAC CATCTCGTCG GCGGAAGCCT GCCGACGAGC
TCTCGCGCAA CGTCCCAAAC TACCGGACGA ATCCCACAAC GGGACTGCCG ACACCGAACA
CAACAAGAGT CAGATTCATT TGTTATCGGT ACTCGGTCCT TGGCTGTCCC GACTGCCCAA
TCCGTACAAC GAAATTGCCG GCGTCTTTCA CGTCAAACTC TACGTCGTGG ACGACGCCGT
CCTCCTCAGT GGCGCCAATT TGTCGCAGGA ATACTTTGCC GATCGACACG ACCGCTACGT
ATGTATATAC AACGGTGGCA ACGGGTTGGT CGACACCTAC GTCGATTTGA TCCAAGCCTT
GTCGGAATTC GGTAGTCAAC GATACGAAGG AATCGACGAG AATGGTGTGG CACAACTCAC
CAACGTACCC GATCGACAAC GACTCTTTCG AGCGATCCGG GACGTACTGA CGATCGAGGC
CGATACCGCA GGAATCGACC ACGAACCAGA TCCCGACGTC ATTGCCTACG CGGTACCCAC
CTTTCAGGCA CCCCCCGGTT ACTTTACAGC AACCTGCGAC GCAACCGAAC TCGCCACCAT
GCCCACTGAT CTACAAACCA TTCACGATTT GCTACGCCAA ACCGCCGCGT GGGCTCCAGC
GGCGGCGTCG TCGTCATCGT CCGCACCACA AACGACCGCC ACCACTGCAA CCACAACACA
TCAAAACAGG CCAGTCACAC TTCGTCTCGC CAGTGCCTAT CTTAATCCAA CACATTCGTT
TCTGGAATCC ACGAGGAATT TAAACGTTTT CTTTTTGACG GCCGGAAAAC TCTCGCACGG
CTTTCGTCCC AAAAAGGTGA CGGGTCACGT TTCCAAAACG GCCTGGATCC CTACCGTCTT
TGCAACGCTA GTGGCATCCT ATCCTCCGTG GGTAAAGACC TGGTGGTACC AACGCGAAAG
CTGGACCTTT CACGCCAAGG GATTATGGTT AACAACCACC GCCGAAACGG TGCCGGAATC
CACGACGACG ACGTCCAAGA CCAACGTTCC TACACTGACG AAATCCCAGC TGCGCATTCC
CGAGACGGAC GAGCTTTTGG TCGTCTCCCA CGGATCCGGC AATTACGGGT ATCGATCGGA
ACAACGAGAT ATGGAAAGCA ACTTGCTCTT GGTTTTCCCC TCACCTACTG ATGGACAGGA
AAGCAACAAT CCATGGGCTC AGCAGCATAT TGACGAATGG AACGAATTTG TACCCTCGGC
GGTGCCAGCT TGTTTGGAAG ATACCGACCC ACTGCCAAAG CCAGTGCAGT GGGTATTGCC
ATACATCAAG TCGTTTTTTT GA
 
Protein sequence
MTSRRVTATL RQQLRQAARG DTQSTPNGSP PRRLWKCFPL TAQHVNVPTK RHPDDTDDER 
NTSAHPRWLR TPSEFHQTLC DRIERARRRV HLASLYIGPA CCHASIPEST SVSCWINTER
SVPCRCLRNG HPHRSSATIS SAEACRRALA QRPKLPDESH NGTADTEHNK SQIHLLSVLG
PWLSRLPNPY NEIAGVFHVK LYVVDDAVLL SGANLSQEYF ADRHDRYVCI YNGGNGLVDT
YVDLIQALSE FGSQRYEGID ENGVAQLTNV PDRQRLFRAI RDVLTIEADT AGIDHEPDPD
VIAYAVPTFQ APPGYFTATC DATELATMPT DLQTIHDLLR QTAAWAPAAA SSSSSAPQTT
ATTATTTHQN RPVTLRLASA YLNPTHSFLE STRNLNVFFL TAGKLSHGFR PKKVTGHVSK
TAWIPTVFAT LVASYPPWVK TWWYQRESWT FHAKGLWLTT TAETVPESTT TTSKTNVPTL
TKSQLRIPET DELLVVSHGS GNYGYRSEQR DMESNLLLVF PSPTDGQESN NPWAQQHIDE
WNEFVPSAVP ACLEDTDPLP KPVQWVLPYI KSFF