Gene PHATRDRAFT_43550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43550 
Symbol 
ID7197581 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp791838 
End bp793912 
Gene Length2075 bp 
Protein Length563 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178004 
Protein GI219112505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCATTTACA ATCCTTGTGC ATGTACGTAT CCCCGCTAAG CCCGTAGTGA TTTATGAGAA 
AACCCGCGCT GCTGTTGTTG CATTCACCAG CATCGTTGAA TTGGTTTTCC GGAACCTTTG
TTTCCGCCAG AGGAGTTCGC CCTGAGCGTG TCGCTCCTAG ATATCCAATC TGGACCAAGT
CTCTGCATCG ACACGGCATG TCTAGCTATA GCGGCGGTGA TCGAGGAGGC CGGGGACGAG
GCGGTGGAGG TCGTGGTGCC TACTATAAGA ATAAATACGG CGGTGGCGGA CGAAACAGTG
GCGACGCTCG CGATCGAGGT CCTCTGGGTG GAAACGGTAA CCTGGGAGAC AATCACCGCG
CACGGACTAG TACCAACGGT GGAACCTTCC AAGATTTAAA GCAACTGTTA CAACACATCG
ACGGTCGTCA GTATCCCGCC TATCATGATT TAGAAACTGC TCCAAATACG GGATGGGTTC
ATCCCGAGGG ATTTGTCTTA CAAGTCGGAC GTGCTCAGGC AGATCCGTTC GCCCCGCCTA
CCCGGTGTCG CGTCACCCTT CCACCTTCCG TCTCGCGCAT TTCAAACTCT TTCTATACAA
ACGCTACGCG GCGCATGGCG ACTGGTGATT TCTTGCTGCG GCGCTTGTAT GGCAACTGCA
AACGTGTGGG AGCCGATCAT AGCTTGCGTA GCAGTAGTGG TGGTAAGGGT GGATGGAGTG
GACCCAAAGG AGGCGACGTG CAAGTGCTGG AGCCTACACA GAATGTTATC GAGCAGTCAG
CGGTTCAAGT TGACGAACAA GGTAACATTC TATGCCAAAT TACCATCAAT CTTCCCGCAA
AGGGGAGATC AATTATGGGT CACGCCGCAC ACGAAATCAT GGACGCTGTG CTACCGCAAC
TGATCAGTGA TAGTCTGATG TTCACCTCGA TGAACTTGGA CAGTATACGC ACGCACATCG
AGTCGGTAGA GGATCAAGCC TGGCTGCAAC AACAGCTGGA TACTGCCGGA CTGGTATCTT
TCGTGCGGAA CGGGGCAATT CTGCCCCGTG TCTCCGGAGT TGAAGATCGC CCCATGGCGG
GATCGGTCGT TGCCTTTAAG TCTCCTCCGT CGCTGCAAAA GGAGTTTACC CTCCCCATCA
GCGGCGTGGT AGTCCAGGGC ATGGGAATTC GCAAGGGTGT TACCCTGATC TGCGGTGGTG
GCTTTCACGG CAAATCACCC TCTTACAAGC CATACAAAGC GGAGTCTATC TGAAAGTACC
TGGAGACGGC CGGGAATTTT GTGTGACCAG CAGCCAAGCG GTCAAAATTC GTGCGGAAGA
TGGCCGTGCT GTCCAAGCGG TCGATATTTC TCCGTTCATC AACAATCTGC CGTTTGGTAA
AGGTACATCT TGCTTTACCA CGTCGGATGC GAGCGGAAGC ACAAGCCAGG CAACAAATAT
TGTGGAGGTA AGTGAGCAAA CGTGTGCTTA TGTCGGGAGA GAGACCATGC ACTGGCGAGT
TGTATTTCCG CAGAGCAAAC TTCCATTGCT ACTGAGGAGC GTTTGACCGC CAAGAGAGAT
ATCAGGACCC GACTATTCGT TGTACTGTGC CGCAATTTAT CATCGGTACA ACCGACATTT
CGTCTTTTCC AACACATGCT TATGACTTCT TCTTCATAGT CAATCGAACT AGGTGCAGAC
ACCCTTCTAG TAGACGAAGA TACGTGTGCA ACGAACTTCA TGGTTCGAGA CAATAAAATG
ATGGAGCTTG TTGCTTCCGA CAAAGAACCA ATCACACCGT TTGTGCGTGT TATCAGATCC
TTGTATGAGT CACAGGGGGT TTCTTCGGTT CTAGTTATTG GTGGACTTGG CGATTATTTC
GACGTAGCTG ACCATGTGCT ACTAATGGAT TCGTATGGAT GCCAGGATGT CACAGCACAT
GCGAAAGAGA TTGTTGCTCG AAGCGGATCA GATTCTGCTA AACTGCAGGT AAAATTTGGA
AAAATTCGTC AACGATTCCC GGTACTAGAC ACATTTGCTG CTAATGGAAA GGTTAGAACC
CCGGCAAGGG GGGTTATATC GTACGGCGAC GTTGA
 
Protein sequence
MRKPALLLLH SPASLNWFSG TFVSARGVRP ERVAPRYPIW TKSLHRHGMS SYSGGDRGGR 
GRGGGGRGAY YKNKYGGGGR NSGDARDRGP LGGNGNLGDN HRARTSTNGG TFQDLKQLLQ
HIDGRQYPAY HDLETAPNTG WVHPEGFVLQ VGRAQADPFA PPTRCRVTLP PSVSRISNSF
YTNATRRMAT GDFLLRRLYG NCKRVGADHS LRSSSGGKGG WSGPKGGDVQ VLEPTQNVIE
QSAVQVDEQG NILCQITINL PAKGRSIMGH AAHEIMDAVL PQLISDSLMF TSMNLDSIRT
HIESVEDQAW LQQQLDTAGL VSFVRNGAIL PRVSGVEDRP MAGSVVAFKS PPSLQKEFTL
PISGVVVQGM GIRKGVTLIC GGGFHGKSPS YKPYKADQAV KIRAEDGRAV QAVDISPFIN
NLPFGKGTSC FTTSDASGST SQATNIVESI ELGADTLLVD EDTCATNFMV RDNKMMELVA
SDKEPITPFV RVIRSLYESQ GVSSVLVIGG LGDYFDVADH VLLMDSYGCQ DVTAHAKEIV
ARSGSDSAKL QNPGKGGYIV RRR