Gene PHATRDRAFT_38120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38120 
Symbol 
ID7203054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp126871 
End bp128349 
Gene Length1479 bp 
Protein Length492 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182161 
Protein GI219123707 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGT TCAGGAAACG AATATCGCTG TTCACCGCCG GTCTCTGGAT ATTTGCTGTT 
AGTCTGATCT ACTTTGAACG GAAGGAACTT TTTGGGATTC AACAGCCGAC GTTTAACACC
ACAACAGAAG AACGCTTCGA CGAAGCGTAC TTGGAGCGGC AGTTACAGCC CGCCAAATAT
ACAGCCGTCA ACCGTGTAAA AAGCAAAGAC GGTATTTATC CGGCTTCCAA CCCCATCGTC
TTCGAAAAAT ACCGGAAAGC ATCAACAAAT TCCAGTCGGG CTGAGCCACG TACGGGTCGG
TTTGCCTACG CCTTTATAGT TGGAGGCTGC AAGCCCGAAA GTCCATCGTA CCGCCCCTAT
ATTTACAACA TTGCCATATC TACCTATATT CAACGAAAGC GAGGCAGTAC CGCAGACGTG
ATTCTCATGG TACAGATGGC CTACGCCTCA TCACACGAGA CTTTACCGCC TGACGACAAG
GCACTTTTGG AAAGACTCGG TATTCAAGTG CAATACATTC CCAAAACCAA GGACGAAAGC
TTCTATCGAC TCATGCTGGA CAAATTTCGC ATTCTGCGAC TGACTCAGTA CGATCGAGTG
CTGTTTATGG ATTCGGACGC CCTCGCCAGA ACCAACCTGG ACTACTTGTT TATGCTGTCG
TACGAGGGAG TCTTAAAGGA AAACTTCATT TTGGCGGGCC CCACGGAGCC CGCAAACGGC
GGCTTCTTCC TGCTCCAGCC TCGACCGGGC GACTGGGAGC GACTGCTCGA CATTGTCCGC
GTCGCGGAGG AGCGTGGACG CCAAAGTCTA CCCTACCCGC ACTGGGACAC AACACTTGGA
TGGGGACAGG CGATTCCGGC TTTGCAGCCC TACGAAACGC TGGTTTCCGA GAGCCGCACC
AAGGCTACCC GATGGGGGTT CCACGGCGCC TTTGCCGATC AGGGGTTACT GTACCATTGG
GTCAAGTACG AACGCCAATC CGTCTCGATC CTCGTAAATC GGGTTGTGCA AAACTGGGGC
ACGGACGCGG ACGGGCAACC CAAAATGAAG GAAAAGTTAA TTTTCCAAAC CTTGCTGGAC
CGTGTTCCCG AGTCGTCGCA CCACCCCAAC TTTTGCTGGC GCACGTTCGT TCGGGTCCGC
GCTTGTGTAC CACCGCATTC CGACTACGTG CATTTTACCG GCAATCGCAA ACCGTGGAAG
GCGCTCGGAA GGACACAAAA CGCTACTACC GTGTCAAAGA CTCGGTATCG CCGGAGTGTC
ACCACCGCCG GGCCGCGGGA GTACCGGGGT CAGCAAGAGT ACCGAGCGTT CTGGTACGAC
ACGTTGCGGA CGATGGCGAC CGAACTGCAC TGGACCATGC CGGCAGTCGA GAGTATCTCC
TGGAACCGTA CGCACCGTGC CCCACTGGTT AAGTTTCCGC TGCACACGGA CGCGGCAACG
ACGCAGTACG CTGCCAGTGT GTGGTCCGAA CGCTCGTGA
 
Protein sequence
MTQFRKRISL FTAGLWIFAV SLIYFERKEL FGIQQPTFNT TTEERFDEAY LERQLQPAKY 
TAVNRVKSKD GIYPASNPIV FEKYRKASTN SSRAEPRTGR FAYAFIVGGC KPESPSYRPY
IYNIAISTYI QRKRGSTADV ILMVQMAYAS SHETLPPDDK ALLERLGIQV QYIPKTKDES
FYRLMLDKFR ILRLTQYDRV LFMDSDALAR TNLDYLFMLS YEGVLKENFI LAGPTEPANG
GFFLLQPRPG DWERLLDIVR VAEERGRQSL PYPHWDTTLG WGQAIPALQP YETLVSESRT
KATRWGFHGA FADQGLLYHW VKYERQSVSI LVNRVVQNWG TDADGQPKMK EKLIFQTLLD
RVPESSHHPN FCWRTFVRVR ACVPPHSDYV HFTGNRKPWK ALGRTQNATT VSKTRYRRSV
TTAGPREYRG QQEYRAFWYD TLRTMATELH WTMPAVESIS WNRTHRAPLV KFPLHTDAAT
TQYAASVWSE RS