Gene PHATRDRAFT_44836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44836 
Symbol 
ID7199554 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp369885 
End bp372220 
Gene Length2336 bp 
Protein Length494 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178989 
Protein GI219116388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0506241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAGCC TATTAGTTTT GGTAATGTTC GCGCAGCTCT TCTGGTTCCA ACTATTTTGA 
ACGCTGAAAC CGTCGATTCC ATTCCAACGG TAGCACCTTT CACACGCTCA CTGTCTATGG
AAGGCACAGA TACTTCCTTT AGAGATGATA CATCGGCACA GACGGGGAAT GGGTCCGCGT
TCCCGTTGCC TCTCCCACCT CCTAGACAAA CTGCTGAACC CAGTACCAGA AATTCTTCTG
TTGGGGAAGA AGACAGCTCA CGTCCTGATT GGCTTCGAGA TTCTCCTGAG CAGCTTCCTT
TGGGGAGCGA AACGGTACAA AGAGAAGCTT CGAACGCCAG TGGCAGAAGT GCCGGCAGTA
CAGGGAATCA GATAGTCCAG AGGACGTCCT CACAACTACA AGTGGTAATG TCATATGGCG
CATAGCATCA GAATATCCCC TTACTCTACC ACCGAACTCA CTGAAGTTTT TTTAATTGTC
GAAAAGCTTT CCAGCAGCAT AGCTCGCGGT ACTAACAAGG CCTTTGAGGC CTTGTTCGGT
GATGCAAAAC CTCCGTTTGT CAGTCGAAGG CGTCTTGCTG ATGGGGAAGC TGCGAGGTAC
ATCGTTTCCC GGACACTTCT TCCCGCTTCC GTTATCTTCA GCCAGCCCAC AAAAATGGTG
CGTTCTCCAG CGCTCGTGTT GGATGCTCTA TCGTAGCTTC GTCTCGGGTT GCTAATACGT
AGTTCGTTGT TCTTTTAAAA GTGGATCGCA ACTCTGCAGA CGAATCAAAA GGCTCTGGAT
AGTAACGATG TCATGGAGGC GTCTAAATCT TTGCGGGCTT TTAGTTTGCC GTCCGAACGC
CAGGCAAAAT GTTTGGCCCA AGCTTGGACG CCTCCGCGGA TGGAGCCGTT CGCCAACCAC
CCACTGTGCA ATACCTGCCA GTCCAAGTTT GCTGTCTTCC GTCGAGCCTG TCACTGCCGA
AATTGTGGTG TTTGCGTCTG CAAAGACTGT ACCGTGACTT GGCCGGCAAA GATGGTACCG
GAGACTTACA ACATAAAGAA AACAGCGACA GTAAACATAT GCAAGGCCTG TGATTGGCTT
TGCAACAGCT TTCGTCTTGC TCTTTTGGAT GGTGACCAGG ACAAGGCAGT TGCTCTTTAC
GCTACTGGGA ACATCAACAT CGTGTGCCCT TTTGGAAACG TACGTAGTCA GCCTACGGAA
TCCGGCTGCA TTGCAGTGCC ACCACTCATT TTCATTTCTT CTAGGTCAAA GGAGAGTTGT
TCTACCCTGT TCACGCATGC GTTCTCGGTG AGTCACTTTC GATCTTGCGA TGGTTAGTCG
ACGAGAATTG CTGCCCCATA AAATCCGTTC GAGTCAATGG AAGGACGAAG GATGGAATCT
GTAACTATAC GGCAATTGTG ACATCAAAGG GCCGGTCGTT GCTGGGAATC GCAATGGAAA
ACAACTTGAT ACCAATCGTT CGATATCTGG TTGTAGAGAA GGGCCTCTCT TTGGCAGAGG
AGAAGTCCCT AACCCGCGAA ACGCTTGTCC AAAACCTTCA GCTTGCTCTA AGGGCCATAC
CAAACGCCAC AACATCAACC GAGGCCATCG AAATGGATGT GTCGGAAGCA TTGTATCACG
ACGCCACAGT CAATGACAGC GCCGAGGCGG ATTTGGGGGA CACGACGAGC CCGCTTTCTT
CAGAGTATCA AAACGTTGTA CCAGTGCCTA TTCCTGACCG CGAACAGGGT AGCGAGAGAG
ATTTGCCAGG TGGGCGTACT CTAAGTGAAG AAGCTCGCAA TTTCGGAGCA ATTAGCCGAC
CCGGTAGAGG TTCTTTTTCG TACGATGGTC GGCAGGATGA AAATGAATGT AGGTGTACTT
ACTGAAACCC GTCTTAGCCT CAGAAGATTC TCATTTATCC AACTTGTTTG TCCACAGGTA
TCATTTGCTT TGACGCCAAT ATCGAGTACG TCCTTCACTC TCGCCTGAAT ATGTTCGTCT
CTACCTAGCG AGACTTCGTT GCTCACCTTT AATTGTTACT CTTCGCTCTG AAGTTGCGTG
GCTACCCCTT GCGGTCATCA AGTCTGTTGT CTGGACTGCA GCGAGCACTT GTCCCGTTGT
CCAGTTTGCG CCATGCCGAC CTCCTTCATG CGAGTATTTA AAGTATGAAA AGAAGAAATT
AAGACCGAAG GTACGCTTGC ACTTCGTGTT GTTGGCCATG CCATACCGAA ACGACCCCAT
CAGCAAAAAC ACTTAGTTCA GAAGGATCTG TTGCGTATAG TTTTGTCACA CGTAAGGCTT
TTATGTAGAT TTTTATGGTA GTGTATGAGC TTGTCTTTTG GTCTAGTAGT AAGTAG
 
Protein sequence
MEGTDTSFRD DTSAQTGNGS AFPLPLPPPR QTAEPSTRNS SVGEEDSSRP DWLRDSPEQL 
PLGSETVQRE ASNASGRSAG STGNQIVQRT SSQLQVLSSS IARGTNKAFE ALFGDAKPPF
VSRRRLADGE AARYIVSRTL LPASVIFSQP TKMTNQKALD SNDVMEASKS LRAFSLPSER
QAKCLAQAWT PPRMEPFANH PLFRLALLDG DQDKAVALYA TGNINIVCPF GNVKGELFYP
VHACVLGESL SILRWLVDEN CCPIKSVRVN GRTKDGICNY TAIVTSKGRS LLGIAMENNL
IPIVRYLVVE KGLSLAEEKS LTRETLVQNL QLALRAIPNA TTSTEAIEMD VSEALYHDAT
VNDSAEADLG DTTSPLSSEY QNVVPVPIPD REQGSERDLP GGRTLSEEAR NFGAISRPGR
GSFSYDGRQD ENEFAWLPLA VIKSVVWTAA STCPVVQFAP CRPPSCEYLK YEKKKLRPKI
FMVVYELVFW SSSK