Gene PHATRDRAFT_49641 details


Gene Information

Locus tag: PHATRDRAFT_49641
Symbol:
ID: 7198273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 282064
End bp: 284495
Gene Length: 2432 bp
Protein Length: 779 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184436
Protein GI: 219128471
COG category:
COG ID:
TIGRFAM ID:
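
The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length). The helper names `span_length` and `coding_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 282064
END_BP = 284495
PROTEIN_LENGTH_AA = 779


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2432, matching the listed gene length
print(cds_len)             # 2340
print(gene_len - cds_len)  # 92 bases of the span not covered by 779 codons plus a stop
```

The difference between the two lengths only shows that the listed span is longer than the coding region implied by the protein length. The record itself does not say what the remaining bases are.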


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTCGT CGCCGCCCAC GGGGGTGGAC GACCACACTG GGCAAGCCTT GCCCCGTCGT 
CGACGACGAC GACGACGACG ACGACGACGC CGGACCGTTT TCTACACGAG TAGTAATACT
CCCTACCGTA TCCTCCTACT CTGTGGACTA GTGGCATGCG TACACCACGG ACTAGTGTCC
CAGTGGGTGG GCACCCACGC GTGGGTCGCG GTCCATACGG TCCTCCACCC ACGACGGTAC
GGGGTACCGT ACGGACGCGA CTGGTCAGGG CAGTGCACGG CACATCCCCC CGGACAATTC
CCTCCACCAA TATTGGCGGA CGACGAAGAC CCCGACCATG CCTCACTCGC GTCGTCGCGA
TCCAACACCA ACCAGAACGA AAGTGCTGGA TGGAAAGAAA CCCAAAGAAA AGTCGAACAG
CAACAGCAAC AGATTGATCT CTTGCTCCAA CTTGTGCAGC AACAACAATC ACAACAATCA
CAGCCATCAC AACAATACTC CCCCCGTGCA CCCGCACCCG TTTCCAGCCA CGAGGTCACC
GCAACCCGGG CACCAGAGCG AGACCCAGAG GCGCTTTCCC CGTCGACCAA CGGCAACTAT
TCCAGATCCG TTCCGTCGAC GGACAGCATC GCACCGGCTA CTGCCAACAC GGGTGTCGTG
CCTCTCAAAG CCATGCTCTT TATTGACGGA ACGTGGTTGT ACTACAGTAT CTACGAACGC
ACCGAAGCCC GCTGTCCCAT TATCCAACGC TACGGTCGTG GTTGGCAGAA TCGGTACGAT
TTCCATTGGG CCGCCCTGCC ACGGATTCTC TGCGAAAGCT TGCGGGATCC CGGATGGAGT
ACCAACACGG CCGCTCCCGC ACACACGACA ACCCACGAGT CTGCCACCAA AAACGCCCGT
CCCATGGAAA TTGTCCGCGC CAGCGTCTTT ACGTCCTACA AGGCCGACAC ACCCACGTCC
TCCTTTCGGT ACCAAATGTT TCAAGACATG CAGGCCGCCA ATTACGACGT CCACATGATG
GAAACGGTCG GCCGCGGCGA AAAATGCGTC GACATACAGT TGGCCGTGGA AATGATGCAC
TACGCCACCG TACCCAACGC CTACGACGTC GCCTTGTTGC TCACGGGCGA CAAGGATTTC
ATGCCCGCCA TGATCCGGAC CCGCCAAAAA GCCCGCAAAG TCGGTCTCGT ATCCATGAAA
ACCGGCTGCA ATCGGGCTCT GTACGAAACA CCCGGATTGA AGGATTACGA CGTCGTGTGG
CTGGAAGACC ATTTGGACGA ACTCATCGTA CCGAAACGGG GAAAAGTTAA CGGCTCCAAC
CATGTGGAAG CAGTGGTTTC CGTCTTTACG CTCATGAAAA TTTTGTACGA TTTTGTCACC
GAATCGGGCC TGGAACGAGT GACGAGCCGG GATATCGGAC GGTATCTCAA GATTTTGAAA
CTTGGGTCGC GGTCGGTTTT AGAGGAATTG AAATTGTCCT ACGGTGGACT CCGACAATTT
TTGACCATGT CGGGTGTCTT TGTCATTGAA ACAAGGGACG ATCATTACCA AAAGGAAGAT
CCTAGTGACA AGGCGTACTG GGTACGAGTA CGACTACCCG AAGCCACGGT CGCATTGACC
GAACGGGCAC GGAGTACTCG TTTGAGTGCT GCCGAAAAAG ACTTTCTCGA AACGTACTCG
CTGACTATTC TTCAAGACAA GGCCACAGCC TATTATCACT CTTTGCTGCT TTTGGATACT
CTGCCCGATG CACCCAGTGT CTCCCGAGAT GCGGCCAACG CTTTGCGTGC AGACGGTGTG
GAGCTGCCGG ACGATCTGAC TCGCGATTAT AGCCTTTGCA AGGTTGCCGA GCTCAAAGAC
TGTTGTCGTG CTCGTGGTTT ACCAATTGGT GGAACCAAAG CAGTGCTAGT GGACCGTATT
CGAAGTGATG TTGAGCAAGA AATTGCACGC TTACAAACAG CAGCGCACTC ATCACCCCGT
AAGTACCAGC ATTTGAATAT GCCTCACTTA TTGTCCGACC CCAGTGAAGA AACGGGCACT
ACTGTCTCCG ATGAGACGGA TACCTATCTT AAGGAACTCG TCTTTGAGTA TCTGAGAGCC
AGTCACGGCC AGGCTAGTTC TCGTAACGTT GGACGGTATC TCGCCAGTAA CAAGTCTTCG
ACTGGAGAAT ACAAGAAAGG TCGGCAGTCG GCACTACACG AGCTGAAAGC ACACTACGGA
GGCCTGGCGA GTTTTGTAGG CCACCACGAT AAGCTTTTTG AACGACAGGA TACGTTAGGA
TCAGACAACG ATCCAGCCTC AACCTATGAA TTTGGGGTCG GACTTCGAAA AGGAGCATGA
TCCAGAATAG ATTACTTTTC TGTGCCGGCC TTACCTACTT TTTGACTGCG TGACGGACTG
CCCCAGACAA CTTCCACTCG TCGTGGTTGG GT
 
Protein sequence
MPSSPPTGVD DHTGQALPRR RRRRRRRRRR RTVFYTSSNT PYRILLLCGL VACVHHGLVS 
QWVGTHAWVA VHTVLHPRRY GVPYGRDWSG QCTAHPPGQF PPPILADDED PDHASLASSR
SNTNQNESAG WKETQRKVEQ QQQQIDLLLQ LVQQQQSQQS QPSQQYSPRA PAPVSSHEVT
ATRAPERDPE ALSPSTNGNY SRSVPSTDSI APATANTGVV PLKAMLFIDG TWLYYSIYER
TEARCPIIQR YGRGWQNRYD FHWAALPRIL CESLRDPGWS TNTAAPAHTT THESATKNAR
PMEIVRASVF TSYKADTPTS SFRYQMFQDM QAANYDVHMM ETVGRGEKCV DIQLAVEMMH
YATVPNAYDV ALLLTGDKDF MPAMIRTRQK ARKVGLVSMK TGCNRALYET PGLKDYDVVW
LEDHLDELIV PKRGKVNGSN HVEAVVSVFT LMKILYDFVT ESGLERVTSR DIGRYLKILK
LGSRSVLEEL KLSYGGLRQF LTMSGVFVIE TRDDHYQKED PSDKAYWVRV RLPEATVALT
ERARSTRLSA AEKDFLETYS LTILQDKATA YYHSLLLLDT LPDAPSVSRD AANALRADGV
ELPDDLTRDY SLCKVAELKD CCRARGLPIG GTKAVLVDRI RSDVEQEIAR LQTAAHSSPR
KYQHLNMPHL LSDPSEETGT TVSDETDTYL KELVFEYLRA SHGQASSRNV GRYLASNKSS
TGEYKKGRQS ALHELKAHYG GLASFVGHHD KLFERQDTLG SDNDPASTYE FGVGLRKGA
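
The sequences above are printed in space-separated groups of ten, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the first line of the gene sequence, so its GC fraction need not match the value reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short excerpt.
excerpt = "ATGCCGTCGT CGCCGCCCAC GGGGGTGGAC GACCACACTG GGCAAGCCTT GCCCCGTCGT"

seq = clean_sequence(excerpt)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of the excerpt only
```

Pasting the complete gene sequence block into `clean_sequence` instead of the excerpt would allow the listed gene length and GC content to be checked in the same way.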