Gene PHATR_33633 details

Gene Information

Locus tag: PHATR_33633
Symbol:
ID: 7204073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1422369
End bp: 1424825
Gene Length: 2457 bp
Protein Length: 818 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002186250
Protein GI: 219113333
COG category:
COG ID:
TIGRFAM ID:
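
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not translated. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1422369
end_bp = 1424825
gene_length_bp = 2457
protein_length_aa = 818

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span matches the reported gene length and the codon count is consistent with the reported protein length.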


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0303682
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATACG CGCTTTCGAT CATCATGGGT CGAGCCATTC CCGACGCACG CGACGGTCTC 
AAGCCCGTAC ATCGCCGAAT ATTGTATGCC ATGGACCAGC TTTCCCTCTA CCCAAATACA
GGACACCGAA AATGCGCGCG TGTTGTTGGG GAAGTCCTGG GCAAGTTTCA TCCGCACGGA
GACATGGCTG TCTACGATGC TTTGGTTCGC TTGGCGCAAC ACTTTAGTAC GGCCTATCCA
TTGATCGACG GACACGGAAA CTTTGGGTCC ATCGACGCGG ACCCCGCTGC TGCTATGCGT
TACACCGAAT GTCGCCTGAC AAAATTGTCG CAAGCGGCTT TACTAGAAGA TTTACAAGAT
GATACGGTAG ACTTTTTACC CAATTTTGAC GGAAACGAAA TAGAACCCGC AGTTTTACCG
GCCAAGCTCC CGATTCTGCT ACTCAACGGT TCGTCGGGCA TTGCGGTTGG CATGGCTACC
AACATTCCAC CGCACAACCT CAACGAGATT ATGACGGCCT GTACCGCCTT GGTAAAGGCG
CGCCAAGGGG GCGCGGCAGT GACGGACAAG AAGCTCTTGC AAATGGTCCC TGGACCTGAC
TTTCCCACCG GGGCGTCGAT TCTTAGTACC GGCGGTACTG AGAAATTGTA CACAACAGGC
AATGGTGGGA TTGTCATGCG GGCAGTGACC AAAATCGAAA AAATTACGAC CGGCCGCAAA
AGTTCCATCA CACGAACGGC CATAATTGTT ACCGAGCTAC CGTACCAGGT CAACAAGGCG
GCATTGTTGG AAAAGATTGC GGGACTAGTC AATGAAAAAA AGCTGGACGG TATTGCGGAT
CTACGCGATG AATCCGATCG TGATGGAATT CGGGTCGTCA TTGAGCTCAA ACGAGACGCC
GTCGCGGCCG TAGTCTTGAA CAACTTGTAC AAAAAGACAC CCTTGCAGAC AACTTTCTCT
GGGAACTTTT TGGCGCTGAT GACGGCGAAT CGTGACAGTA GCAGCAGCCT AGTACCACAA
CGATTCACTC TTCGCCAAGC CATGGACTGC TTTCTTGACT TTCGCTTTGA AACGATTCGT
CGCAAGTCAC AATTCCAATT GACCAAAGTC AACGCCCGTT CCCATATTGT TGCAGGCTTG
TTGATGGCAC TGGACAAGGT CGATATGGTC ATTCAGATTG TTCGTGCTTC TGCGGATCAG
CAGGCTGCTC GAGAAGCTTT ATACATTGAG CTCGGCACTT CATCAGAACA AACTGACGCT
ATTTTAAAAT TGCAACTTGG GCAACTTACT CGATTGAACA AGGGCAAGCT CGAGTCGGAA
AAGGCCGACC TCGAAGAATC TCGTGAAAGT TTAACACATC TCCTGGAAGT TGATGATGCG
GTTTACGATG TCATGAGGGA AGAGTTCATT GACATGATGA AGAGATTTGG CGGTGAACGA
AAGTCAAGCA TCATTGTTGA AGATGACGGC GATTTTTGCG ATATGGACCT TATCGAAAAC
TCGAGATCGG TTATTGTTGT CACTCGTGGT GGCTACATCA AACGCATGCC GTTGAAGACA
TTCGAGAGTC AAGGAAGAGG CACTCGCGGC AAACGCGGCA CTTCCGATGG TGGAGAGTCA
GCTGACAGTG AAGTGGCACA TTGCTTCACG TGCAACGACC ACGACACGCT CTTAATGGTT
ACCCAGAATG GTATTGCCTA CGGGTTACGG GCGTACCAAG TTCCTATCGC GGGCCGTACG
GCAAAGGGAC AGCCCATTCC ATCCGTATTG CCAGTTCGCG GTGACGAAGT TATTACGGCG
ATCCTCCCGG TGTCCGAGTT CTCTGACGAA GAATACGTTG TTCTGACAAC AGAACAAGGC
TGGATTAAAC GAACACCGTT GGATGCTTTC GAAAAATTGT CGAGCCGTGG CTTGACCATC
GCTACTTTGG AAGATGGTGA TCGCTTGAAG TGGTGTCATC GAGTTCGAAA CGAGGATGAC
ATCTTAATCG GCACTGTAGG TGGGATGGCG ACTCGATTCG GAGCTGCCAA GCTACGACCC
ACTGGTCGAA CGAGTCGAGG CGTGAGGGCG ATGAAGCTCC GGGAGGGCGA CACAATTGCA
GATATGAATG TACTCAGTGG CAAGAACAAG GAGTACATTT TAACAGTAAC TGCACAAGGC
TACGGAAAGC GTATTGCGAC GAGCGAGTTC CGGGCCCAGG CTCGTGGCGG AGTTGGTGTA
ATTGCCATTA AGTTTAAAAG AGGGCAGGAG GAGGACAAGG TAAGCTGCCT CCGAATTGTA
AAGGACGACG AGGAAATATT GGTTATTACA GCAAGAGGGA TAATGGTCCG ACAGAAAGCG
TCCGATATTC CGTCACAAGG TCGATCTGCG ACTGGCGTTA TGGTACAGCG CGTGGACGAT
GGAGACCACA TATCTAGTGT GAGCATCGTA CCACAATACG AAGAAATTGA CGGCTAA
 
Protein sequence
MQYALSIIMG RAIPDARDGL KPVHRRILYA MDQLSLYPNT GHRKCARVVG EVLGKFHPHG 
DMAVYDALVR LAQHFSTAYP LIDGHGNFGS IDADPAAAMR YTECRLTKLS QAALLEDLQD
DTVDFLPNFD GNEIEPAVLP AKLPILLLNG SSGIAVGMAT NIPPHNLNEI MTACTALVKA
RQGGAAVTDK KLLQMVPGPD FPTGASILST GGTEKLYTTG NGGIVMRAVT KIEKITTGRK
SSITRTAIIV TELPYQVNKA ALLEKIAGLV NEKKLDGIAD LRDESDRDGI RVVIELKRDA
VAAVVLNNLY KKTPLQTTFS GNFLALMTAN RDSSSSLVPQ RFTLRQAMDC FLDFRFETIR
RKSQFQLTKV NARSHIVAGL LMALDKVDMV IQIVRASADQ QAAREALYIE LGTSSEQTDA
ILKLQLGQLT RLNKGKLESE KADLEESRES LTHLLEVDDA VYDVMREEFI DMMKRFGGER
KSSIIVEDDG DFCDMDLIEN SRSVIVVTRG GYIKRMPLKT FESQGRGTRG KRGTSDGGES
ADSEVAHCFT CNDHDTLLMV TQNGIAYGLR AYQVPIAGRT AKGQPIPSVL PVRGDEVITA
ILPVSEFSDE EYVVLTTEQG WIKRTPLDAF EKLSSRGLTI ATLEDGDRLK WCHRVRNEDD
ILIGTVGGMA TRFGAAKLRP TGRTSRGVRA MKLREGDTIA DMNVLSGKNK EYILTVTAQG
YGKRIATSEF RAQARGGVGV IAIKFKRGQE EDKVSCLRIV KDDEEILVIT ARGIMVRQKA
SDIPSQGRSA TGVMVQRVDD GDHISSVSIV PQYEEIDG
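
The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `normalize_sequence` and `gc_fraction` are hypothetical and are not part of this record or of any particular library.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in an already normalized sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGCAATACG CGCTTTCGAT CATCATGGGT CGAGCCATTC CCGACGCACG CGACGGTCTC"
cleaned = normalize_sequence(first_line)

# Six groups of ten bases, beginning with the ATG start codon.
print(len(cleaned), cleaned[:3])
```

Applying `normalize_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.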