Gene PHATRDRAFT_34138 details

Gene Information

Locus tag: PHATRDRAFT_34138
Symbol:
ID: 7197648
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 1188144
End bp: 1190307
Gene Length: 2164 bp
Protein Length: 647 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002178381
Protein GI: 219115171
COG category:
COG ID:
TIGRFAM ID:
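
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon, so any remainder is intronic (the record marks the gene as spliced). The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 1188144
end_bp = 1190307
protein_length_aa = 647

# Inclusive coordinates: both endpoints count toward the length.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span includes the start codon and one stop codon,
# so the coding portion is (residues + 1 stop codon) * 3 nucleotides.
coding_bp = (protein_length_aa + 1) * 3

# Whatever is left over is not translated (introns, under the assumption above).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2164
print(coding_bp)       # 1944
print(non_coding_bp)   # 220
```

The computed gene length matches the reported 2164 bp, and the 220 bp difference is consistent with a spliced gene.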


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGGAC GGTACAAGAT ACGAATGTGG AACCCTGAAT CTTTCATCGT CACGCGGGTC 
AGAATAGACT CGTGAAAAGT TTCGGTACGT CACATTTCAC TGGGTTTGGT CTGTGCGTGT
GTGTGCGTGC GTTGTGCGAC ACGTCCAGTC CACTGACAGT GTCGGACTCT CACATACATC
AAGTGCAGCC ACCGAAAAGA CAATTCCGGA AACGCCGATC GGAACTCGCC GTGATTGTCC
CGTGTCGTCT GGGGACTAAC AACCGAAACG TTGATGGATG GATGGATGGA TCACAGGAAA
AAACTCCCTC CTCGTCGTTG ACCTTGCTTG ACTACAGAGA TTCCAACTCG GTGTTCCGTG
TAGCAGTAAT GTCTACTCGT CCCCGAGTCC TTGTCTGCAG CAACTTTTGC TTCAGCGTTT
GTCTTTGGTA CGGCATTGGC ACTTGCTGGG GTCGTTTCCT CACCGAAGCA TCGGCATCAG
CATTTCTCCC TTCTCGAACG GAATGGGGGC ACACTTCCAT TCCCACCAAC GCCCCAACCC
GTACTGGCAC ACGAAACCAC AACTCCGTAC ATTCGAGACG GCGGCGTTTC CTGCATTCCG
TCACACCTTT TCAAAACCAC CTTGGTCGTT GGCCTTCCCA CAACGATACC GTGACTGCCC
GTACCGGTAG CCAATCGCTG CGTGCCGCAC GTACCAATAC ACATATCAAC AACACTGTGA
AATACAAAAC CTTCGCTTCC AACGTCATCA ATATCAGCAA CAGTACTAAC AGGAATCTAG
AAATTCTAGT CACTCCCGTC CCATCCTTGG ACAATGGAAA ACTACACACC CGTACTACCG
GATACAACTC CCGACAAAAA GGCCGTCGCC GAAAGGAAAA CCAGCACCGT TGGCTCAACT
GGCTGTACCA TCAGTGGTCA ACTACACCGG TAGGACAACT CGAACACGCG GTGCTCAAAC
AAATGGGACC GGTCATGGCA CACTACGCGA AACGAAAATC GCAACGATCT GCCGACCGCG
CTCACGCCGT CCTGCAGCGC TACATTCAGG AGCATCAGGC GGGCAACGTC CACGCTCCGC
TCCATACGGC ACTATTCAAA CCGCCATGGA TGCCTACGCA CAACTCGGGC AACCGGAACC
AGCACAACAG ATTCTCAAGC AAATGTTGCT GTTGGCGCGA CAATCAACGG CCACGGGAGC
CGTCACGACG CGACACCGTG CTCTTCAACC CGATGCCGTG TCTTTGGCGA CCATCGCCAC
GGCATGGGCC AAATCACACC GCGTCGAAGC CGTACCCAAA ACCCTCGCCT TGTTGACGTA
CATGGAACAG CAGAATCTTG CCACCTCCAA TCACGCTTAC AACATTGCTC TGGCAGCGAT
TGTCGCCAGT CCGGAACACA ACAGTAGTAA CCGTGGGAAC TACCACAAGG CCGCTCCAGC
ACAAGCCATC CTCGAACGCA TGCAACAGCG TGCGGCTCAA GGTTATTCAA ATTGTGCTCC
CGACCTGTAC ACTTACCAAA CGTACATGAC TGCCTTGAGT CATACCCGAC AACGCGACAC
CCCCGATATC GTCATGGCCG TCTTGGAATT CTTGACCACG CAATCCGACC CCACGTCCCC
CGCGACAACA CGAGCCCACT TGGCACCGAA CGCACACTGC TACACCGCCG CGATCCACGC
GTGGGCCTAC AGCAAGGCAC CTCACAAGGC CCGTAACGCC TACGATTTGT TGCAAACGAT
GCGTCGTCGA CACGAAGTCG ATGGTCGTCT GGATTGTCGG CCGAATGTGG TCGTCTTTAC
TGCCGTGCTC AATGCTTGCG TCAAGCCCCT GCCAGACGAT CGAGAAACGG CATTTGGCTT
GGCGCAACTC GTTTACGAAG AATTGCTGCT GTCGGGATAC GGCCCACCGA ACTTTCTCAC
CTACGCTACT TTTTTGCACG CCATACAGAC GTGTTTGGAC GACGACGATC CGCGTCGGGA
CGTTGCCGTA CGCCGTCTCT TTGCGGATTG TCGCGACTTT GGACACGTTG GGCACATCGT
CTTGGATAGA TTGCGGATCG TCGCTTCCAC CTCGTTGTAC GAGGAATTGT TGGAACGTCA
CGAACGTCAC ATCATACCCT GGTCCTGGTC ACGGAATGTC AAGGGGGAAA GAATCCGAAA
GTGA
 
Protein sequence
MDGRPLTVSD SHIHQVQPPK RQFRKRRSEL AVIVPCRLGT NNRNVDGWMD GSQEKTPSSS 
LTLLDYRDSN SVFRVAVMST RPRVLVCSNF CFSVCLWYGI GTCWGRFLTE ASASAFLPSR
TEWGHTSIPT NAPTRTGTRN HNSVHSRRRR FLHSVTPFQN HLGRWPSHND TVTARTGSQS
LRAARTNTHI NNTVKYKTFA SNVINISNST NRNLEILVTP VPSLDNGKLH TRTTGYNSRQ
KGRRRKENQH RWLNWLYHQW STTPVGQLEH AVLKQMGPVM AHYAKRKSQR SADRAHAVLQ
RYIQEHQAGN ILKQMLLLAR QSTATGAVTT RHRALQPDAV SLATIATAWA KSHRVEAVPK
TLALLTYMEQ QNLATSNHAY NIALAAIVAS PEHNSSNRGN YHKAAPAQAI LERMQQRAAQ
GYSNCAPDLY TYQTYMTALS HTRQRDTPDI VMAVLEFLTT QSDPTSPATT RAHLAPNAHC
YTAAIHAWAY SKAPHKARNA YDLLQTMRRR HEVDGRLDCR PNVVVFTAVL NACVKPLPDD
RETAFGLAQL VYEELLLSGY GPPNFLTYAT FLHAIQTCLD DDDPRRDVAV RRLFADCRDF
GHVGHIVLDR LRIVASTSLY EELLERHERH IIPWSWSRNV KGERIRK
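
The sequences above are printed in space-separated groups of ten across several lines, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the gene sequence, so its GC value is not expected to equal the 54% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as a short sample.
sample = "ATGGACGGAC GGTACAAGAT ACGAATGTGG AACCCTGAAT CTTTCATCGT CACGCGGGTC"
dna = clean_sequence(sample)

print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Pasting the full gene sequence block into `clean_sequence` should give a string whose length can be compared against the reported gene length of 2164 bp.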