Gene PHATRDRAFT_38493 details


Gene Information

Locus tag: PHATRDRAFT_38493
Symbol:
ID: 7203470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 238150
End bp: 240015
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182644
Protein GI: 219124718
COG category:
COG ID:
TIGRFAM ID:
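
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 238150
END_BP = 240015
GENE_LENGTH_BP = 1866
PROTEIN_LENGTH_AA = 621


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 240015 - 238150 + 1 = 1866 bp, and 1866 / 3 - 1 = 621 aa.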


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGAGTA GCGACGACGA AGATTTAAAT CTTGCCTTAC GGGCCTCACA AGAAACGCTT 
ATCGTCGAAA GGAATCGTCG ACAACCCGTA CTCCAGCCAG ATGATGACTC CATTAGTACG
GATGGTGAGC AAGCCCTGGT GGTTCTGCCG CCAGCTAAGC GTGCACTCAA GTCCAACCTG
CAAGATACAT CGACCTTTCC GCGGCTGTCC GCTACCGGGG ACAAGCCTCG AGATGCGATC
GTATTGCACG ACGATGAATC CGACTGCTGC GATGACGATG CCGCCAACAT GGAACTATCC
CTCGTGGCAC GACTGCAACG CAAGCACCAA ATAGAATTTG AGGAGGCACC ATCTGCTAAG
AAACCCGTTA CTTTGCCACA AGGTACTAAC GGTGCTCTTG AAGTCGCAAC GGTAGGCCAC
AGTCCGTCTC CAACAAATAC AGCACCTCCA CGAACCGAGT TAAAGCAGCA GAAACACACA
GCTCCAAAAA ACGCTCCGGT TGCTACGTCC ATCTTATTAA ATTCAGACGA TTCGTCTGAC
TCGGATGACT CTATAAAAGC TTTAAAATTA CGATTAGCTG CATCGTCGGA GCGCTCCCGT
CCACAGTGCT CACAGGATAT CGACTGTTCC GGGTCAGCGT CCCCATCTCC ACCAGCGCGC
AAATCCACTA ACGTCTCCAA GCGTAAAGAA CGTCCGGCGG TCGAAACGAA GCGCGTCCAA
GAGATCAGTA AGCGCCAGAA GGAGATTGAA AGGGAGCGTT CTCGAGAAGA GAAAGCCCGG
TGTCAACAAT TGAAGGCGGC TGAACGTCAA CGGCTCAAAG TGGTAAAGCA AACTTCGGTA
GCTCTAGAAA AAGCTCGCAA ACAGAAACAG CGTCAAGCAT CGGATCAAGC ACGCGGTAAA
TTTTGCCACA AAGAAATTGC TGTTTTGATT GAGCAAGATT GGGTCCAGCA ACCTGTCTGG
AAGGAAGCGA TCACTGACGG CTTAGTGGCA TCGGATGATC ACCCCTACAT TTGGCATGAG
TACGCCACCT TGCTTGGTTG TCCCACTGTC CAATGGATTC GCAAAGATTA CTTACTAGGC
GGTGCTACTG ACGCTTGGCA GCAACTTCGC AAAGGGAATC ACGCAGGCTA TCACCATATC
CCACTCCTAT GTGTTATTGT TGAACCTGAC ATTTTCTTGA AGCTGTTGCA TCGAGACAAC
AGCGAAGATG ACGATTATCC CGAACTTGAG AACTGGTTGA AAGGAATACA GGCTGGCTGG
AAAGGAGCTT GGAGCCATCA ACAAAAAGGC ACCCCGCGAA TTATTATTCT ATTGTACAAG
GTCAGAGAAA CGCTGGACCG CTTGTGGGTT AAATACAAGC GAGAAAGCCG TGGGCGTCGA
GTGAGCTCTT CACCGCAACC ACCAACGGCC GAAGAATTGC ATGACGCGCT AATCTGGATG
ATGATTGACT TTCAAGTAGA GTGCATTCAT TGTTCATCGT CAGAACAGAC TGTACACGAG
TTGAGCAAGA TGACTCGCCT CTTGTCGGAA AAGCCGTATC AGAAGCACGT AACGGAGCTT
GATTGTGTTC GCAAGCTCAA ACCACGAGTG GACGAGAATT CCACCCTCCA AGAACGAGCT
GAGGATTGCT GGTTTCGACA GTTACAACAA ATCCCACGAA TAAGCCTTAC GGTGGCTCGT
GAATTTACGC AGCACTATCC GACTGCTCGA TCGTTATGGA TTGCGTACCA GAATCCCGCA
CTTTCGGAAG AGCAGAAAAG GGTTCTCTGT AAAAATTGCT TCTCGCAGAA GGCCTCGCAC
GCCAAACTTT CAACTTGGAT GTACAAGACA ATGACGGGAA ATGACCCCAA TGATTTACTA
CGATAA
 
Protein sequence
MWSSDDEDLN LALRASQETL IVERNRRQPV LQPDDDSIST DGEQALVVLP PAKRALKSNL 
QDTSTFPRLS ATGDKPRDAI VLHDDESDCC DDDAANMELS LVARLQRKHQ IEFEEAPSAK
KPVTLPQGTN GALEVATVGH SPSPTNTAPP RTELKQQKHT APKNAPVATS ILLNSDDSSD
SDDSIKALKL RLAASSERSR PQCSQDIDCS GSASPSPPAR KSTNVSKRKE RPAVETKRVQ
EISKRQKEIE RERSREEKAR CQQLKAAERQ RLKVVKQTSV ALEKARKQKQ RQASDQARGK
FCHKEIAVLI EQDWVQQPVW KEAITDGLVA SDDHPYIWHE YATLLGCPTV QWIRKDYLLG
GATDAWQQLR KGNHAGYHHI PLLCVIVEPD IFLKLLHRDN SEDDDYPELE NWLKGIQAGW
KGAWSHQQKG TPRIIILLYK VRETLDRLWV KYKRESRGRR VSSSPQPPTA EELHDALIWM
MIDFQVECIH CSSSEQTVHE LSKMTRLLSE KPYQKHVTEL DCVRKLKPRV DENSTLQERA
EDCWFRQLQQ IPRISLTVAR EFTQHYPTAR SLWIAYQNPA LSEEQKRVLC KNCFSQKASH
AKLSTWMYKT MTGNDPNDLL R
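
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal example. Because the Translation table field of this record is empty, it assumes the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical and written only for illustration.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTGGAGTA GCGACGACGA AGATTTAAAT"))  # MWSSDDEDLN
    # Last six bases of the gene sequence above.
    print(translate("CGATAA"))  # R*
```

The first thirty bases translate to MWSSDDEDLN, which matches the start of the protein sequence, and the final six bases give R followed by a stop codon, which matches its last residue.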