Gene PHATRDRAFT_38213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38213 
Symbol 
ID7203101 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp345907 
End bp348273 
Gene Length2367 bp 
Protein Length479 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182203 
Protein GI219123795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0083412 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACA TTAGCGATAC GGCATCCGAT GACCAGAAGG ACACGGAGTA TGTGAGTAAC 
AGTGGGACCA TCAAGTTCAA AGGAAATGAG GATGAATGGC GTACCTGGAA AGGGAAGACG
ATTGCGACGG CAATGCAACA TCGCTTCTAC CAAGCGTTAT TCATGCAGGA AAGCCTTGCA
ACCATGGAAG AGGTCAATGC AGGGACTGCA AGCAAGTCGA CGCGGAAAGC ATTCTTGAGA
AACGTACAGG CGTACGCTCA TTTGGCACTG TGCTGCGAAA AGACCGCATA TTCGTACGTC
GAAAATGCGG TGACAGAACG AGCGCCGATG GGAGATGCCT ACGAAGCGTG GAAGAAACTG
TGTGAAAGGT ACGAACCGAA AGAAATCGAA TCGGACTACA CAAGTATTGA ACATTCATTT
AAGGACTGTA CTATTGGCAG TATAAATGAA AATGTGGAAG AGTGGTTTCT GGAGCTGGAG
TACTGGAATA CCCGTATGGG TAAGATCAAG CAGTCCTACA TGCGCGATGA CCTTCAGATG
AAGGCACATA TAATTGACCA ACTCCCCGAG GCCTATGAGG CAGTGAAAGT CAAGTTGAGT
GGAGCATACA CGACCACCCC AATGGAGACG TTTAAGAGGA TCATTTTAGA CTTCTGGAAG
AGGCACAGTA AGAGTGACAA TAACAAAGTG ATGTCTCATT CTGAAGTGAA AGAAAAGTGT
GGTCACTGTA GGAAAGCAGG GCATACACAA GACAAGTGCT GGAGCAAACC AGAAAATGAG
CACTTGCGCC CCAACGGAAA GCAGGGCGGA AAGAACAAGA AGGACATCGA GTGTTTTAAT
TGTGGAAAGA AAGGACATTA CAAGAATGAA TGTCAAAGTG AGAAGAAAGA CGGAGAAGGC
GGACCAAGCC CAACCAATGA AGGAATGTTT GTTGGGGCCC ATGTTGAGCA AGAGCAGAAT
TTGAACGCCG GTTCAAGCTG GGAACAAGTG CTAGCTGACT CCGGAGCCAG CTGTCATGTA
TGGAATTCTG CTAACGAACT TCAGGACCAA GAGAAGATCT CAGAACAGGT CCAAGTTGGA
AAGAGCGGTA GTTTTGTGAA TGTGACAATG CAGGGTACAG TATACTTGGA AACGAAAGAC
GGAGCGAAGA TAAAGCTACT CAAGGTTAAG TATTCGCCAG ATTTCGAGAA ACGAATCTTG
AGCGTTGGAT GCCTATTGGA CAAGGGCTGT TGTGTCACGG AAATGACCGC GACAAAAGTT
GTTATCGTCT CTGCAGACAA GAAGAGGAGT ATCACGGCAG ATCGGACATC AAGCAGCAAG
TTGTACTACT TCACCTGCAC GGTGTTGAGC GGACCGAAAA AGGAATCGAC TGAGGAGGTA
TTTTCCAGTA CCGAACAGAA AGGGCATGTA TACGCAGTGG AAGAAGCAAA CAAACCCTAT
GGACATACTA TTGCCCTGCC TAAAGCGGGG AGGATCACGA ATACGGCAAA ACCCCTCAAG
GCTATGAGAA CTACGAAACA AGATGTGTTA AGCCCGACGG AAACAAGCAG TCAGTGGAAG
TTTGCCGAGG AAGAACGCTT GATGCAGGAC AAGTATAATA CCAAAGAAGA AGAAGAAGCC
AAAATGTCTG AGGAGCCGGA GCCGCACAAA GACAAGGGAT TTAACGAAAT GACCGGAAAT
CCGGACCCGC GAGAAGATAT TCTGCAGGAA TATCACGATG TGCCCATCAG CAAAAACAGA
AACGGATGGA ACGCGATGGA AAAGCCGAGT ATCCAGGACC GAAAGGGTTT AGGAGGGCTT
ATCAAAAGGA ACAGCGTACC CACCAGAGTT GGTAACCGCA AGAGTTTCAA ATGGAAACAA
GCAATTTTTG ACATCATGGT AAACCTGGGA ATTGCTTTGG GCTTTATCTA TTGGCATGCA
ATGAAGGATT TAGACAAATT CAGAAAGTTA CTTGGAGTAC ACTGCATGGC TGTAAACAAC
GAGATTGGGC AATACATTGG AGTCAAAACA ACAGAGTATG AGAAAGAATT CATCCGGGAC
CACGAACTAG AGATCTGGGA GGAAGCCAAG ATTTTTAGCA TTCCTGGATA CCACGAAGAA
TTTCACAAGG GAAATGAAGG ACAATCAGTA TCGAAGAAAC AGTCCAGTTT ACTTATTGGA
AAAATTGTAT TTTTGGTAAG CAAATGGATC ACGGACTGCA TGACTGCACT CCGCGAGCTC
TCCACGTTGC TGGACACAGT AAAGTGGAAA ATGAAATCAT CGAACAGGGA GGATGTCAAG
TTTGGCAGTC AAGTGGGACA TGTCCGGCAG AATGCGACAT GTCACACAAA GAGATTTGAA
AAAGAGCGAG CTCTCATCTC GAGCTAA
 
Protein sequence
MDDISDTASD DQKDTEYVSN SGTIKFKGNE DEWRTWKGKT IATAMQHRFY QALFMQESLA 
TMEEVNAGTA SKSTRKAFLR NVQAYAHLAL CCEKTAYSYV ENAVTERAPM GDAYEAWKKL
CESINENVEE WFLELEYWNT RMGKIKQSYM RDDLQMKAHI IDQLPEAYEA VKVKLSGAYT
TTPMETFKRI ILDFWKRHNV LSPTETSSQW KFAEEERLMQ DKYNTKEEEE AKMSEEPEPH
KDKGFNEMTG NPDPREDILQ EYHDVPISKN RNGWNAMEKP SIQDRKGLGG LIKRNSVPTR
VGNRKSFKWK QAIFDIMVNL GIALGFIYWH AMKDLDKFRK LLGVHCMAVN NEIGQYIGVK
TTEYEKEFIR DHELEIWEEA KIFSIPGYHE EFHKGNEGQS VSKKQSSLLI GKIVFLVSKW
ITDCMTALRE LSTLLDTVKW KMKSSNREDV KFGSQVGHVR QNATCHTKRF EKERALISS