Gene PHATRDRAFT_38067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38067 
Symbol 
ID7202749 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp796853 
End bp799295 
Gene Length2443 bp 
Protein Length698 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182137 
Protein GI219123654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00714206 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCTA ATGCTAAGAC AGAAGAAATA CGGCAGCATG GAGTCAAAGG CGTATACTAT 
GCTGGAAACT GGTTAGATAT TGAAGACTGC GATGGTGTCT TTCAAAATGT GGAGAAATCC
TGCTCCGGAT GCGACTTTCA GGACGCTTGT CCACGCTCTC GATCGCGTAA AGTTGCCGCA
AAAGATGATG ATACGTATGA TGTCGTAATT ATCGGTGCAG GCTGCATCGG GGCAGCGATT
GCGAGAGAGT TGTCACGTTA CAAAATTAGT GTGCTTTGGG TCGAAGCCGG CGACGATGTC
TCGCAAGGAG CGACAAAAGG TGAGTGTGTG CTTTCTACGA GTCGCTGGAT CATGCTTCGC
TCTCACATGA CAAACGCGCA TGCTATTGCG TAACTTATTT GCAGGAAACT CGGGAATTGT
TCACGCCGGA TATGACGACA AACCTGGTAG TAATCGGGCC AAGTACTGTT GGAAGGGAAA
CCAGATGTTT GCTGCGTTGG ACAAAGAGCT ACGATTTGGC TATCAAACTA ACGGATCACT
TGTTTTGGCT TTTAAGGAAG CCGACAAGAA GGTGCTGAAT AATCTTCTCA AGCGTGGAAA
GACCAACGGC GTCCAAAACT TGAGGATCGT CGAACGGGCC GAGCTACTTC GGATGGAGCC
ACACGTGCAT CCAGACGCCA TTGCGGCCTT GTATTCACCC GATGCAGGAA ATGTTATTCC
GTATGAGGTA CTTAGCAGAT TCCAGTTTTA TGCTTGTGAG TCGCTCTGTT CTTTGAGGTC
ACCTGACTTT CTTCTCCTTG TCGCACATTT CAGTACGCCG TGGCTTTAGC TGAAAACGCA
GTCGACAACG GCGTTGAGCT TCGTATTCGT CGGCAAGTAA TGGACATTCA AAATAAAGAT
AAAGGTCATA TGATGGTGAC TTTGAAATAC TGGGAACCAG AAGACTACGT CAGAGCCATC
GCACAAGCTG GTAAAGCTAC AGTCTTCAAC TTTGCCATGT ATGCCGTAGG TGCTACCACT
GTGGCTCATT TTTTTGTCAC CAAAGGCAGT GCACACAAGC AAAATGAAAA GTATCATTTA
GCGGTTCTGT GCTTCTTGTG GATTCTAAGC AAGCTGGTGC CTTTCATCTT TCCCAACGCT
GCAACATCCA AGGTTGATCG CAGTATTCCC CTGTGTCATT TAGTGGACCA GGCTAGTCCT
CCTGTTGGTA CAGGAGGCGG CCATCCTGTT TCAGTGCCGG ACATGCTGGT GGGAGGATCT
GGAGGTCCTC GTCCTATGCA AGGCAAGATT GTATCGACGG AAAAAATTAA AACAAAGTTT
GTCGTCAACT GTGCAGGCGG CGCCGCCGAC GAGATTGCTC GCTTGGTGGG TGACGATTCA
TTCAATATCA AGCCTCGTCT TGGAGACTAT ATTTTACTCA ATCGGAATCA GGTAAGCATA
CTCAAGTATC ACGGGTTCTA GTACGCAACT ATATCGGCGG GACGTGCGCG AGACTCAACC
AACGTTTTTC TCGCTTTTCA AATTCAATGT TCCAGGGCTA CCTGGCTAAG CATACTCTGT
TTCCGTGCCC AGATCCTAAG CTTGGAAAAG GTGTCTTAGT CCAGACAACG CTATGGGGAA
ACTTGATTCT TGGTCCGACG GCTCGTGATG TAGGCAACGA GGAAGCTAGG AAGATGTCTT
CAGCGGCAGT GCAGGAGTAC ATCCTTGCTA AATGCAAACA GCTCGTTCCT GGTTTTGACC
CTCGCGAAAC ATTTCATGCG TTTTGCGGAG CACGTGCGAA ATCGGATCGT GGCGACTGGA
TAATTGAGCA TTCCAAGAAC GATGCCCGCA TGATTCACGT TGCTGGAATC GATTCGCCTG
GATTGGCTGG CTCTCCAGCA AGTACGTGTA ATTGCTGCAA CTCCAAAATG CTTGGCATTG
CAGCTTGAAG CTGACACAAA ATATACTCAT TTTTACTATT TTAGTTGCTC TCGACGTGAT
TGAAATGCTG CGTAAGGCGG GCCTCACGAC AGAAACAAAT CAGAGCTTCA ATCCTAATAG
AGCGCCGATC GTCATCCCCA AAGTTGGGAT GAAAGGGCTG AAAATGGGAC CCGTCGGCAA
GTTCGACAGC GATGGTAGCA ATATGGAGCA AATGGCTGCG AATGTAGTTT GCAAGTGCGA
AAAGGTTACA GAGCTAGAAA TCGTTCGAGC GATTCGTCGT TCCCTGCCAA TTGATTCGTC
GCAAGGAATT AGGAAGAGGA CTCGGGCTGG TATGGGTCAT TGTCAGGGCG ACCCTGAAAA
CTACAACTGC GAAGCTCGTG TACGAGCTAT CATCGCGCGA GAAAACGGTG TGCCCATTGA
ACATGTGGGA GGCCGTCCAT GGCCCGCCAC GTCAACGCTC TCCCAACGCT GGATCAATGA
AAAGGAAAAA CAACATCTCG TGGACTGCAT GAATGTAGAG TAA
 
Protein sequence
MTPNAKTEEI RQHGVKGVYY AGNWLDIEDC DGVFQNVEKS CSGCDFQDAC PRSRSRKVAA 
KDDDTYDVVI IGAGCIGAAI ARELSRYKIS VLWVEAGDDV SQGATKGNSG IVHAGYDDKP
GSNRAKYCWK GNQMFAALDK ELRFGYQTNG SLVLAFKEAD KKVLNNLLKR GKTNGVQNLR
IVERAELLRM EPHVHPDAIA ALYSPDAGNV IPYEVLSRFQ FYASENAVDN GVELRIRRQV
MDIQNKDKGH MMVTLKYWEP EDYVRAIAQA GKATVFNFAM YAVGATTVAH FFVTKGSAHK
QNEKYHLAVL CFLWILSKLV PFIFPNAATS KVDRSIPLCH LVDQASPPVG TGGGHPVSVP
DMLVGGSGGP RPMQGKIVST EKIKTKFVVN CAGGAADEIA RLVGDDSFNI KPRLGDYILL
NRNQGYLAKH TLFPCPDPKL GKGVLVQTTL WGNLILGPTA RDVGNEEARK MSSAAVQEYI
LAKCKQLVPG FDPRETFHAF CGARAKSDRG DWIIEHSKND ARMIHVAGID SPGLAGSPAI
ALDVIEMLRK AGLTTETNQS FNPNRAPIVI PKVGMKGLKM GPVGKFDSDG SNMEQMAANV
VCKCEKVTEL EIVRAIRRSL PIDSSQGIRK RTRAGMGHCQ GDPENYNCEA RVRAIIAREN
GVPIEHVGGR PWPATSTLSQ RWINEKEKQH LVDCMNVE