Gene PHATRDRAFT_40200 details


Gene Information

Locus tag: PHATRDRAFT_40200
Symbol:
ID: 7195833
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 391864
End bp: 394177
Gene Length: 2314 bp
Protein Length: 710 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184124
Protein GI: 219127817
COG category:
COG ID:
TIGRFAM ID:
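
The listed gene length agrees with the start and end coordinates if both are read as 1-based and inclusive (End bp - Start bp + 1). A minimal Python sketch of that check follows; the function name `span_length` is illustrative, and the inclusive-coordinate convention is an assumption that happens to match the values in this record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(391864, 394177))  # 2314, matching the listed gene length
```

Because the gene is marked as spliced, this span is the genomic extent of the gene and not the length of the coding sequence alone.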


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00209699
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAACGA ATTTTGCTTT GTCGACGCGC TGCTTTGCTT CTTCATCCGA CAACCATGAC 
GAAGAGGAAC AACGAGACTC TCCGAAACAA AGATCCAAAC GCAGCCAAAC TAATCGGTCC
AAGAAATTCA AAATTGCTGA ATCAATCGAC CAGAGCAAAA TAGATAAGCT AGCACAAGCA
TTCGATGAAC TCGCTCGGAA GGAAGGCTTC GACTCGTCAA CAGCACGCTT TGCCGACGAT
GTGACGTTCG AGGACAAGTT TGACGACGAT TCGTTTCTGG ACGATGACGA TGATAACAAC
AAAGATAAAG TGGGAAACTT GCACCTAGAT GCATCCATGT TCAGTTTAAG TGACTTTATA
GATAAGAGTG AGGAAGATGG CGGCAATCCA ACCGATCAAG ATGACGAGGA CTACCTTGAT
TTTGGTGCAG ACATTGACAT GAGTATAGAA GCAAGGATTG CCGCTGCCAA ACGGGATATG
GATCTCGGTC GAGTCAGCGC CCCTCCCGAT ATGAGATCCT CGCGCAGGGA GGTAACTGCA
GCCGACCTTC GCAAACTTGG ATTTCGAACC GAGGCAAACC CATTCGGCAA CGACGAAACT
CCACGGAAGG AGCGCTTCCA GTTGGTAACA AACTCCATGT CGTGCTCCGC CTGTGGATCG
GACTTTCAAT GCCACAACGA AGATCGGCCC GGATATCTGC CTCCTGAAAA GTTCGCTACG
CAAACAGCAC TTGGAAAAAT AGAACAGATG CAAAAGTTGC AGGATAAAGC AGAAAAAGCG
GAATGGACAC CTGAAGATGA GATTGAATGG TTGATTCAGA CTCAGGGCAA AAAGGATCCG
AACAAAGAAA TGCAGGAGGT GCCCCAGATC GATGTTGATT CTTTGGCAGG GGAAATGGGC
CTTGACCTCG TAGAGCTTTC CAAAAAGATG GTTATTTGCA AGCGCTGTCA CGGTCTGCAA
AACTTTGGAA AAGTGCAAGA TTCCCTCCGA CCTGGGTGGA CGAAGGAGCC ACTGTTGTCG
CAGGAGAAAT TTCGTGAATT GTTAAGGCCA ATCAAGGAAA AGCCGGCAGT TATCGTTGCA
TTGGTCGATC TTTTTGATTT TTCGGGGTCT GTGCTCCCTG AGCTTGATGA AATCGCTGGT
GAAAACCCTG TAATTCTTGC GGCCAACAAG GCGGATCTTC TTCCAAGTGA AATGGGACGC
GTGCGAGCTG AGAGTTGGGT TCGACGCGAG CTCGAATACC TTGGAGTCAA GTCGTTGGCC
GGTATGAGAG GAGCAGTTCG GCTTGTCAGC TGCAAGACTG GAGCTGGGAT TAATGATTTG
CTGGAGAAAG CAAGAGGATT AGCCGAGGAA ATCGACGGCG ACATATACGT CGTCGGGGCT
GCAAATGCAG GAAAAAGTAC GCTTTTGAAT TTTGTTCTAG GTCAGGACAA GGTGAACAGA
TCACCCGGAA AAGCACGAGC AGGCAACAGG AATGCCTTCA AGGGCGCGGT GACGACAAGT
CCACTGCCAG GCACAACGCT TAAGTTCATC AAAGTCGATT TAGGCGGCGG TCGAAGTCTA
TATGACACTC CTGGTCTTCT GGTATTAGGC ACTGTGACAC AGTTACTGAC CCCCGAAGAG
CTGAAGATAG TTGTTCCCAA AAAGTATGTC AAACCGATCA AACTGATATT CGATTCACAG
TCAATAATGT TCAAACTAAC ACCTCGTTCC TCAAACAGGC CAATTGAACC TGTCACCCTC
CGGCTCTCTA CCGGAAAGTG CGTTCTAGTT GGAGGATTGG CCCGCATCGA GTTAATCGGC
GACTCAAGAC CCTTTATGTT CACATTTTTT GTTGCTAATG AGATCAAGCT CCACCCTACT
GACATAGAGA GAGCCGATGA GTTCGTTCTA AAGCACGCTG GTGGCATGTT GACTCCACCG
CTAGCACCCG GACCAAAACG TATGGAAGAG ATTGGAGAAT TTGAAGATCA CATCGTGGAT
ATCCAGGGTG CTGGCTGGAA AGAAGCTGCT GCTGATATCA GTCTTACCGG ACTAGGATGG
GTGGCCGTTA CAGGAGCAGG GACAGCGCAA GTAAAAATAA GTGTTCCGAA AGGTATTGGT
GTATCGGTGC GGCCTCCGCT TATGCCTTTC GATATCTGGA AAGTTGCATC GAAGTATACC
GGAAGTCGAG CTGTAAGAAA GTCATCCAAA CTGGCGAATG GGAAACGAAG AAAAGGTGTA
GGGCGTAATT AGTCTTGTTA GTCGTTAGAC TTTATTTTAA TTTGACTACT GTTAACAGGT
AAATTATAAC TTTTCTCTTT CAGTTGATAT CTAG
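
The gene sequence above is printed in blocks of 10 bases separated by spaces, so the whitespace has to be stripped before the sequence is measured or analysed. The following is a minimal Python sketch of that clean-up, with a simple G+C fraction calculation; the helper names `clean_sequence` and `gc_fraction` are illustrative and are not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGAACGA ATTTTGCTTT GTCGACGCGC TGCTTTGCTT CTTCATCCGA CAACCATGAC"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on a full line
print(gc_fraction(seq))  # G+C fraction of this line only
```

Applied to the full gene sequence, the same two helpers give a length and G+C fraction that can be compared with the Gene Length and GC content fields in the Gene Information block.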
 
Protein sequence
MRTNFALSTR CFASSSDNHD EEEQRDSPKQ RSKRSQTNRS KKFKIAESID QSKIDKLAQA 
FDELARKEGF DSSTARFADD VTFEDKFDDD SFLDDDDDNN KDKVGNLHLD ASMFSLSDFI
DKSEEDGGNP TDQDDEDYLD FGADIDMSIE ARIAAAKRDM DLGRVSAPPD MRSSRREVTA
ADLRKLGFRT EANPFGNDET PRKERFQLVT NSMSCSACGS DFQCHNEDRP GYLPPEKFAT
QTALGKIEQM QKLQDKAEKA EWTPEDEIEW LIQTQGKKDP NKEMQEVPQI DVDSLAGEMG
LDLVELSKKM VICKRCHGLQ NFGKVQDSLR PGWTKEPLLS QEKFRELLRP IKEKPAVIVA
LVDLFDFSGS VLPELDEIAG ENPVILAANK ADLLPSEMGR VRAESWVRRE LEYLGVKSLA
GMRGAVRLVS CKTGAGINDL LEKARGLAEE IDGDIYVVGA ANAGKSTLLN FVLGQDKVNR
SPGKARAGNR NAFKGAVTTS PLPGTTLKFI KVDLGGGRSL YDTPGLLVLG TVTQLLTPEE
LKIVVPKKPI EPVTLRLSTG KCVLVGGLAR IELIGDSRPF MFTFFVANEI KLHPTDIERA
DEFVLKHAGG MLTPPLAPGP KRMEEIGEFE DHIVDIQGAG WKEAAADISL TGLGWVAVTG
AGTAQVKISV PKGIGVSVRP PLMPFDIWKV ASKYTGSRAV NYNFSLSVDI