Gene PHATRDRAFT_47224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47224 
Symbol 
ID7202203 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp857541 
End bp859018 
Gene Length1478 bp 
Protein Length463 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181457 
Protein GI219122238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000362966 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCCATGACG AGAAATGGAA GAATCAAAAA TGCATCAGCC ACTCTGGCGA TTTTGGTTGC 
TTTTGTGATT CTACAGCTAC AGAAGGACCG AGATATGAAA TTTCGAACAT CCTCACCACC
GAATGAACAA CGCATGCACC ACAGGAATGC TACTCACTTG GCGCCCACAA CGAACAGTGT
TATCCCTAGA CGTAACGGGA CGCTATGTGA TGATCTTCAT CCTTTACAAG CTTTAGAGCG
TTACAAAGCT CAACACTCGC AAGCAATCAT GCTAAGTGAA TCCCCTGCTG ATGCCGTCCA
TCGCAGGTAT GCCATTGGCT ACTACTCCTG TCCTTTCCAA GCCGGCAACC GCTTACATCA
TTTCTTCAAT GCCATGATTT GGGCCATAGT TACCAATCGC ACTCTCTTGT GGAAGTACTA
CGATGCTAAA ACGTGTCGTC TTGTGTCACA GAGAAGAAGC CAGCCACATC ACGACAGACA
AATTTGCCTA GTCGCCAACA CCGAAGCAGA ATGCGAAGTG GTGCTTCATC GAAAGGCATC
TTGGATTCCT TCGCTGGAAG AATGGGCGCC TAAGCAGCTG GGGAGTAACA CCACTTTAAC
GTCACTTTCG TATTGGAGTA CTCATCGTCC TCCATCCGAC CCGACTCGTT CCAAGGTTCG
TTGGCGGGAT ATTGATTCGA AACACCAAGG TGTGGACCTT TGGACTGAAT TTCAGATTGT
TGACTTTCCA CAAATGTTAG GGAAGGATGC GGGAAATGTT TTAGGAGACG AAAAAGGACG
CCTCGACATG TTGGCGACCG ATTCAGCGCG GGATGCAGCG CGGAACTTGT TTGCTGAAGG
CTCACATTTC ACTTACGGTA TGGTACGATC ATTCACGATT GTGTGATTGC CTTGAACATA
TTACTGTCCA ATACTCACTG AAAGACCCCT TAAACACGAC AAAGCTTTTT CGAGAAGTAT
TTGACCTTCG ACCGAGCGTG CTTTCTTTAG ACAGCACGTC TGTGCTCGAC GCCGTCAGTT
TAAGTAATCC CTTTTCCATT GCCTTACATT CCCGACATTC CAAGCCTGAG GACAACGGTT
CTGACGTTTC TACAGAACTA AAGTGTTTGA CAAGTCTCAC TCTAAATCGT ACACAAGGAG
ATAAATGTGT CGTGTACCTC CTATCCGATC GGGTCAGAAC ACTCGAGCAA TTGACGAATC
ATGTGAACGA GAATCTGAAC TGTACAGCTG TTGTTGCAAA CCACGACGGT GGTCACCACA
TCAGGGGAGA ACACGGTCCT TTCGCGGGAG CTGACTTTTT CCGGGACCTC GACCTCGCCT
CTCGTGCACG AAACGGCTTC GCTGGTTCTA CCCGGAGCTC ATCGAGTCTA TTGCAAGAGT
GGATCGAGTA CGATCGTACG ATCGAGTCTT GCAGCCCCAG AACGAAAGGG GTCCTGCCGC
CCCTACCGAA ACTCAATACG TGTAAATTAC CAAAATAG
 
Protein sequence
MTRNGRIKNA SATLAILVAF VILQLQKDRD MKFRTSSPPN EQRMHHRNAT HLAPTTNSVI 
PRRNGTLCDD LHPLQALERY KAQHSQAIML SESPADAVHR RYAIGYYSCP FQAGNRLHHF
FNAMIWAIVT NRTLLWKYYD AKTCRLVSQR RSQPHHDRQI CLVANTEAEC EVVLHRKASW
IPSLEEWAPK QLGSNTTLTS LSYWSTHRPP SDPTRSKVRW RDIDSKHQGV DLWTEFQIVD
FPQMLGKDAG NVLGDEKGRL DMLATDSARD AARNLFAEGS HFTYGMLFRE VFDLRPSVLS
LDSTSVLDAV SLSNPFSIAL HSRHSKPEDN GSDVSTELKC LTSLTLNRTQ GDKCVVYLLS
DRVRTLEQLT NHVNENLNCT AVVANHDGGH HIRGEHGPFA GADFFRDLDL ASRARNGFAG
STRSSSSLLQ EWIEYDRTIE SCSPRTKGVL PPLPKLNTCK LPK