Gene PHATRDRAFT_43361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43361 
Symbol 
ID7197403 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp245794 
End bp247649 
Gene Length1856 bp 
Protein Length464 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177893 
Protein GI219112283 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAATTGCAA AGATCGTCCC ACTATAGTCA GTCCCATTCG TTTGCCATCC TACCGAGTCT 
CCCTTTCCAA TTCCCACTCA CACTGTAACG ATGGACGAAC ATTGTGTACG TGCAGAACAA
CCCCAATCAC GAGGCAATAC CGAATCCAAG TTTGACATGC GCCACAATCC GGTGTATCAG
GTAGACTGGA AGAACGAAGC TTCCGGCAAG CGCATGTCGG CGACCAAACG TCGAGTCCGC
TTTCGATTTG GTTTTTCCAA CCGCCAAGCC ATTTCCGAGG GTTGCACCGG GTCCGAATGC
CGCGGGGAAG AGCACGAAGT CTTGCTCGTT TGGAGTTTGA CTTCGGGCAA ACGTCTCGTG
CTGGCGGATG GACAACAGGT ACATTTTAGT TTCGGCAAGC GGACCGACGG CAGGTTCGAA
ACATCCTGGA CCATGTCGGG TGGACACTTA TTCAAGTTGG TCGCACACGC GGCGCCGCCC
CTCTTCGCCA CGCCCGACGG CTTTCGGCAG TTTGACTTTG GCGTGGACGG CTGCTCTTTC
TTTGACATGC CCAAGATCTT TGAATTGGGT ATGCGGAACA GCAACAGTCG TGCTCTCGTT
AAGCCCTCCT CGAATCGCTC CAGCTACGAC AATTACACGT TGCCTCACTC ACCGTCGCAG
ACTGTCACCA GTCCCCGTTC GGTTACCTTT AATGATCACG TGCAAACCAA ATTGATCCCC
TCCAAGGAAG CTCAGCGTCA CGCCGAAATG GATCTATCGT CGGCGCCAGC GTCAGCCAGC
CACTCGACTC CCCCGGACCT CATGGACAAC ACCCACACTC TTGATCGTAC CTCCCCCTCA
ACGGTAGTGG ACGAATTCGC ACCCGCGGCG ACTCACGTCA GTCCTGCCTT TCAATCCCGT
CAAATTATGG ACGCCTACGG GACGACTACG GTTGGAGTCC TCGCCCTGGC GAACGAGTCC
CATACGCACA ACATACCACC CGTTACGCCT CACTTGTACA CGCCATCCGC TCCAAACGTG
TACGCGCCAT CACCACCGGC CACTCCCGGC ACGTACCCGA ACCAGAACCA GAACCGCCAC
CAGGAATCCC TTCCCCAACC CCGGCAGTTG GCGTACCACA CCTCCCCAAT CTACAATAAC
GCAGGGTATC CGCAGCAACA GCTCACTTCA TACCAACCGA CTCCTGTGAT ACCGGAAACT
CCCCAGGCTC CTCTGCAAAT TCTCAAACCG ACCATGGAAC CCTTATCAAT GGAAGAAATG
GAAGAGCGCG AACAAACTCT CCAGTCCGAC CTCGAACGTG CCCTCAATGT CTTAGTCAAT
CTTGACGATG TTACGCAGGT CAAAACGACT CCAGAGCAAC GCAAAACGGT AGAGAAAAAG
CTCCACGTGG GACCAGCCAA ATCCAAACCG GTAGCTCCCG CTGCACCCGT ATGGCACTTG
GGCCTGCAGC CATCTCTGCA GCAGATTCAA ACCCACAAGG TCAAGAAGGA ACCGAAGAAG
GAAAGTATGC GCACCCACGC GTTTGATCCG GCCGCGGCCC ATGCCGGTAT GATGGTCCTG
TACGGGTCTA GTACCCCATC GCAACCGCCG GCGTACGCAC ACGGCCATCC GTACACGGCC
ACCCCACACC AGGCACAGTT CTGCGCGTCT CCGCAACAGG CGCAAGGCTA TTCGCCGCAG
CTACCGCAGC AACAACGGGC GTATACGGCT TATTGAGTGG GAAAACGGAT GCTGCTGGTG
GAGGTTTCTC TGTCTCTACA GTCCATTGGA ACCGTGCCAT TCATGTCGTG AACTATTGTT
TGATTCCCTT TGGTAAAGTG CATATTCTAT ATTTACGTAA TTCTGGATTT GATTTG
 
Protein sequence
MDEHCVRAEQ PQSRGNTESK FDMRHNPVYQ VDWKNEASGK RMSATKRRVR FRFGFSNRQA 
ISEGCTGSEC RGEEHEVLLV WSLTSGKRLV LADGQQVHFS FGKRTDGRFE TSWTMSGGHL
FKLVAHAAPP LFATPDGFRQ FDFGVDGCSF FDMPKIFELG MRNSNSRALV KPSSNRSSYD
NYTLPHSPSQ TVTSPRSVTF NDHVQTKLIP SKEAQRHAEM DLSSAPASAS HSTPPDLMDN
THTLDRTSPS TVVDEFAPAA THVSPAFQSR QIMDAYGTTT VGVLALANES HTHNIPPVTP
HLYTPSAPNV YAPSPPATPG TYPNQNQNRH QESLPQPRQL AYHTSPIYNN AGYPQQQLTS
YQPTPVIPET PQAPLQILKP TMEPLSMEEM EEREQTLQSD LERQNDSRAT QNGREKAPRG
TSQIQTGSSR CTRMALGPAA ISAADSNPQG QEGTEEGKYA HPRV