Gene PHATRDRAFT_43144 details

Gene Information

Locus tag: PHATRDRAFT_43144
Symbol:
ID: 7196751
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2148174
End bp: 2149932
Gene Length: 1759 bp
Protein Length: 409 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177457
Protein GI: 219111411
COG category:
COG ID:
TIGRFAM ID:
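
The Gene Length value can be checked against the Start bp and End bp values listed above. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state this, but it is consistent with the listed numbers. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature given 1-based, inclusive coordinates.

    Assumption: both endpoints are counted, so the length is
    end - start + 1 rather than end - start.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
print(gene_length(2148174, 2149932))  # 1759
```

The result matches the 1759 bp reported in the Gene Length field.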


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.471661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGACACCGG TTGCCATGCA ATACGGCGGA GCAGTATCTT GTCGATGCGC TTCATTGAAT 
TCTACCAACA TAGATTGAGG TCGATAGGGG GCACGCCGTT TACATTTCTC GTTGCAAGTT
TTACTTCGGT ATAAGTGATA CTGGAGTGCG GCAGCTTACT ATTAGATTGG GGAGAATTTG
AACGCCGTCA AAAACGATGG TAGTCCCACA TATCCTTGGC CCGTCAGCTT TCCCTACTGT
GCCAATGGCA CCCCCGGCGG CATGCTTGTC GGTATTGAGC TACAATATCC TGCTTCCAAA
TAGTATGGAT GGCTGGTGGA ATTACAAAAT GTATTCGCCT CCTTTGCCGG AATCGAAGCA
ACACGTTTCG TCTTGGAATT TTCGTAAGGA CTTGTTACGT GAACGAATCG CCACAGTCGG
TAGGTTCTTG TAGTTTGCGT CGGAATTCCT CTAATGCTCG CAACGCTGAC TCTTTGTTGT
GGTATAGATG CCGACATTGT ATGTTTACAG GAAGTATCGC CGGTTTCGTT CGATACCGAT
TTTGATTTTA TGCGAGAGCT AGGCTACGAT GGAAAAGAAA TGTTCAAGAA GGGTCGATTT
CGCCCCGCGA CGTTCTGGAA AACGTCGCGA TGTGAGATTG TGACCCCTCC GGTTCACAAG
GATCGTACAT TACTAACGGC CTTTCGGGTA TTGCCGCCGC CAACGGTGTC AGATCCAGCA
GAGACCCATG TGTGGTACAT TTTAAACTGC CACCTACAGG CTGGCAAGGA AGGGGGTCGA
AGAGTTCGAC AAATCCACGA AGGAGCTCGC TCGGTTCTGA CCTTGGCAAG AAAATTAAAA
CGTACGTTTG AGCCTTTTCC GTTGTGAACA TATGCTGCTC TGGTGTCTCT GATGCGATGT
AATAAATGTT TTTTTTATTT CGCAGAACCT AATCCCGAAC AATGTACAGC ATTTATAGTT
TGTGGGGACT TCAATGGAGG CCCCGAATGT GGCGCTGTAC GTTACTTGGA AGACGGGTTC
GTGGATGAGT CTTTTATCGA AGATGGAGAT AGGGTCACCT CCAAACGCAA AGACTTTCCG
TTCGAAAAAC CACTGACAGA TGTTATGGCT GCTTCTGACC GATCGCCGCC GCCAACGCTA
GTTGTAAGCG AGCTTATATC TACCATGGTC CGCAGTAATG CGTACGAAAA TCCAGAGTTT
TCGGAAGATA TGATGGAACG TCTTGTCCGT ATTTATGAAA GATTAGCGAC AAAGTCGCAA
GAATCTGGTT GTAAGATGAT GGACGAGGAA GATGTCGAAC GCTGGTTGAT CACTACTAAC
GGGCAAGTTG GCCGGGGTAG CGAATTTCGT AACGCAGCGA AAGAGATGGG ATGGACCGAG
GGATGCAGCG CAAAACGCCA AGACGGCAAA CCGCACGTTG AACTTCCCAA AAGAGGGATT
CTTTCACTGG AAGGTTTCGT AAACGTGTAT CAAGCAGAGC TTCGGCAAGG CAAATTCTGG
GGCATTGCGC ACGACATGGC CGTTTTGGGG GAGCCCCTAC CTGATGCAGG CGTGTTTCAG
TCAAGGTTCG ATCGTATGTA CTGCTCCAAA GCTCTACAGC CAACCGCCGT AATGGACTTT
TTGTGCTTGG ACCCTTGCCC AAACGAGATT GAACCGTCAG ACCATCTCCC CGTAGCAGCT
TCATTTACAC TTTTTAGCTA AGGTCGTAAG GCTACTGGAC ATAGCAGTAA ATTCGAAAAT
ATAACACTGT TTACATTCC
 
Protein sequence
MVVPHILGPS AFPTVPMAPP AACLSVLSYN ILLPNSMDGW WNYKMYSPPL PESKQHVSSW 
NFRKDLLRER IATVDADIVC LQEVSPVSFD TDFDFMRELG YDGKEMFKKG RFRPATFWKT
SRCEIVTPPV HKDRTLLTAF RVLPPPTVSD PAETHVWYIL NCHLQAGKEG GRRVRQIHEG
ARSVLTLARK LKQPNPEQCT AFIVCGDFNG GPECGAVRYL EDGFVDESFI EDGDRVTSKR
KDFPFEKPLT DVMAASDRSP PPTLVVSELI STMVRSNAYE NPEFSEDMME RLVRIYERLA
TKSQESGCKM MDEEDVERWL ITTNGQVGRG SEFRNAAKEM GWTEGCSAKR QDGKPHVELP
KRGILSLEGF VNVYQAELRQ GKFWGIAHDM AVLGEPLPDA GVFQSSQPP