Gene PHATRDRAFT_42576 details


Gene Information

Locus tag: PHATRDRAFT_42576
Symbol:
ID: 7195957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 481276
End bp: 482687
Gene Length: 1412 bp
Protein Length: 461 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002176591
Protein GI: 219109674
COG category:
COG ID:
TIGRFAM ID:
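
The reported lengths can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive (which matches the reported 1412 bp) and that the coding region is the protein length plus one stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(481276, 482687)   # values from the record above
cds_len = coding_length(461)             # 461 aa from the record above
print(gene_len, cds_len, gene_len - cds_len)  # 1412 1386 26
```

Under these assumptions the gene span is 26 nt longer than the coding region, which suggests that the listed gene sequence contains some non-coding bases in addition to the open reading frame.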


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.536498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTCACAGAC TCGAAGGAGA GCGTCAATGG ACGAAATGCC CAACTTCACT CGTTTCACGC 
CGGATCAGCA ATGCTCCCCA CACATCGCAA GGAGTAACGG GCGCGAGCTC TCTGAGGATG
CTGTTTTCCA ACGCATTCCT CGTACGCTTA CCCCTCCGCA AACACCTCGA TCGTTCGCCA
TCGAAGACGA GTTTTCTTTG ACAACCAAGC TGCGTCCCTG GCGTAAACCG TTGGATAAGC
CGAAGCGCCC ACTGAGCGCG TACAATCTAT TTTTCCAACT AGCGAGACAG CGACTTATTT
CGGATACACC GAGCAACCTT CCCTTTACGG CAAAGGATGT TGAACTTATC AGTATGAAGC
ACAAACAAAA AAAGGAAAAA CGTCGTCATC GTAAGACACA TGGTAAAATT AGCTTTGCCG
ATCTTGCGCG GTCGATTGCC TCGCAATGGA AAGAACTGAG CGACGACGAC AAAGTGATCT
TCGAAGAACG CGCAGAAATG GAAAAATGTC GTTACAAGCA AGAGTTAAGC GAGTGGAGCG
CGAAGCAAGA GCCAAGCGCG GAACGAAAGG CGGCCATGTT GCGCAAGGTG TCTCTCAAAC
AAGGCTCAAG CTTTTCAATG GCTACGACTG CGGAACAACT ATCGAGCACC AGCAGAGCTC
CTGATCACGG AAACCCCATC CGCTCTACCA ATTCTTTTGG AATTTCCAGT CCACCACAAC
GGCCTCCGTA CGATTTGGAC ATGTACACAG CAAGCCACGA GGCAGCGATA CAGGAAGAAG
CATCGCTTAA CGCACTCATC GCACACCAAG GTCAATCGCT GGCTCGCTAT CAAGCCATGA
TGGAGCAGCA AATGGACCAA CATGCTTCCA TGCACCCGAT GTCTTCTATG CGGGGACTTC
CCTCTAGCAA TTACAACGAT CGTCTCCATC GCAATTCATA TAGACCAGCC CATCCTAATC
CGATGAACGG CACATTCGGG AGCGGAGGCA AAACTAACCT TCACTATAAT GATGGGACGC
CTGATTGCTA CGACATGGCT CAGCAATGCT ACAGTATGGC TCACCGACAG CAGCAGCAAA
ACTGCAGGCA CTATTCTAAA CAACAACGAC ATGGACCGCC GCCCAGTGAT ATGCACTCCG
GGATACCGAG TAACACAACC GAAGTCATGC TAGCGCGTGC GAGAACTACA ATGCAACCAC
ACATGGCACC GTCGAGCTCT GACAGCTACT GGACCATGGA CCGGTCGACC GAACACCGTC
GACTCATACC ACCAAACTCT GCAGGCTATC GGCATTCGCG CTCGGCGCAG GATGAGAGCA
CGCTAATGTT GGACCCGCAT CAAGGTCTTT CGTCGCCGCT ATTGGGGACA GCGGACGAAC
ACCTGGATCC GTTTGCGAAC GTGTACATCT GA
 
Protein sequence
MDEMPNFTRF TPDQQCSPHI ARSNGRELSE DAVFQRIPRT LTPPQTPRSF AIEDEFSLTT 
KLRPWRKPLD KPKRPLSAYN LFFQLARQRL ISDTPSNLPF TAKDVELISM KHKQKKEKRR
HRKTHGKISF ADLARSIASQ WKELSDDDKV IFEERAEMEK CRYKQELSEW SAKQEPSAER
KAAMLRKVSL KQGSSFSMAT TAEQLSSTSR APDHGNPIRS TNSFGISSPP QRPPYDLDMY
TASHEAAIQE EASLNALIAH QGQSLARYQA MMEQQMDQHA SMHPMSSMRG LPSSNYNDRL
HRNSYRPAHP NPMNGTFGSG GKTNLHYNDG TPDCYDMAQQ CYSMAHRQQQ QNCRHYSKQQ
RHGPPPSDMH SGIPSNTTEV MLARARTTMQ PHMAPSSSDS YWTMDRSTEH RRLIPPNSAG
YRHSRSAQDE STLMLDPHQG LSSPLLGTAD EHLDPFANVY I
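
The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the database's own method: the record leaves the translation table field empty, so the standard genetic code is an assumption here, and `translate_from_first_atg` and `gc_fraction` are hypothetical helper names. The example translates only the first line of the gene sequence, starting at the first ATG.

```python
# Standard genetic code (an assumption; the record does not name a table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def translate_from_first_atg(dna: str) -> tuple[int, str]:
    """Return (0-based offset of the first ATG, peptide up to the first stop)."""
    dna = clean(dna)
    start = dna.find("ATG")
    if start < 0:
        return -1, ""
    peptide = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        peptide.append(aa)
    return start, "".join(peptide)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First line of the gene sequence as listed above.
first_line = "TTTCACAGAC TCGAAGGAGA GCGTCAATGG ACGAAATGCC CAACTTCACT CGTTTCACGC"
offset, peptide = translate_from_first_atg(first_line)
print(offset, peptide)  # 26 MDEMPNFTRFT
```

The first ATG sits 26 nt into the listed gene sequence, and the translated residues match the start of the protein sequence (MDEMPNFTRF T...). Passing the full gene sequence to `gc_fraction` would allow the reported GC content to be compared against the sequence itself.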