Gene PHATRDRAFT_49130 details

Gene Information

Locus tag: PHATRDRAFT_49130
Symbol:
ID: 7195508
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 2167
End bp: 3957
Gene Length: 1791 bp
Protein Length: 472 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183790
Protein GI: 219127121
COG category:
COG ID:
TIGRFAM ID:
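
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and the function name `span_length` is hypothetical rather than part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
START_BP, END_BP = 2167, 3957
PROTEIN_LENGTH_AA = 472

gene_length = span_length(START_BP, END_BP)
codon_nt = 3 * PROTEIN_LENGTH_AA  # nucleotides needed to encode the amino acids

print(gene_length)             # 1791
print(codon_nt)                # 1416
print(gene_length - codon_nt)  # 375
```

Under that assumption the computed span matches the listed Gene Length of 1791 bp. The 375 bp not accounted for by amino-acid codons would be the stop codon plus non-coding sequence, which is consistent with the gene being marked as spliced.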


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTCCGGTAA ATTATGAGTA TGTTTCTTGG AGAAAACGAA GGAAACCCAT GTATGCTACA 
GAAGGGCGCG ATGTCGACCA ATTCGAATTT TCTCAGCTCC TCTTTTCATG TCCGGTTCCT
TCACGCTTTA TCGAACAGAA TAACTTCAAC GTGATTAATC CACTGTTTTG GATCGACTTG
GTACCAATAC GAGCGCCTGC CCGACGCGGT CCCATGCTAA TGACCAAAGA TCAGGTAGGC
CCGCAAGAAT TTCAGCACCT GGATCGTTTC AGTATCTTGG AACACTACGG AAACGCGCAC
ATTCTCCCTG CCATCGAAGA TTCGGGTAGA ATCGTTAACC TTCCTGTTTG TCCGACAACA
CCGAAGCCAC AGCATCATCC TTCATACCAA TTGGTGGCGT GCACTTGGAC ATCTGCTTCT
TATAATCGAC GGGGGGACAC AACGACTGTG GAAGACTCTG CTGCCCGCCT GGAAGAATGG
ATTGTCTTTC ATCGGACTGT CGGATTTGAT CACATTTATA TTTACGACAA TACACAGGTT
CCACAGAACT CTTCGGAATC CGTTCTATTC AAGATAGCAT CACAGTTTCC GAGCTTTGTC
ACATATCATT CATGGCCGGC AAAATCGTGC AGCAACAATC GACCAAATCA CAAAAATCCT
GGCGAGCGTT CCTCCCAATA TGCAGCCGAG GCTTCATGCC GTGAACGTTA CGGTCCGACT
GCATCCTGGA TGGCATTTAT TGATACAGAT GAGTATTTGG CACCGATGGG GAACAAAACT
TGGTTACCTC TGCTGGATAA AATGGACGCA AAGGACATCA AAGTTCTAAA ACTGAAGAGC
AGTCGCGGCC GTCCCCGAGA AAGTTTGATG CAACTCTTGG ATGATCCTAA CGAATGCAAT
AGCCAATCAC GGCTGAGCTC TTTTTCAAAG AACGAGTGCT TGATCCCGAG GAAGAATGAA
ACTTTTCTAC GGGTATACAA TTGTGACTTC ATCCGGCCTC CACGCCCCGT TCGCTTTGCC
CGCGCGATGA AGCAAATTTA CAAGCCCAAC TTTGTTCTTA GCCACTTTGT ACATTACTCT
ACGATTACAG CAAGCATGTC ACGATACTAC AAGGATTTTA AGGCTAGAGA CCTATACACA
CGTGAGCTCA ATGAAGGGGA CTGGGGTGAT ATTTTTCTTG ACGAGCGAAC GGAGGGAACA
CTTATCCACG CAAAGTCGGT GCTTCCACAC GAGACAATGG CGCGAAAGGA CTCATGTCAG
ATTGCGTCAA AGCGACCTTG TGTGATTGGT CATGTCTGTC CCAATACGAC GCCTTTTGTA
GACGCTATTC ATCAAAAGAA CGTATTCCGA GATGCTGATG GAAATTTTTG CAACTGCTGG
TAAGTGTTGT CGAGTCCAGA TGCTTGCCGC TAGGGCAGTG TTTAACTTTT ACCTTCTCGT
CGTCGCTAGG ATCAACGAGC ATGTGGAGAA AACTTTGATT CCAAATCTTG AAAAGGCTCT
CCGGGAGCAC AAAAGAAATT CGTTCATGGC CGACTAACAC TAAGGCTTAT TGCTGTAGTG
GATGCAGCAA CTCAAGGACA CAACGTGATT CCGTTGGAGA ATGTGCCGGC TAGTCTCACG
TGCCGAAGCG CGAGATATGT AAAGAGACTA TTGACTTTCC TGTCGGTAAG AAGTTGCATT
CATGCAAGCG GACCGAAAGA TACGGACCTC GAATGTGAAC GAAAGGTTTG TCACTGTCGT
TTCGAGCGTC TCTCTACTAG ATAGAGAATT GGTAGGAAAA AGCTAGTGTC T
 
Protein sequence
MYATEGRDVD QFEFSQLLFS CPVPSRFIEQ NNFNVINPLF WIDLVPIRAP ARRGPMLMTK 
DQVGPQEFQH LDRFSILEHY GNAHILPAIE DSGRIVNLPV CPTTPKPQHH PSYQLVACTW
TSASYNRRGD TTTVEDSAAR LEEWIVFHRT VGFDHIYIYD NTQVPQNSSE SVLFKIASQF
PSFVTYHSWP AKSCSNNRPN HKNPGERSSQ YAAEASCRER YGPTASWMAF IDTDEYLAPM
GNKTWLPLLD KMDAKDIKVL KLKSSRGRPR ESLMQLLDDP NECNSQSRLS SFSKNECLIP
RKNETFLRVY NCDFIRPPRP VRFARAMKQI YKPNFVLSHF VHYSTITASM SRYYKDFKAR
DLYTRELNEG DWGDIFLDER TEGTLIHAKS VLPHETMARK DSCQIASKRP CVIGHVCPNT
TPFVDAIHQK NVFRDADGNF CNCWINEHVE KTLIPNLEKA LREHKRNSFM AD
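
The sequences above are printed in blocks of ten characters. A minimal sketch of how such a listing might be normalised and its GC fraction computed follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the first printed line of the gene sequence, so its GC fraction is not expected to equal the 47% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, copied as an excerpt.
first_line = "TTTCCGGTAA ATTATGAGTA TGTTTCTTGG AGAAAACGAA GGAAACCCAT GTATGCTACA"

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of the excerpt only
```

Pasting the full gene sequence into `clean_sequence` in place of the excerpt would allow the listed Gene Length and GC content to be checked the same way.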