Gene PHATRDRAFT_33901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33901 
Symbol 
ID7197747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp626345 
End bp628135 
Gene Length1791 bp 
Protein Length596 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178265 
Protein GI219114939 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.530565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCTC TCACCACTCA TACCACCCAT ACTGGTACCT TCGCCAACGG CACCGACGAC 
GGCGACAACA ACAACATGGT ACCCACCGTG GCAGATCGTG GGACGTGCGT TGCCATTCTC
ACGGACGATC CCGACGCATC GAAGCCGTGG TATCAGCGTC TCGCATGGTG GTGCACGTCT
CTGCCGTGGG TTTGGGCCAC AACGACGGTG TTGGTCTTGG TGGTGGTGGG TGTCACGCTG
TTGACGATCC AGGTCCGTGC CGACTCGTCC ACCACTCCCG TGTATGATCC GCGTGCACAC
GGAGAGGCCT ACAACCGTAC CGCCTACTAC GTCGCACTCC GGGATGTCCT CCTGTCGTCC
TCCGACGATG CACCGGTCGT GTGGACGACT CCGGGATCGC CGATGGAACG GGCCTTGGCT
TGGATGACGT TGGACGATCC CTTGGCACCC TTGCCCTTTT TCGAATCCCA CGCCACGTCC
GACGAACAAG CAAAAGAATC CACCGCCACC ACGCTATACG CGTACGAACG CACGCGACTC
CACCAACGCT TCGCCTTGTG TGTCTTGTAC TACACATGGG CCGGTCCAAC CTGGACTTTG
GAACCGTCCC ACGGCTGGTT GCACCGACAT TCCGGAATCG AGTCCTCCGG ACGACTCGCG
GACCAGTCGA TCGATGCGAC GCACGAATGT CTCTGGTTGG GGGTAACCTG CACGAGCGAC
AACCGCACCA GTGACGATCA CCGCGTGGTC ACCGGTTTGG ACTTTGGAAC CTCGAATGCC
GCACTCAAAG CGTATGGCAC CATTCCGGAG ATTGTGGGAC GCTTGACGCA TCTCCAAAAT
CTTCTCGTCT TTGATCAGCA ACTGCAAGGA CCGTTGCCTA CGACACTCTT CCTACTCACC
AATTTGCAAG CGTTGGACGT CAACACCAAT CGCCTCACGG CCATACCGGA AGCCTTGGGG
GATCACCTCG TGCATCTCCA TACACTCCAC TTGTACGGCA ACGAATTCCG CGGGACCCTT
CCCGCTTCTT TTACCCAACT CACACTTTTG GAAAATTTGC GGTTGGACGA CAATCCGGCT
TTGGTCCAGG ACGATTTTTG GTCCACCATG CTCCCCTCCT GGCCTCTTTT GCGGACCGTC
GTCACGTCCT CCACCGGGTT GGGCGGGAGT CTACCCACGG AAATTGGGAC GCTCCGTCAA
CTGGCTACCG TCTCGAGCAA CTTTGCACCC ATTTCGGGAA CTCTCCCCAC CGAACTCGGG
CTCTGTACCG GCATGGTGCA GTTCAACGTG AATCAACCCC AGGCGATGGC CGCGACCACG
CTCGCTGGTG GTTTCCAGGG TACCTTACCA ACCGAACTGG GTCGATGGAG CAATCTCCGT
TTTTTGGCCT TGCGGGGACA CGCCAATCTG GTGTCCACGT TGCCGTGGGA ATTGGCGTCC
TGGACGAATG TGCAACTGCT CGATCTGGAC CAGACGGCGG TCCGGGGGAC TCTGCCCGCC
TACGTGAGTC GCTGGTCGCA ATTGAATCGA CTAGTATTGT CGTCGACGGA TTTGACCGGC
ACGATTCCGA GCGAATTGGG AATGTTGTCG GATACCTTGA TGTCAATGGA ACTGCAAGAC
ACGGACCTGG TCGGGACCGT GCCGGTGGGT TTGTGCAACG GGGGTGGCGG CGTCGAATTC
GTCATTTCGT GTGACGGCAG TGCGGGTCCG AACGGGAGGG ACGGGAACCA AACGACGACG
ACGGCAGCAA AGGCGTTCCT CGTGTGCGAT TGCTGTCGGT GTTTGGAATA A
 
Protein sequence
MLPLTTHTTH TGTFANGTDD GDNNNMVPTV ADRGTCVAIL TDDPDASKPW YQRLAWWCTS 
LPWVWATTTV LVLVVVGVTL LTIQVRADSS TTPVYDPRAH GEAYNRTAYY VALRDVLLSS
SDDAPVVWTT PGSPMERALA WMTLDDPLAP LPFFESHATS DEQAKESTAT TLYAYERTRL
HQRFALCVLY YTWAGPTWTL EPSHGWLHRH SGIESSGRLA DQSIDATHEC LWLGVTCTSD
NRTSDDHRVV TGLDFGTSNA ALKAYGTIPE IVGRLTHLQN LLVFDQQLQG PLPTTLFLLT
NLQALDVNTN RLTAIPEALG DHLVHLHTLH LYGNEFRGTL PASFTQLTLL ENLRLDDNPA
LVQDDFWSTM LPSWPLLRTV VTSSTGLGGS LPTEIGTLRQ LATVSSNFAP ISGTLPTELG
LCTGMVQFNV NQPQAMAATT LAGGFQGTLP TELGRWSNLR FLALRGHANL VSTLPWELAS
WTNVQLLDLD QTAVRGTLPA YVSRWSQLNR LVLSSTDLTG TIPSELGMLS DTLMSMELQD
TDLVGTVPVG LCNGGGGVEF VISCDGSAGP NGRDGNQTTT TAAKAFLVCD CCRCLE