Gene PHATRDRAFT_45234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45234 
Symbol 
ID7200109 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp561953 
End bp564909 
Gene Length2957 bp 
Protein Length935 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179240 
Protein GI219116891 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCTC ATGAGTCTGC AATTGTAGTG ATTGACGATT CGGAGAACGA CGATGGACCT 
GATAGTCCCA CTTCCAACAC GTCACCAGAA GTGCTTTCAT TCGAGCAACA AGAGGCTATT
CACTTGTTTT CCTCCATCAC CGACATCTCC GACACTAAAG CGGTTAAGGA TAGCCTCGAA
ATGTTCGATT ACGATGTGGA GAGAACGATT AACAAATGGA TCGAACAGCA GCATTCTTAC
CAAAGATCGC CCATCGCCGA CAACTCTGCA GCTTTTCGAC TAATCAACCG AAATGCTTCA
CGAATTAACT CGCCGGTAAA ACGAAACTTG TTTACGGGTA TGCCGAAAAA GTTGGATTTA
GATACGAAGG TGTCTGTGCA AAAAAGTGCT ATAAAGAGCA TACCTTCATC GTCTTCCTCC
AAAGTTCTCT CTCTGTCATG GCACGATTGT GTGGAGCGAG TTCTCCAATC AAAGGAACCC
TTTTTTATTG ATGCTGACTT TCCGCCAACG AGCAAATCGC TGGATGGTCG CCAACGCCGA
GCCAGTGAAA ACAAGGGGCA ACGAACTTTG TGCGCATGCG GCGTTCCTGC AGCTGCCAAA
GTTGTCCAAT CAGATGGACC GAACTACGGC CGTTTTTATC TGTCCTGTGG CAAGCAAACA
CAACGGCGAG CGCCGGTATT GGTAGTTCGT AAACAAGACG ATGCACAAAA TAGTGCTGAT
TGTCAAGACG AAGCCAAACT TCTGAAAGAT CCACCAGTTA CTGTCAACAA TCCATACGCA
AAATCTAAGC CATCAACACC ATCGAAACAT TGTCGCACTA ACCCATCATC GCCAAAGTCT
CCGCCGCAAC GGCGGTCGTG CACTTTTTTC AAATGGGATC CAGACGGTTC CATCGGAGCA
TCGGGCTACG CTACAAGATA CTCGTTGTTT GTCTGGCAAC ATTTTGGATT GGAGAACAAC
TGTTGCTTGT ACCGCACGTC AATCGATCCT TCGCAAGTAC GTCAAGGCGC GGTAGGAAAC
TGCTGGTTCC TATCAGCACT GGCAGTCGTA GCGGAAAAGT CGTATTTGGT TCGCCAACTG
TTGCCACATG ACAAATTGAA TCCCCAAGGT TGTTACGAAG TCAACCTTTG CTTGGACGGC
GCTTGGACAC CAGTTCGGGT AGATTCGACT TTGCCTGTTG TACTGCAAGA TGTGAACAAA
ACGACAGGAG GATCTTTGTT ACAATCACTC CGTCATGGTG TTCCGCTAAA TAGCTGTAAA
GAATTGGTGG CTACCCCCGC CTTTTGTTCG GCACCAGACC TCCAATTGTG GCCAGCTTTG
GTCGAAAAAG CCTACGCAAA AGCTCACGGC TCTTACGCAC AGCTTTCCGG TGGTTTTATT
GCGGAAGGAT TGACTGATTT GACGGGTGCT CCAACAGAAA CTATAATCTT TTCGGATTTA
ATAGATTTAG ATGAGTTGTG GGCGCGCTTG CTATCTTTTC ACCAAGCTGG TTTTCTCGTC
GGTGTCGCCA CTTCTCGAGG GGGTGAAGGC CTTGTTGGTG GCCATGCGTA TAGCTTATTG
GATGTAATTG AGATCAACAA CTCACTGATT GGTGAACAAA AGAAAGTGAC TGATTACTTT
TCGAGCCCTT CTAAGAAGCA TCGGAAACTT ACTGGCCCAA ACTATGACAC TTGTGCACCT
ATTAGACTTG TGCGGATTCG AAACCCGTAA GTTTTCTTTG GTGATATTGG ATTGACTCGA
ACAACACCTT AACATTTCCT TTTGGCGTGC GTTTCAGTTG GGGAAAACGG GAGTGGAAAG
GTGACTGGAG CGTTGATAGT GAACGCTGGA CTCGAGCGCT GCGGAAAAAG ATTGGATCTG
ATGCGTTTGC TAGGGGAGAC GGCACATTTT TTATGTCGTT TGAAGATATG TTGCAACGAT
TTCATCACAT GGACATTGCC AAAACTCGAG AGGTTCGTGT TCGCAGTGAG TCGCCCTCGT
CCTGTCAATT CGTTGCTTTC TGACACCTTG CTGAAATCCA ATTCGCCTAG GGCTGGAAGC
ACTCGTGCTC TGATGGTATT TTCCAAAGGA ATGGCGATCC GATCGCATCT TCTAAATACA
CTTATGAGAT TATTCCGTCC TGTCGTACTT GGGCATTTGT TTCGTTGGTC CAGAAGAAGA
AACGCGCAAA CAGTAACTCT AAGTATTGGT ATTGCGACCC TTCGATGCTA ATTTTAAAGC
GGAGGTCGGA TACTGAGGAG TGGACCTGCG AAGCTTCAGT ACTTACGGGT ATTGGGAGAA
TGAGCGATTG TGAAATCTTC CTTGACCCTG ATTTCTCGTA CATGTGTGTC TTAGTATCAT
GTATCGGTTG CATGGATACA CCGGAATCCT TTGAGTTCCG GCTGTCGACT TACAGTTCGG
AAGAAGTAAC CGTGCGGCCC GTTCTGAATG AAAGAATTCT CTGCTTAATG ACCCTTCGAC
TTCTTCACAA ATTGCTGCTC AATCGAGGAC ATAAACTCCT GTACCCGGTC GCGCCATTTG
GTGTTTTAAG CTGCATCCAT GGTAGCGGCT GCCTATACTT CGTTGCGGTA AACGGTGCTT
GCGATGAATT TTTGTCTATT CGATTGACGC TTGATATTCA AGAGGGGATG ATGCTCGTTT
ACGGAAAAAG TGGGGATTCA TTTGATATTC CTCCGAGACG CCAGCAAATC TTGGCAATCG
TTTCAAGAAA TGGGAAACGT TGTACGTGCA CTCATTTAAG CTTTCGTTAT TTGAGTAGTA
CCATTAAGTC CAGCAAAGAT GGATCTCATG TTTCAAAGTT GCAGTACTCG GGCATGAGAG
GTAGCGTTGA GCTTGGTCTC GCGGCAGACT TGCTTACGAG CAGTTCCGAC TCCTCACGGG
TCTGTATTAG GGGCGGCGAT TCACTCGAGA TTTACCAGTG GATTCCGCAA GTGGGCTCCT
GTTTTGATCT AGTCTAG
 
Protein sequence
MASHESAIVV IDDSENDDGP DSPTSNTSPE VLSFEQQEAI HLFSSITDIS DTKAVKDSLE 
MFDYDVERTI NKWIEQQHSY QRSPIADNSA AFRLINRNAS RINSPVKRNL FTGMPKKLDL
DTKVSVQKSA IKSIPSSSSS KVLSLSWHDC VERVLQSKEP FFIDADFPPT SKSLDGRQRR
ASENKGQRTL CACGVPAAAK VVQSDGPNYG RFYLSCGKQT QRRAPVLVVR KQDDAQNSAD
CQDEAKLLKD PPVTVNNPYA KSKPSTPSKH CRTNPSSPKS PPQRRSCTFF KWDPDGSIGA
SGYATRYSLF VWQHFGLENN CCLYRTSIDP SQVRQGAVGN CWFLSALAVV AEKSYLVRQL
LPHDKLNPQG CYEVNLCLDG AWTPVRVDST LPVVLQDVNK TTGGSLLQSL RHGVPLNSCK
ELVATPAFCS APDLQLWPAL VEKAYAKAHG SYAQLSGGFI AEGLTDLTGA PTETIIFSDL
IDLDELWARL LSFHQAGFLV GVATSRGGEG LVGGHAYSLL DVIEINNSLI GEQKKVTDYF
SSPSKKHRKL TGPNYDTCAP IRLVRIRNPW GKREWKGDWS VDSERWTRAL RKKIGSDAFA
RGDGTFFMSF EDMLQRFHHM DIAKTREGWK HSCSDGIFQR NGDPIASSKY TYEIIPSCRT
WAFVSLVQKK KRANSNSKYW YCDPSMLILK RRSDTEEWTC EASVLTGIGR MSDCEIFLDP
DFSYMCVLVS CIGCMDTPES FEFRLSTYSS EEVTVRPVLN ERILCLMTLR LLHKLLLNRG
HKLLYPVAPF GVLSCIHGSG CLYFVAVNGA CDEFLSIRLT LDIQEGMMLV YGKSGDSFDI
PPRRQQILAI VSRNGKRCTC THLSFRYLSS TIKSSKDGSH VSKLQYSGMR GSVELGLAAD
LLTSSSDSSR VCIRGGDSLE IYQWIPQVGS CFDLV