Gene PHATRDRAFT_42826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42826 
Symbol 
ID7196486 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1249962 
End bp1253665 
Gene Length3704 bp 
Protein Length1192 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176748 
Protein GI219109991 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGTAT TCTTCTTTCT CCTCCTGGAG CTTTCTTCTG TGGCGGGCTT CACTGGATTG 
TGGTCTCGAT CTTCTGGCTG GAGAAACCAA GCTCACACTT CATATTCTTC TTGGCTGAGA
GTCGTTGCTC TGTCCCCTGA TGCCGACGGC ACTACAGACC AAGATGATTT CAGTCTCCTT
AGGAATGCTA CAAATAATAA GGCACCGGTG AGAAGCAATG AAATAGGCAA TGGACGGATA
GACTTCAATC AATTGAGTGA GATGAAAGAG TCAATCGACA TTATATCGGT CATTGAGTCA
TATAATCTAG ATCGCTTCGA ACGGAAATCC GACGATCGAG CTACAGCTCT TTGTCCCTTT
CACGATGATG AGAATCCTTC CCTTTCGATT GATCGAACGA GAAAAATGTA CAAGTGTTTT
TCATGCGGAG CCGGGGGAGA TGCTTTCCGT TTTGTTCGTG AATATAGTAA ACTAAAAGGC
CAGCAAATGT CGTTTTATGA GGCCGTTCTC GAAGTCAGCA CGAAGTTTGG CGACGGCCAT
GTCGAAGGGA TCGGAAAGTC TGTCCAAGCG GAAAGCCCCG AGCTTCGACA GCGCAAAGAA
CGCACACTGT ACGCGAACGC CGTTGCGGCA GCATTCTACG CAACTTGCCT GACAAAGCCC
TCCGCAGGAG GTGCCCGGCA GTACCTCCGG CAGCGCGGGT TGACTCCAGA AGCTGTTCGA
ACGTTCGCCT TGGGATTTGC TCCGGATTTC TATTACGGTG CCAACCGATC GCGAAAGAAA
TGGGGAGAGG ATAGTTTGGT ACAGCATATG CAAAGTCTTA ACTTCACTGT AGATGAGCTT
TTGGAAGCTG GATTAGCGAC TGAGACAAAG ACGACGGAAG AACCTGTCCA AAAGTTTTCC
TTTGGAGAAG GTGAGCTTTT ATTTTTCAAC TGTGAATGTC TACTTTTATC TTTTGCTAAC
CTTACAAAGA TCCATTACTA GAAGGCGCTT TTACTTCAAA ACCAACAAGT ACAAGTTGCC
CTTTCGATTC AATCATCGAT CGGTTTCGTT TTCGAATTGT TGTTCCGATA ATGGACAAGG
CGGGAGCGAA CGTTTTGGGA TTTGGAGGGC GCATCTTGCC TTCGATCGTA GAAATGCCCA
ACGCCTACAA TCCTCCGAAG TACTTGAACA GTCCGGAGTC ACTTGTTTTT AAGAAAAAGT
GCATTCTATT TGGTCACAGT TTGGCTAAAG AGACTGTGAA GCAACCAAAA AAAAGCGAGC
ACGAGCAACT GGCGAACACT TTGATTCTTG TCGAAGGATA TATGGATGTT ATGTCTCTTT
GGACAATCGG CGTTAGAAAC GTGGTAGCAG CCATGGGAAC CGCTGTCACT ATGGACCAAC
TGGCTATCGC AGCAAAAAGC ATTCGAAATG GAAACTTAGT GCTGTGTCTG GATAATGACA
GCGCTGGATT ATTGGCGCTC GAAAGGCTTT GTTCGAATGG ATTACTTTCG CGGATCGTAT
CGAAGTACGG GACTGAGATT AGCATTGCTT TACTTCCAAG CGGAATCAAA GACCCAGGGG
AGTTCATAGA ATCCAAAGCT GAAGCGGCCT CAACAACCAT TGCCGATGCC TTTCAAACCG
AGGTACTTTC TAAATCGCAA GATTGGATTG ACTGGTATTT GCAGCAATTG CTGGAGAGCT
ATGATTCCAA GGCCGGTAGA GGTAGGGCCG GTAGTTTCGG CGATGTTTTT GAACGTGTTG
CAAATTTTTT GGCCAACAAT ATGGGTCCAG CGGATCGAAC GAAACATGCG TGCGAAATTG
CCGTGTCTCT TGCTAAGACA ACTGCAAACG AAATGGGCTC TGATCACGCA TCAAGCGCGG
TCCAAAACCA GTTAGAGTCA GATCTTATTG ATCTCTCTTC TCGTTTAGCG GAGAAAAAGA
GAGCCATTCA GAGGCGGACT GAGTTGGTGA ATTCAGATGG TAATGTAGGA TCACAGAAAG
ATGCTCTTTT TGCGTTAACT AGAGGGAGCG GGCCAAGTGC GGATGAAAGT GATAAGCTTT
CAAGTAGTGC GTCAAAAGGT AGCGATCTGC TTTCAGTTGC TCTAAACGAC GTAGATTTGC
TAGACAAGTT TCCCGCACCC TCTTTTGAAA GAGCATCTCT AACCAGGAGA AAACGGACAC
ATGATGCGGC AATTTCCAGA ACTCTGAACA AGGCTTTGAC GCCACATTTC TCAGGTTTTC
GGTTTATGTA CAAGAGTGAC TCACAGTGGC TGGGTGTCGA TGAAGATAAG GTTTGTTCTG
CATCTTGATG AGGATTCTTG TCTACAAATT TCTTTTTCTC ATATCTTTCA TTGCAGCTAA
AAGGCGGCGA ATTGACCTTG GGCTATATCA AGAATACCTG GCGAAAGAAA GAAAAGAGTA
TTTATTTCAA TTCTAACGAT TATCATGGCC ATCAATTTCT CACTGAAGAT GCTATGGACG
CAGGTTATGT CAACCGCAAT GTTAGACGTG ACCCTTCGTT TGTCGAAAAG GGCGTCGCCT
GTTTAGTCTA CCGAGACACG GAGCTGATGC TGAAGACTGC AGAAGATAAC ATGCTGACTA
CGCTTGTGGA CTGCCCATCA GCGCGCACAG TATTGAAGAA TATGCTTGAT GCACGCTCGG
CTACTGGGTC CAGCAATCTA GTTGAGTGGA CCAATACCGA AAAAGAATGG CTCTTTTCTA
CTTTGGTATA TTCCTCTTCT TCCATCCCAG AAGGCTGCAC CAATCGAACT GAATTGATGC
GCTTTCTTAA ACGTATTCCT GACTGTCCAC CTCAGGCTTT TATCCATACC ACGGTGTCCG
CAAATCGCAA CAGCAATTCA AAAAAGGACT ATACCAAGAG TCTTCCTGAA TCTTCAAATG
AGGTGATTGG CGGGAAAGAA ATTTATGGAA GATTTGATAC CTACTCTAAA GTAGGGAACG
GGACTTTATC CCTCTTCTTC AACGGAGTTC GAAACGATCT GGATAGCGCA GCCTCTTTGA
ACGTAACCCG AGTAAACGTT CTTACACAAG AGCGCTGGGC CGCTGTACTG TGGGCATCCA
CGGCACATCA AGCAAGACAA ATTCGAGAAA AGTTGCTATC GCTCTCGAAT GCTATGGAAG
GTCGTACGGC GACTGGCATC AAAAACGTCC GTTTTGCTGT ATCACCAGAA GTTACGCTCA
GCTCAGACCA TCCAGTAGAA GGCGACGGAA ATCCTTGTGA CTCAACAAGG CTCCGTGACC
TCACCATTGC TCTGCAAAAC AAGAATCGTG TTCTTCAAAC ACTTTCTGAT TCTTCAAAGC
GCCTTTGGAC GAAGCTGGTT GATGAGACTC TGTCTGACGG TATAGAGGGA CATGTTTCGG
CATCCCTTCA AATGGACCTA TCAGTCAGGT TAGACGAATA TTTAAACGCA TTCATCGACG
TCCCTTCGCA GAAAGTGGAA ACGACACAGA AACTGGAAAC GATATTGTCG GGCTTAGAAG
ATGAAGAGCC TTACGAAGAT ACACTTGAGC GAATAGCGAA AGATTGGGGT GAATGGGCAG
ACGACGATTA TTTATGGACA ATGGATGACG CCATTAGCAA GACGAATAAA AAGCAGGTTG
CTTCCCCAGA CTTAATTTCA GCAATTACCG AAGACGAAGA TGACGAAAAT GTGGAAGATG
CACTACAAAG GATTTCTCGA GATTGGGCTG AGTGGGATGA GTGA
 
Protein sequence
MQVFFFLLLE LSSVAGFTGL WSRSSGWRNQ AHTSYSSWLR VVALSPDADG TTDQDDFSLL 
RNATNNKAPV RSNEIGNGRI DFNQLSEMKE SIDIISVIES YNLDRFERKS DDRATALCPF
HDDENPSLSI DRTRKMYKCF SCGAGGDAFR FVREYSKLKG QQMSFYEAVL EVSTKFGDGH
VEGIGKSVQA ESPELRQRKE RTLYANAVAA AFYATCLTKP SAGGARQYLR QRGLTPEAVR
TFALGFAPDF YYGANRSRKK WGEDSLVQHM QSLNFTVDEL LEAGLATETK TTEEPVQKFS
FGEDPLLEGA FTSKPTSTSC PFDSIIDRFR FRIVVPIMDK AGANVLGFGG RILPSIVEMP
NAYNPPKYLN SPESLVFKKK CILFGHSLAK ETVKQPKKSE HEQLANTLIL VEGYMDVMSL
WTIGVRNVVA AMGTAVTMDQ LAIAAKSIRN GNLVLCLDND SAGLLALERL CSNGLLSRIV
SKYGTEISIA LLPSGIKDPG EFIESKAEAA STTIADAFQT EVLSKSQDWI DWYLQQLLES
YDSKAGRGRA GSFGDVFERV ANFLANNMGP ADRTKHACEI AVSLAKTTAN EMGSDHASSA
VQNQLESDLI DLSSRLAEKK RAIQRRTELV NSDGNVGSQK DALFALTRGS GPSADESDKL
SSSASKGSDL LSVALNDVDL LDKFPAPSFE RASLTRRKRT HDAAISRTLN KALTPHFSGF
RFMYKSDSQW LGVDEDKLKG GELTLGYIKN TWRKKEKSIY FNSNDYHGHQ FLTEDAMDAG
YVNRNVRRDP SFVEKGVACL VYRDTELMLK TAEDNMLTTL VDCPSARTVL KNMLDARSAT
GSSNLVEWTN TEKEWLFSTL VYSSSSIPEG CTNRTELMRF LKRIPDCPPQ AFIHTTVSAN
RNSNSKKDYT KSLPESSNEV IGGKEIYGRF DTYSKVGNGT LSLFFNGVRN DLDSAASLNV
TRVNVLTQER WAAVLWASTA HQARQIREKL LSLSNAMEGR TATGIKNVRF AVSPEVTLSS
DHPVEGDGNP CDSTRLRDLT IALQNKNRVL QTLSDSSKRL WTKLVDETLS DGIEGHVSAS
LQMDLSVRLD EYLNAFIDVP SQKVETTQKL ETILSGLEDE EPYEDTLERI AKDWGEWADD
DYLWTMDDAI SKTNKKQVAS PDLISAITED EDDENVEDAL QRISRDWAEW DE