Gene PHATRDRAFT_44938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44938 
Symbol 
ID7199839 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp693744 
End bp696568 
Gene Length2825 bp 
Protein Length757 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178828 
Protein GI219116066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGGG TTAGAGCTGC CCTTTTCCAA GCTCTAGCTT TAGCTGTTCT CTTGAATGTA 
CCAACTCTGG AGAAGGTAGA CGCTTTTATC CTACCCTTTT CTGCTTCCTC ACTGGCAGCT
CGTCCAGTCG TGAGTCGCCC AATTCCCCCA ACGGCCTCGC ACTCGTCGTC AGTGGACGAC
TGCGGTAGTG GCGATGCTAC TATTTTTTCC GGCGATCCAC CGCAGCTGGC CAAAGAGATG
GACATTCGGG AACGGATTCG AGCCGAGTCC TTTCTCCGCG TGAATGGAGA AGCTGTACGT
ATGGACGACT TGATCGGAAG CCCCAACGCC GAGCACTCCC AAAGGTCCAT AGTCGTTTGC
TTGCGTTCGC TCGGGTGACC CTTATGCCAG GAGCTCATCG TTCAATGGAG TCGAAGAATA
GATGAGTTAG AGCAAAGCGG TGTCCGCTTG GTTATGGTGT CGATCGGTAA ACCCGAGAAA
GGCCGTCAAC TGATTCAGCA CTTGGAAATT CCAAGTGGCG AGGATTATTT GTTTGTCGAT
CCCGTGAACG CACTGTACGA CGCCATCAGT CTAAATCGCG GAGTGGACCG TACCTTTTTT
AACATTAATA CGCCCCTTGC CTTCGTGGAA CGCTTAACGA AAGAGGATGG AATGAAGGAT
TTGGTGAACA TTCTCGGCAA ATGGTCTAAA GGTATGTGGG TACGATGATT ACTTCGAGAA
TCTTGCCTGC TTCTCTTGCA TGCAGACTTT TTTGACTTTT AAATTCCATT ATTACATTCG
CAGCGTTCTT CATTCCCCCC AAAAATGAGC AGGCCTTTCT GCAGGGAGGA ACCTTTGTTT
TCGACGGCGC CAAGACTTTA TTCGCGCATT ACGATCCTTC GACTGCTTCC CACGCTTCTG
TGGACAACGT TCTGGAGATT GCTTATAGAA AACAGAGTCT AGCGCCAGCA AAGTAGATTC
CGACAAGACC GGCTGGAACA AGTGGCTGTT ATAGTTTCAA TATGTGTAGT GAATGACGCT
TTTGTGGTAC AGTAAACCTC GATCTTACTG TCGACAGAGG CCGATCAAAA TAGAAACTAA
GTATAAGAAC TTTACAGCTA TAGTTCGAAT GAAATGGGAT CCTTTACGTT AGGGTACGCC
TTTCTTTGAC TGTGTGAAAA TATAGTCTTT GTTTGAACTC CGTTTGCGAT GTTGAAAATA
GGGTAGGGCA CGTTGCTTTT ACAGTCCCAC CGTCAGTGAC GCTCATTTTT TCGACCCGCG
CGCACGAAGT CGTAAAGCCT CAGCCTTCGA TGTGAATCCG CACCGCGCAT CGCCAATGGC
ATTTCTCTTC CGTCAATCAG TGCTAGACAC TTTGCTTCCA ATTGCACGGT GTACGCGGGT
ATCTGTGGAC ACTGATATAG TAATGGGTTT GCCTATGCTT GCACAGCTGG TTGAACGAGC
CGTCAAGCAT CCAACGGTGA GCCGCGCGGC TCTTTTTTGG CCAATGTCGC ATCGTTTTTT
GTCGTCCCAA CGACCGAAAA CAACTTCGAC GTCACTCTTT GACAGCGCGT CCGATTTTAT
GCATTCGCTA CGGGTCAAAG CCACTAACGC GCTCACTGCT ACACTTTCGG TACAAGAACG
AGAGCAGCTT TTCTCGCGAC TTACTCCGCC TCAACAGGAA AAACAAGACG GCAAAACAAA
TGATGACAGG ATGGACATGG AGCATAGCAT TGCGGAAGCC GTTGCTGCAG CTCGTGCACA
AGAAGCCAAG AAACAAGAAG ACAAATGGTC CAAGACGAAA GAAGCAATTG AGAAAGAAGC
TGAAAAAGCT GCTAGAGAGC GAGTGGAAAA TGAGTTTAAA ATTCAACGCC GAAGAATAGA
GTTTGAACGC TGGCAGACAC AGGTCGAAGA AGAAAAACAA CGAAGGAGCA ACAAGTCGAA
AGTAGAAGCA GCTCCTGGTC CGGTTACACG AGGTGAAGTC GTGGAACAAA GCGTGGAAAG
CGAAATAAAC GTTCACCCGG TTCTGGGCGC AGCGATTGCC GATTTTGGAC ACAAGCGAAT
TCATGTGGTA TCCGCTCATG CTTTATCGAC GATTCCTGTC TGGAAAAAGC AACGAATTTA
TCGACACGAT CGCGCTAAGG CTATGGCGAG TGACAAAATG AAAACGCTGC ATTTAGGGAT
GCCCGGAATA ATAGGGTTGC ACGAGGTGAG GGAAACAAAC CCAAATAAGC TTCTGTACGG
GAAACTGGCT TAAATTCTGT GCATTTTCGC TTTAGGACCT GAATGGGAAA TTATCGATTA
TTGATGGGCA ACACCGTGTG GGAATGATGA CTATCCTTCA TGAAAAATGT GCATCGCATG
ATGACTTTGA CTTGGATCGA GTTCTAGTGG AAGTTTATCC ACAGAACCCT GATCATGTGG
ACACACATGC TCAGGACCTT TTTCTCGAAG TGAACAAAGC CGAACCAGTC AAGTTGGTTG
ACATGCCTGG GGTCGCCAAA GGATCGGATC GTAAAATAAT TTCAGAAGGA GCTGAGCGGA
TTGCCGAGAA GTATGCGGAG ATGTTCAAAT CCAGTCAAAA ATGTCGCCCC CCTCATTTGA
ACATTGATAA TCTTCGCGAC GCACTGTTTG CGTCGAATGC AATAAAAAGG CACAACCTCA
AAACCTCGAA GGCTGTAGAA GCTTGGATGC TCGCGAAAAA CCAATCTTTG GCTGATCTCT
ACAAGGACCC AGCTGAACAA GAGAAGGTTT CTAAGACGGC TTACGAGAAA GCAAAAAAAT
TTGAATTCTA CTTGGGACTT GACCTTAGTT GGCTGTACAA ATAAAGCATG GCGTGTCAAA
ATTCC
 
Protein sequence
MKRVRAALFQ ALALAVLLNV PTLEKVDAFI LPFSASSLAA RPVVSRPIPP TASHSSSVDD 
CGSGDATIFS GDPPQLAKEM DIRERIRAES FLRVNGEAEL IVQWSRRIDE LEQSGVRLVM
VSIGKPEKGR QLIQHLEIPS GEDYLFVDPV NALYDAISLN RGVDRTFFNI NTPLAFVERL
TKEDGMKDLV NILGKWSKAF FIPPKNEQAF LQGGTFVFDG AKTLFAHYDP STASHASVDN
VLEIAYRKQS LAPANPTVSD AHFFDPRARS RKASAFDVNP HRASPMAFLF RQSVLDTLLP
IARCTRVSVD TDIVMGLPML AQLVERAVKH PTVSRAALFW PMSHRFLSSQ RPKTTSTSLF
DSASDFMHSL RVKATNALTA TLSVQEREQL FSRLTPPQQE KQDGKTNDDR MDMEHSIAEA
VAAARAQEAK KQEDKWSKTK EAIEKEAEKA ARERVENEFK IQRRRIEFER WQTQVEEEKQ
RRSNKSKVEA APGPVTRGEV VEQSVESEIN VHPVLGAAIA DFGHKRIHVV SAHALSTIPV
WKKQRIYRHD RAKAMASDKM KTLHLGMPGI IGLHEDLNGK LSIIDGQHRV GMMTILHEKC
ASHDDFDLDR VLVEVYPQNP DHVDTHAQDL FLEVNKAEPV KLVDMPGVAK GSDRKIISEG
AERIAEKYAE MFKSSQKCRP PHLNIDNLRD ALFASNAIKR HNLKTSKAVE AWMLAKNQSL
ADLYKDPAEQ EKVSKTAYEK AKKFEFYLGL DLSWLYK