Gene PHATRDRAFT_46559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46559 
Symbol 
ID7201699 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp702068 
End bp705505 
Gene Length3438 bp 
Protein Length1009 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181058 
Protein GI219120648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTG CCTACATTGG ATGTCTTCTG TTGGCGCTCT TTGCGGATGA AGGCGAAGGC 
GCCTCTGTCC GTGGATCCAA AGACGACGTT GTCGCCGTCG AAGCACCCCC TCGAGTCCGT
CACTCTTCCA GCACCGCGCT TGTATCGGTT CCTCAACGTC GACTCGCGGA AGATGCTTTT
GAACCACTCA CTTGCAATTC GAGCTTTTCC AGTTGCATTC CATGGACATC GCGTTGGGGG
CGGAGTGCCG TCAAGACCAC CCTCATCGTC ATCCCATGTG GCCAATGTAT TGTGATGAAT
CTCGACAGTC CCAAGCTCAC CCTGCAAGAA GGTATCGATA TCCGGGGTAA GCTCGTATTT
CCCGACCGTT ATGAGCTTAC TGTGGAGACA CCAGAGGTTG TGGTTCAAGG CGAGCTCGAG
ATGAGAAGCT CCAAGCTTGT CGACGGGAGC CCCGCAATTA AGTTTGTCTT GTATGGTAAC
GGTGATCGAC ATTTTGATCC AGTCAACAAC AACAGGAACG CTTGCGGCGG GTCCAGCTGC
AACGGAGGGG CTCGCCCCAT TACAGTAGCT GGTGGAAAAG TCAACCGTAA GTTTATGTCA
ATGAAAAACG AATCACGCTG GTTTCTGCAC ATCGATTCGT GACGATCGGC ACTCATAAAA
ATCTCTCTAC GCTATTTCCA GTTAATGGCC TTCCTACAAA CACTCCAACC TGGCTACACG
TTTACGACGT TATCGGAAGC TCGGCCATTG TCGTTTCCAA TTCGGTCCGT AATAAATGGG
GTGCCGGCGC AACTATCGTC ATCACCTCTG AGCACCAAGG TTACTTTGGC GAGCAAGTTC
GTAAAATCAC CAGCATTTCG AACGTCGGCT CCAATAGCGT CCGTCTCAAT CTCGACCAAC
CCATCAACCG TCCGGTCACG CTTCGCGACA GCCCAGACTT TGCCACCGAG GTAGCCTTGC
TGTCACGCAA CATTGTCTTT GAAGGGGCCC CCGGAACGAA GGGTGGCCAC TTTTGGATCA
TGCACACTCC CCGAGTGAGG CAGCGTATCG AAGGAGTGGA ACTCGTTAAC TTCGGACAAG
AAGGCCTCTT GGGTAGATAC CCAATCCACT TTCACATGTG CGGGGATGTC TCAGGCTCGG
TAGTAGTCAA GAATACCATT CGCAATTCCA ACCAGCGCTG CGTTGTCGTC CACGGTACCA
ACAACCTCCT TGTTCAAGAA AATGTAGCCT ATTTCACCAA AGGACACTGT TACATGTTAG
AAGATGGCAT CGAGACGGGC AATCAGTTTA TACGAAACAT TGGCATACGC ACTATTAAAG
CAAAGGTCAC TATTCCGAAC ATGGGTAGCA ACGGCAGGGA ATCTGATAGG TCTGCCAGTA
CCTTCTGGAT CACGAACGCT GACAACTCGT GGATCGGAAA TGTAGTAGCC GGATCCGAAG
CCCTGGGCTT TTGGTTTGAG CTGTTGGTCC GTGGAAACCT GGCCAACGAG CACCAAGACT
TTGATCCTAT GATGGTTCCG ACCCGCAAGT TCGAAGACAA CGTCGTTCAT AGCGTATTTG
GGGTAGGGAT GACCTACTAC TTGAGCGGTT ACATCCCGGA AACACTGCAG TACTTCAAAA
ACAACAAGTT CTTCCGCAAC CATCACCTTG CGCTCCGTAT CCACCGGACA CGAAACATCG
TTCTGACTGG CAATAAGTTC TCGGACAACA GATATGCCAT TCAAATCGAT CGTGACGAAG
AAATTCATGT CACCGACACA ACCATTGTTG GCTATTCCGA TCTATTCAAG GACGTGGTCA
GAAGGAATCG CTTTGCCCAA GCACCTTGCG CCCAAGGGAT ATCTTTCCAA TCGACTAATC
CATGGAAAGA CAAGATGGAT TCGGAGCTCA ACGGTGTCAT TTTGGACAAA GTCAGATTTT
CTGGATTTTC CAACGCGGTG TGTTCGTCTT CAACCGCAAT CGAGCTGGAT TCCCGACTCG
ACGGGTACAA ATCGTTCGAG ATGTTCTCAC AGTACTCCGG CGCCACCGTA AGTGACGCGA
ACTCGATTGA CTTTTGTCGT GGAAAGTCTG CCGGTGCACG TGATGTGTAC GTGTCCGATA
CCACAGGTTC TCTTCTCGAC GGCGTTGCCT CTGCTCCTTC TACTCTGATG GTCAACTCGC
CGGAGCTGGC AAGCTTTGTC AATCCCAGTG CATGTACCGA AAACGCAGCT CGATGCTACA
CCTACTGTAG CAACACTTGC TTCCGCACCG TTCACTACTA CGTCCCAGTG GGCCAAAGTC
GAGACTACAA GCTCAAAGTA TGCGACCGCA AGGACGCGTC CGACTGTACC GTACTCTCCG
GTTACGTACA CTTTAACCAC CCCTGGCCTC GCCGATTTGC CGTGCATGTC CCGTCAGGAC
GGGAGTACGA CACTTACTTC CTCGACAAGG GAGTGCCTGT GTACCCAACG AATGTCGAGA
TTGTGTTCCA GGAAAAGCTT TGCCCAACGG CACCGGATGA TGATGATATC GCTCTTCTCT
ACAAGGCTCG AGGTACCACC TTTCCCCCCA CGCCTAGTCC GACGACAGCT CTCGCCGCAT
GTGGAAACTT GATCGCAAAC TCGGACTTTG AGCGTGGTTA CAACGGATAT TGGGATGCAC
AAGGAGCCGG TACATTGTCT ACCACAGCTG GCTATAAATC TGCCACAGCT ATGTACTACG
CCTCTGGTAA TCGCAACCGG TACTGGGTGG GACCATCACA CCAATGGCGA GAAGGTCTGG
ACTTGAAATG TCTAAAGCAG GGTACCACAT GGGAGTTTTC TGCTCGCTTG AAGCTTGTCG
ACTCAACGAC TGGAAGGGGA GCCTCATGTA ACACAGGCTC GTCCTCGGAA GGCGAAATGT
GCCCTCAGGT GCAGCTCATT GTGCGCGACC AATCTTGGAC TCAGCATTCT TTCCGGATCA
GCGGCTTCGA CGGTGGGGAC ACTTGGGTAG CCAATGGATT TAACGAGTTC AAAGGCTATT
GGACAATCCC AGCGAATGGT TCAGGATGGC GAGGGGGTGT CGCAAACATG CGAGTGATAC
TTTCCGAATT CCCACTTGGT ATGGATTTGG TTGTTGACGA TTTTGAGTTG ACCCAATTTG
TTTAAAGCGA CGTACACGTT ACCGTTGGAT CGATTTTGCT GGAAGGGACC ATCCGTGCAT
GCCAATGTTG GTTTCTTGGA ATTTTAATTT GAGGTACAGT TAAATCTGGC GATCGATGAT
TTCGGTTGTG GCCTGAGCAA GACAACATTT TCGGATTCCG CCCACAGCCG TACCGTGGTA
GAACTACTTG CTTGAAGCTC ACTTGCGTGC AATAGCATTA ATCCAAGTAT AGCAAATTCC
TTATTTGTAA GTTGGTTCAA GATAATGCTC TACCAAAAAA AGTGCAGGGG CTCCTTCCGG
ATCGTCAGTC TCCTCATG
 
Protein sequence
MKVAYIGCLL LALFADEGEG ASVRGSKDDV VAVEAPPRVR HSSSTALVSV PQRRLAEDAF 
EPLTCNSSFS SCIPWTSRWG RSAVKTTLIV IPCGQCIVMN LDSPKLTLQE GIDIRGKLVF
PDRYELTVET PEVVVQGELE MRSSKLVDGS PAIKFVLYGN GDRHFDPVNN NRNACGGSSC
NGGARPITVA GGKVNLNGLP TNTPTWLHVY DVIGSSAIVV SNSVRNKWGA GATIVITSEH
QGYFGEQVRK ITSISNVGSN SVRLNLDQPI NRPVTLRDSP DFATEVALLS RNIVFEGAPG
TKGGHFWIMH TPRVRQRIEG VELVNFGQEG LLGRYPIHFH MCGDVSGSVV VKNTIRNSNQ
RCVVVHGTNN LLVQENVAYF TKGHCYMLED GIETGNQFIR NIGIRTIKAK VTIPNMGSNG
RESDRSASTF WITNADNSWI GNVVAGSEAL GFWFELLVRG NLANEHQDFD PMMVPTRKFE
DNVVHSVFGV GMTYYLSGYI PETLQYFKNN KFFRNHHLAL RIHRTRNIVL TGNKFSDNRY
AIQIDRDEEI HVTDTTIVGY SDLFKDVVRR NRFAQAPCAQ GISFQSTNPW KDKMDSELNG
VILDKVRFSG FSNAVCSSST AIELDSRLDG YKSFEMFSQY SGATVSDANS IDFCRGKSAG
ARDVYVSDTT GSLLDGVASA PSTLMVNSPE LASFVNPSAC TENAARCYTY CSNTCFRTVH
YYVPVGQSRD YKLKVCDRKD ASDCTVLSGY VHFNHPWPRR FAVHVPSGRE YDTYFLDKGV
PVYPTNVEIV FQEKLCPTAP DDDDIALLYK ARGTTFPPTP SPTTALAACG NLIANSDFER
GYNGYWDAQG AGTLSTTAGY KSATAMYYAS GNRNRYWVGP SHQWREGLDL KCLKQGTTWE
FSARLKLVDS TTGRGASCNT GSSSEGEMCP QVQLIVRDQS WTQHSFRISG FDGGDTWVAN
GFNEFKGYWT IPANGSGWRG GVANMRVILS EFPLGMDLVV DDFELTQFV