Gene PHATRDRAFT_42822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42822 
Symbol 
ID7196427 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1227886 
End bp1230842 
Gene Length2957 bp 
Protein Length837 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177245 
Protein GI219110987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA GCCCACTGCA AACAAAGGAA AAATTCCCTC CAATCGGCAA ATCCCAAAAA 
TTATCATCAA ATGCCAACCC GACGCAAAGT TTCAATCAAG CCGTGGCACA CGAACGCGAA
AGTGGCAAGC CACGAAGTAG TGCAGAGGAA TCTGTTGATG AAGGATGCCG CGTTTGTGGA
ATGGATGACA ACTATTCCAG ACTATTACTT TGTGAAGGAT GCAACGGAGA ATATCATACG
TACTGCTTAA CTCCTCCACT TGAAAAAGTC CCAGTCGAAG ACTGGTATTG CGGTAGGTCG
CAGAAGCAAT GCAGCGTGTT TTCATCATTG GAGCACTGTA TACGCTAACG ATATCTATTT
TTCTAGATCG GTGCACAGCT CTTGTCGAAA TCCTGAACAA GAAAAGTGGG GGAGAACCGA
TTGGCTCTAT CCCTCTTATT ATATCTCAGG GATCCGAAGG CAATTCCAAG GCTGATCCAT
CCCTGCCATC AAATCACTCT ACTAAAAAAA AGTCTTTGGT CGAAAGTTCG CCGTATGATA
CTGCAGATGA GGAATTACCA ATGAAAGAGA TTGATACGCG TAACGGATCC ACCGCTCCAG
AATACCTCGA TGAAAACACG CTACGTTTGG TTAGGCTGTA CGTTGCGAAG TGTCGACCTC
AAGATATGCT AAACGAAGAC GACGTGATTC TGCTCATGCA CAGAATGGAC CGGCTCACTG
CATACCAGGC TGAGTTTCAG CTGAATATTC CTGGGCCAGC CAATGAAAGA CAAGAGGTTT
TGCTGGAAGA AATTAAAGAC GAAGAAGAAG ACCATGTTCT TGGGCGAGAG TTTTTCCACA
AACTTGTCGC AAAAATTCAG AAGGAAAATG TTGAATCACT CGAGAAACGC ACACTCGCCT
GGTTAAAGCG GGCTACTGGG ATCCATAGGA GTAGTACTGA ATCGTTCGTA AGAGACAACA
CATCTAAAAC AGGCTCCTCT AGAACGAAAG GGTCTGGTCG GCGATGCACT AGAGGAATGG
TTCGTCAATT CAGAAAAAAA AAGAGTGAGA AAGGTCTCGA AATACTAAAA GGGGCATTGG
ATGAGCTGGA GCCGACGCTT GGACCCTTTC ATTCCGCTTT TCGGAAGCGC AGGCGTTCAA
TATCGACCTA CGATGACGAC GAAAGAAGTT CGGGGAAGCA GCGAAAAAAA CAGGAACAAG
TACAACGAAA GAATCAAAAA GGAATGAAGG CTAAGATGAA TAAAGAGATG AAGAATGGGC
TTAAAAGAAC GAAGCCAAAT AACGAAAACA AAAGGAAAGC CTCTCCAAGC TATCCCAATC
TGCTTCATGC TTTGGATTGG AAGAGACGAA AGCAGCTACA TTCGAAAAGG AATAAACCTG
GATCGAGCAT ATTGCTGGAC ATAAACCATG CGCAGAGGTC GTTGAAGTTG CCAAAGAAGC
TCCGAAACAA GGGCATCGTT ATGACTTTGC CTCGCTGGGG CAGCATCAAA CAGGCAGAAT
CATTCGCTCC CAAAGAATCG GTGTTTGGTC CCCCTCGTGG GCTTTACTCT GACCAACACT
ATTCTTTCTG GTCTTTAAGA CTCATGAACT TTCTTCAAAG CAGCGCGAGG ACTTGGGTGT
CGCATGAGTT TTTTTACAGT GACCTTGATA AAGCTTGGTA AGTTTCATCT ACCGTCGGTC
TTGCCATTCA TCGAGTTCCA TATGCTCACG ATTTTGCGTT AGGTACAATA GCAGTGCTCT
TTCGAAAATG GCAAGAAGGT TTGGTGTAGA TCCAACGATC AGTTTAGATT CAGCAGAATG
GAAGTGTGTC AGACGTGCTT TACATGGGAT AAAAGCGAAG CCGCGTCGCT TTTCACGCTG
CTTTATTTCT GAACAGCTAC ATGAAAGGGA CGAATTTCGA AGTGGAGTCC GGCTGCTTCA
ACAAAATTTA GGTGCATCCC ACGCCGCCTA TGATTTGAAG TCCTGTATCC CCGTGGGATC
AGTTGTAACT GCATACAGTC AAACGTTCGG TATGCTTCAA CGTGGTACTG TCTTGACGTT
TGAAGCTCGG AACGCCCATT ATCTGGTGCG GTTCGAAAAT ATGGACTTTG GGTACGAGTA
CTGCCCAGAC TCTGAAGTCG CGAGCCATGG TTCGGTATTG CCACTCCATG TGAGCGGCGG
GGCGGAGTCA AAGACAACTA CCTCTAATGC AATACTTCGA AAGTATTGTG GTAAGTGCAG
ATACAGCTCG ATTGACAAAC TAAATTTCTC GTTTGACAGT GCACTGATGT ACTTATCTCT
CTCCCCGCAC AAAATTTTGT TGACAGCGCC GGCATGGTCA CGGTCTATGA AGTTGGCAAC
GGAGCTCGAA GGTTGCACAG AATTTACCCC TTTTCGCAGC CAATCGCAAA AAAAGGGACG
TCGGTCCATT TCGGAGACAG ACAGCCATTG GGCATTTATA AAAGAAGCAG CTGAAGAAGA
AACGTTGCAA TGTCTACTTG ATGTCATCAA TACGGCAGCT ATACGAAAGT CTGCACTTTT
GAAAGCGATC GACACAGTAA TCACTCCTGC AAATTCAGAG TTGGAGAAAG CAAAATCGGC
CATTCAAACA CAAGAACGAG AAGGAAATCT TGCTTTGCTT GTTTTGAACT TGGAGAAGAC
GAACAAAACG ATTCGTAATA GTGTCCGGAA AATACGGCTT CTGTACGCCC AAGTATATCC
ATTACGAATG TAAGTTGATA TGTGAAAGTT GGCGTAGCAA GCTGCCTTCA CTGACGCAGA
AATCTTTATT CACCTCCTCG CAGCACTCAA GTTACGTCTC ATCATGATAC AATTTCCCGG
GCCATATACC ATTCTTCTGA ATGCGGATAT GGTCCAATCT CTGATGCGCT TATATATCCC
TGGGTTACGA GTCTTTTGGA GAATACCGGG ACTATAGGAA AATTTATCGC GGCATCTCTA
CTTCCCCAAC GGCGTGA
 
Protein sequence
MTKSPLQTKE KFPPIGKSQK LSSNANPTQS FNQAVAHERE SGKPRSSAEE SVDEGCRVCG 
MDDNYSRLLL CEGCNGEYHT YCLTPPLEKV PVEDWYCDRC TALVEILNKK SGGEPIGSIP
LIISQGSEGN SKADPSLPSN HSTKKKSLVE SSPYDTADEE LPMKEIDTRN GSTAPEYLDE
NTLRLVRLYV AKCRPQDMLN EDDVILLMHR MDRLTAYQAE FQLNIPGPAN ERQEVLLEEI
KDEEEDHVLG REFFHKLVAK IQKENVESLE KRTLAWLKRA TGIHRSSTES FVRDNTSKTG
SSRTKGSGRR CTRGMVRQFR KKKSEKGLEI LKGALDELEP TLGPFHSAFR KRRRSISTYD
DDERSSGKQR KKQEQVQRKN QKGMKAKMNK EMKNGLKRTK PNNENKRKAS PSYPNLLHAL
DWKRRKQLHS KRNKPGSSIL LDINHAQRSL KLPKKLRNKG IVMTLPRWGS IKQAESFAPK
ESVFGPPRGL YSDQHYSFWS LRLMNFLQSS ARTWVSHEFF YSDLDKAWYN SSALSKMARR
FGVDPTISLD SAEWKCVRRA LHGIKAKPRR FSRCFISEQL HERDEFRSGV RLLQQNLGAS
HAAYDLKSCI PVGSVVTAYS QTFGMLQRGT VLTFEARNAH YLVRFENMDF GYEYCPDSEV
ASHGSVLPLH VSGGAESKTT TSNAILRKYC APAWSRSMKL ATELEGCTEF TPFRSQSQKK
GRRSISETDS HWAFIKEAAE EETLQCLLDV INTAAIRKSA LLKAIDTVIT PANSELEKAK
SAIQTQEREG NLALLVLNLE KTNKTIRNSV RKIRLLYAQV YPLRMKIYRG ISTSPTA