Gene PHATRDRAFT_42429 details


Gene Information

Locus tag: PHATRDRAFT_42429
Symbol:
ID: 7196618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 30314
End bp: 34211
Gene Length: 3898 bp
Protein Length: 625 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176507
Protein GI: 219109505
COG category:
COG ID:
TIGRFAM ID:
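The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates, which is consistent with the Start bp, End bp, and Gene Length values in this record. It also shows how a GC percentage could be computed from a sequence string, and how many bases a 625 aa protein plus a stop codon would occupy.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so a sequence
    printed in blocks of ten bases can be passed in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


def cds_length_bp(protein_aa, include_stop=True):
    """Number of bases needed to encode protein_aa residues."""
    return 3 * protein_aa + (3 if include_stop else 0)


if __name__ == "__main__":
    print(gene_length(30314, 34211))  # coordinates from this record
    print(cds_length_bp(625))         # protein length from this record
    print(gc_percent("AAACAAAGCG"))   # first ten bases of the gene sequence
```

With the values from this record, the coordinate span matches the listed gene length of 3898 bp, while a 625 aa protein with its stop codon accounts for 1878 bp, so the listed gene sequence is longer than the coding region alone.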


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAACAAAGCG CGATTGCAAA CAGCAAGCGC CAACGGAAAA GCGCCAGTTG TTGGCTCGAC 
TCCGTTTGGT TTGCACCACT GGAGTAGCTC TCTACAGTCT TTTGGATTTG TGAGCTCTCC
ATTGGTATAG AAGGTCTGGC AATTGCAAGG ATCTTCGCCA ATTCAGTTGG GAGACCTTTA
GGGCGGCAAC AGTCAAACAA GGTCAAGCGC AGCGATCGAC CGCGCGAGGG TTGCTTATCA
GCTGCCTTGA GTGACGCTCC CTACGTACAC CGCTTTTATC ATGAACACGA AAGAATACCT
GTATTCGTGG CATCTCTCGC TGCGAGCGGA CCGTCCAGTG GAAGGTTGTG GGATGCGCCC
GCACGCCTAC ATGTACGGCA AAAAATTAGA CGAACGCGAA GACAAAACGC TGCCTCCGCA
TTCCAAAAAA ATGAAAGAGC CACCACCGCA GCACGAATTC TCCTATCGAT GGTTCCGCAG
TCCTTTGCAT GAGCCTTGTG CCTACGAGAA TTGTCCTCGT CGAACTTCGT TCTCTCCACA
CGATTGGTCC AGACATGCTT TAGGTGGAAC GGAATGCGGA TTGCAGTGTG TATCGACGCA
AAGTTCCTTG TTTCGATGCA CGTTTTGTAA TTCTACTTGC TTTGTGAATG CTTGGAAGAC
TCAGTACAGT GTTCCGAAGG AGGCCACTCG GACGGAAACT CATGGTCGGA CCCGTTCACA
ATCTTTTGGT AGTAATGATG AAGACGTCTT TGACGATACG GGTAGTGTAC GGTCTTCGAA
TGGATCTAGT CCAGCTCTCG ACACCCTCAG CTCGCCACCA CCATCGACTC CCCGTGGGTT
CCTCAGTGGA TACTCGGCGG GCAAGCAGCT GAACCCCGCC TCTGGTAGCA GTATGTACCA
CTCGGAATAC GATGCTGGTG ACGATTGGGT GGAATTCAGT CGAGATCAGC TTTACATGCC
AGGCCCTGAA GATGTCGGAC ACAAGCTCAA AATTGAAGCT GCGGCGTATT CAACCGATAC
CAGTGAACTA CTTATGTCAC GTGTTGTCAA AACAGACGTT GTTCTAGGAA GGGCTCCTGA
CCCACTTAAA AGGCAACTAG TTACCACTAA GGGCGGTGGA GGAGGTGGTC CTCGTTTCCG
CGTTATAACG TACAATGTTC TCGCCGAGAT TTACGCAACT CAACAGCAGT ATCCTTACTG
CGACTTCTGG GCACTTTCAT GGGATTATCG ATTCCAAAAC ATTCTTCGCG AGATCATTGA
TGCATCGCCA GAAGTTGTAT GCTTGCAAGA AATTCAAGCG GATCACTACG AGAATCACGT
TTACGTGGCC ATGGCTGACG CGGGATTTGA AGGCGTCTAT AAGCAAAAGA CGAGACAGAG
TATGGGACTT GCTGGAAAAG TCGACGGATG TGCTTTGTTT TGGAGACGTT CCAAATTTCA
TTTGGTCGAA TCCTACAGCA TTGAGTTCAA CGAAGTTGCG CAGAGACAAG CGACTCAAGT
GTTAGGCCTC AATCCACGAA GCGAAGAAGG TGTGGCCTTT TTAAACCGTC TATCGAAGGA
TAACGTGGCA CAGCTTGTTG TTCTAGAATT CATCCAGCCT AGTCGATCGA ATCGCGAAAT
ATCGCAAGTG TGTATTGCCA ATACGCATTT GTATAGCAAC AAGGACTTCC CAGACGTAAA
GCTGTGGCAA ACATGGCAAC TTTTGCAAGA GCTGGAATCA TTCATTATGA GTCGCGGAAC
GAATCTTCCT TTGATTATTT GTGGAGACTT CAACTCGACT CCAGATACAG CCGTCTACGA
TCTACTCTCG AGACAGACAG TCCATCCCGG CCATCCTGAT GTAAATGTTA CGACTGGCGA
CGACGTTCCT AACGTTCTCC CTGATGCGAT GAATATTACT CATTCGTTCC AGCTGGGCAG
CGCCTATCAA ACAGTATTGG GAGAGGAGCC GTGGACGACG AACTTTACTG TCAATTTTAA
GGGCGTTTTA GATTACATAT GGTATTCCGC CCAGAATTTG CGGCCGCTCT CAGCTGCCCC
GATACCAGAG GAAAAGCAAT TGACAAAGAA TGGGGAAGCT TTACCTTCGA CAGAGTACAG
TTCAGATCAC ATCATGCTGA TCTCAGATAT GCAAATTATT GGCAATGGAG CACGATAAAG
AAAGATCTAG GAGAGAATGA AAGGATGTGT GCTATTGTGG ACCGCCTTCT TTTGTAGTAG
CGCTCAATTT TGTTATGGTA GAAGTAGAAA CTTTGATGGT GGGACGAGGA AGAATAAAAT
CTGGCAATAA TGGCATTACG ATCACTTGGC ACTCCCCCCC AAAAGCATGG CGTAGTCATG
GCCTCCGTCG TTTTCCAAGT GAGTCACCAT CTTTTCAAAA TCTTGGACCC ACATCATTTG
CTCTGGTCCG CGGCCTGATG GCATTTTGCG TTTGGATGCC GCGTCGAACT CAACGGAGCG
AATAGGCTTG CCGTTCAGCA GTACACGAAC GACGTGCGAG TGTGAGTCCA CCGGTCCCTC
TTTGTGCAGC CGCACAAGTT CAAAGACCAA AGAACTTCCG TATTCCGGCC AAAAGCGCCA
GTCGGCACTG GTGTCGTCGG CCAAAAAGTC GGCTCCTATT CCGTACAAAA GACCGAGGAT
TGTGATGTCG TGACAACTAT AAATGGTGAA TGGTCGCTTT TCTTCGGTGT TGAGGCTTGG
TGCAACCTTC AGCGATTCCG TAATCTCTCT CAGTGGCGGT GCGGCGATTG CGGCCAAAAG
GCGCTTATTC TGGTACCATT TTCGGAATCT CCATGACAAG TGCATCAATG TTTGATGCGA
GAGTGAACAT AGCATTTGTT CAACTGAAAC ATCATGCTCG TAGTCAGAAA ACCGGGCCAA
GTCCATTCCG TGTGATGATC TGCAGACAAA ATGATCAGCG GCTTCGACCC AGTTGATCCC
GCTCGGTGCT CGCGAGCTGA AATCGCTCTT TCGGGGCCTC ACTAAACCAG GAAGAATATT
GGCCAAACGA GCCGCTAGTG GCGCCGCAGC CCCATCGCGC AGCATGAAGT CTTCTGATGA
GATGACTTCA TCAACTAGAT CAGCCATGAG ATCTGGGTTG CGATCGAACG CATTTAACGG
GTCTCGGGAG AGCTCTCGGA CACGGACTTT GACAAGTTCA TCCTTGCCAA AACTCTTCCC
TCGCCAAGCG TGGTTCGGTA CGCGAGCTTC TTCAAATATG TCGGGATTTA GCGTCCGTTG
CGGGGAGGGC GTATAACAGT TTGCACCCAG CATGCCATCA AGAAAGCTTT GAACTGACAT
AATGGTACGT AGATAGTTCG TAGAAAACAC TTTAACATCC CAAACGGACA GGAAGTCTTC
GGGGTTTTCC CATCGCCATT CGCTAAGGTT CGGTGAGTGA TGCCCATAGT GATTGTATCG
ATGGAAGAAT CGGTGTCCGT TTTCTTTTAA CTGCGACAAT CCCATCTGCG TTAAAAAACC
GAAAGGATTA CGGCCCACAT CAAGAAACTG ACCATGATTC GTGTTTTGAT GAATGTCGGG
CGGGAAGCAC CTTGAGTATG CTTCAAAGGC TGTGGCAGAA TCAGGTGATG GCAAACGTGT
CATCCAATAG GCAGCTTCTT CCTTTCTACG ATGGGATGGA GAAAGCGGCC TACTTGGTGT
TCTATCTCCA TGTCGGCAAA ACATCCAGAC GCCCTCGACG ACACCATCAT TCTTCAAGTA
GCGATGTGGG TCTTCATCCG CTTTTGACCC AAATGCGCCG GATGAGTTGC GTCGAAACCT
ACCTATAAGT CGAGAAGATC GTTTCGACCC TTCCAAGCCA CGAAGCGCTG TTGAACCCAA
CTTCTTGGGT CTCATTGTGA CCCTTAAGGG CAACCTCAAG AATACGCGGG AGTGCGCA
 
Protein sequence
MNTKEYLYSW HLSLRADRPV EGCGMRPHAY MYGKKLDERE DKTLPPHSKK MKEPPPQHEF 
SYRWFRSPLH EPCAYENCPR RTSFSPHDWS RHALGGTECG LQCVSTQSSL FRCTFCNSTC
FVNAWKTQYS VPKEATRTET HGRTRSQSFG SNDEDVFDDT GSVRSSNGSS PALDTLSSPP
PSTPRGFLSG YSAGKQLNPA SGSSMYHSEY DAGDDWVEFS RDQLYMPGPE DVGHKLKIEA
AAYSTDTSEL LMSRVVKTDV VLGRAPDPLK RQLVTTKGGG GGGPRFRVIT YNVLAEIYAT
QQQYPYCDFW ALSWDYRFQN ILREIIDASP EVVCLQEIQA DHYENHVYVA MADAGFEGVY
KQKTRQSMGL AGKVDGCALF WRRSKFHLVE SYSIEFNEVA QRQATQVLGL NPRSEEGVAF
LNRLSKDNVA QLVVLEFIQP SRSNREISQV CIANTHLYSN KDFPDVKLWQ TWQLLQELES
FIMSRGTNLP LIICGDFNST PDTAVYDLLS RQTVHPGHPD VNVTTGDDVP NVLPDAMNIT
HSFQLGSAYQ TVLGEEPWTT NFTVNFKGVL DYIWYSAQNL RPLSAAPIPE EKQLTKNGEA
LPSTEYSSDH IMLISDMQII GNGAR
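
The protein sequence can be spot-checked against the gene sequence by translating codons from the first ATG of the coding region. The following is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and because the Translation table field of this record is empty, the standard genetic code is used here as an assumption.

```python
# Standard genetic code (an assumption; the record leaves its
# "Translation table" field empty), laid out in T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, to_stop=True):
    """Translate dna in frame 0, three bases at a time.

    Whitespace is removed first, so block-formatted sequence text is
    accepted. A trailing partial codon is ignored, and unknown codons
    are rendered as 'X'. With to_stop=True, translation ends at the
    first stop codon and the stop itself is not emitted.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # Thirty bases starting at the ATG that opens the coding region.
    print(translate("ATGAACACGA AAGAATACCT GTATTCGTGG"))
    # Bases covering the last five codons and the stop codon.
    print(translate("GGCAATGGAG CACGATAAAG"))
```

The two fragments are copied from the gene sequence above. The first should reproduce the opening residues of the protein sequence and the second its final five residues, ending at the TAA stop codon.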