Gene PHATRDRAFT_49176 details

Gene Information

Locus tag: PHATRDRAFT_49176
Symbol:
ID: 7195667
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 129996
End bp: 133982
Gene Length: 3987 bp
Protein Length: 869 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002183932
Protein GI: 219127417
COG category:
COG ID:
TIGRFAM ID:
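
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the coding sequence consists of the protein's codons plus a single stop codon. Under those assumptions, the gap between gene length and coding length is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 129996
end_bp = 133982
protein_length_aa = 869

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding length = one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Under both assumptions, the remainder would be non-coding (intronic) sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

The first number should match the Gene Length field; the other two are derived estimates, not values reported by the database.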


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.450709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCCGG TCGTTGTTAC CACGGCGTGT GGTTGTTTCG CACCGCTCCT GGACTCTCCC 
CCCAAACAAT CCACCCAGTT CTTGTACGAG TACCCCAACA CGTCCTTCGA CAACAACTAC
AACAACAGCA TCAACGTTAG CACTAGCAAT ACTAGCGACC GTGACGGGAC TACCGACATT
GACCGCAACA GGGAATGCGC AGCCGACGAC CAACTTGGGG ACGACGCATC GCACGAATAC
CCGTACGGGA ACGTTCCACA CTTCCGCAAC GACAATAGCA GTAGCAACGA CGAACACGAC
GACGACAGTA GCAACGGTAC GACGACGAGT CCGCGAATGG GTCGTCGGCG TCGCAAGGGC
ATGAACTTTT TCGCTTCCAA AAAGTCTCGC GCCGCTGCCG CGGCCGCCGC GGCGGTGGAA
ACGGCTCCGG CCGACACGTC CAGCTTTCGA CCGGCCGTCA AGGCCGTCTT GGAACCATTC
ACGTCCAAAA GTAGTGTCGG ATCTCCGCGT GCCGATTCTG TCGTCGCCAA GGAAAACCCG
CTGCCTCGCA ACCACGACAC TAGTCAACCT CAACACACTG AGGATAGTAA GAAGGAGAAG
GATGGCGGCA GTACCAGTCC AAGCAACGGC ACCAACCACA ACAATGGTAA CTCCGGTACG
CACCCGAACG ACAGTAACAA CAACAACAAC AACCAGAACA AGAACAGCGA ACCGTCGTCT
CCCGTGTCAT CCGACGAACA ACGCGATCAC CATGACGACA CTCATAGCAA CCACAACGAC
CACAGCAACA GCAACGACGA CGACAGCAGT ATCATTTCTA TTCACAATCA CGACAGGATC
GACAATTCAC CCAAACGGCG AGAAGAAACT ACCGAACGCG TCACCACCAC CACCACCGAA
GACGACGACG ACAATGACAA CAACGAGCCG GGAGAGGACG ACAATAGTAT CCTTGGTATC
CGCGACGACA GTTCCTCCAC GACGAACCCA GCCCACTCTC CCATTCAGCG TCACGTTGCC
ACACCAGAAG AAAACACAAG CAGCACGAAC AGCTCCGTCA TCCTCGAACC CTCCTCAGCT
CTGCTCGACA ATCCACACAC CGCACTCCAC ACAACTCCCC ATCGCATCGA CAGCAGCGAC
TCGGAACCCC GTCTGTTCCG GACGTTCACG CCCGGTGGAT CCACCACGGA CATTCCCGCC
GTCGAAACCG CCGTATCCGA TCTCGGCGAC GTCTCGGAAG TTCCCCACGG GGGTCTTCCA
GCCCGACAAC CAAGCGGGCC CTTGCGGACG AAGCTCCAGT TGGACCTGCA TACCGATCTG
CTGAAGCGAT TGCGAAACAC CAACCGCATG CCGGAATACA TTCTCGCTAC GGAATCGTCG
CGACCCAGCG AATCGCCCCC GCGACCCAGC ACCCCCACCG AGGCCATGTC GACGACAACC
TCCCACGTGT CGGAACCCTC GTTGACCCCG GAAACGCCCG TGCAAGCCTC CCCCCTCGCC
GTGGAAACAG ATTCGAGTCC GGCAGAATCT TCCCGAACGG TACCGGAGAC GCCCGATTCG
TCCCCACGGA CACCCGTATC ACCGACAGCA ACGACATCCT CGGACGACAC ACCGTGCAGC
CCAACAACGA CGCACTCTTG CGTATCGGAA TCCCATGATC TCCCCGAATC GTCACCGGCG
GTACCGAACA CCGCTGCCGG AGTCGCCACC GAAGCCAGCG GACCTTCCTC GAGTCCAGAA
GCGTCTTTAA CAGTATCCCC GGAATCGTCA AGCGTGCCTA GACCGCCGTT CCCGACACAA
ACAACCGCCA CCTTGTTGGC TGAAGAAGCA CCCCAGAGCC CGGCAGAATC TTTCCAAACG
GTACCGGATA CGCCCGATTC GTCCCCACGG CCACCCGTCT CACCGGCAGC AACGACATCC
TCGGACGACA CACCGTGCAG CCCAACAACG ACCACCACGA CCGGCTCGGA CACGGCTGAA
GAACTATCTC CTGCTCCGGA AAGACTTCCC GTACCCAAAC AGCTACTGAC GGTGCCGGAC
GCAGCCACCA TCTCCGCAGC AGCCGAAGAA ACATTCTCGA CTCCGGAATC GTCTCCAAAA
GCGGTACCGG AAACGACCGA CTCATCTTCA GTGCCATTGT CGACAGAATC AGAGACAAAT
ACATCAGTGG GAGAGCTGCT GTCGAGTCCG GCAGCAAAGT CCTCGACAGC CTTGTCGGAC
GGAGTTAGCC AATCGGCTGC ACAAATAGTC CTGCAAGCAT CCACATTTAC AGAAGAAAGA
CCTTCGAGAC CTGTGGCGGA GTCGGAGCCG ACCAGTGTAT TGGCAGCGCC TTTGACGATT
CTGGAAACTA GCCCATCATC AGGTAATTCA TGCTCGAGTC CGTTAGAGTC GCCCGAAAAA
GACTCAAAAA CGCCCAGCAT AGATGTACCG CCTTCGAAAC CCGCACCAGA AATTACCGTA
TCCGTCGGGG TGACACCCTT GGTATCCCTA ATGGGGTGTG AGGCAACGCC CAACGTACTT
GTTCGGCCGT TGGAAACATT ACGAGAAAGC ACTCCGTTAG TAGTGAAATT GCCTTTGAGT
CCAGCCGAGT CGTCAACGTC GGACGCGGAA ACACCGATGG CAAAATCGTC ATCGCCTCTT
CCGAAAACAG CCAATTCAGC CGGAGCAGCG GCTTCTTGCC CAGTGGAATC CCTAGTGGCG
GCCTCCGAAA CTCTAAAAGG TGCTGCAATA CCCTTGAAAG TACCGGAAAC AGCCATAGTA
GCAGGAGAAA TCCTCACTTT TCCGGCAGAG TCGTCGATAG GAGTACCGGA AACGCATGAT
GCATCTGCGC CTGCGCCAGT AATGCCGGAA TCTCCAAAAT CATCCGAAGA CACGCCCTCT
AGATCCCCAC CAATAATACC TGTATCGGAA ACGCCCCATG GAGGGGCAAT CAATGAGAGG
CCGGTAGTGT GTCCAGTAGA ATCGGAAGCC ACCACCTTAA CTGGAGAAAC GTCCCTAAAA
CCGACGGTTA TCCTCACGGC ATCAACAGAC GAGTTGTCTT CTCCGAAAGA ACCGGAACGA
TCTTCATCAG ACAGAGAAAC ACCATCGAGT CCATTGCAGA GTGTGACATG CCAGGAAAGG
AGCGGGTCCC AAGGAACTTC TCCCGCTTCA GACTCCGTAA AACCATCTCT CATGCAAACC
CCGCCATCCG ATACTCATTC GTATCCACAA AATGGAGGAT TGACACCCTC TCAAACAATT
GCAAGCGCTA TCAACACCCC AATCGTATCA GAAACCATAT CGCCCCCCAA GTCCGATACA
GTGGCTGCTA CATTAGCATC GGGTTTGTCC ACGCTTTTAT CCACCAAGCA GTTCAAGCCT
CCGTCAATAT CTCAGGTTAT TCGCAAGGAT TTATGGAGTT CGGAAACCGG TGTTGTGTTT
GAAGCACTTC AATGGATTAC AATCGAAGCC TTCCACGACG AAGGAGCCCG GGATACAATT
GCTCGGACCG GTGGCCTGCT GGCCATTGTG CGGGCTATGG AGACGCATTC TTCGCACGCA
CCGATTCAAA AAGCTGCTTG CCAGGCTTTA GAAAAGCTTG CATTGGATAT CGAGAACGAA
CGCGCCATCA GCGATGTTGG CGGAGTTGAA GCAATTCTAG CGGCCATGAT GGGTCATTTG
AACAACGTAT CAGTTCAGGA GGCGGCCTGG TCGGCCCTGC AAAATTTAAC ATGTGGGAAC
GCCCAGGGGG CCATGACAAT TGACACTACG GGTGGCATGG TCTCGTTGGT CTCTGCAATG
CGAACGCATT CTACCGAGCC ACGAGTACAA GCGAGCGCGT GCGGGACCTT CGCCAATTTA
TGTCTGGATC ACGAAGATCG TTTAACGGCG TTGGCGCAAG CGGGCGGCTT TAGCGCCATG
GCGGATGCCC TACAACTCCA TTGGGAAAAT ATGGAAGTAC GAAAGGAAGC AAGTCGAGCG
CTGGCGGATT TGTTGGAAGA TGTTTAA
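
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input, so the result will not reproduce the 56% reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-blocked sequence into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record (an excerpt, not the full gene).
first_line = "ATGCTTCCGG TCGTTGTTAC CACGGCGTGT GGTTGTTTCG CACCGCTCCT GGACTCTCCC"

seq = clean_sequence(first_line)
print(len(seq), seq[:3], gc_fraction(seq))
```

Passing the complete sequence text to `clean_sequence` instead of the excerpt should give a string whose length can be compared with the Gene Length field.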
 
Protein sequence
MLPVVVTTAC GCFAPLLDSP PKQSTQFLYE YPNTSFDNNY NNSINVSTSN TSDRDGTTDI 
DRNRECAADD QLGDDASHEY PYGNVPHFRN DNSSSNDEHD DDSSNGTTTS PRMGRRRRKG
MNFFASKKSR AAAAAAAAVE TAPADTSSFR PAVKAVLEPF TSKSSVGSPR ADSVVAKENP
LPRNHDTSQP QHTEDSKKEK DGGSTSPSNG TNHNNGNSGT HPNDSNNNNN NQNKNSEPSS
PVSSDEQRDH HDDTHSNHND HSNSNDDDSS IISIHNHDRI DNSPKRREET TERVTTTTTE
DDDDNDNNEP GEDDNSILGI RDDSSSTTNP AHSPIQRHVA TPEENTSSTN SSVILEPSSA
LLDNPHTALH TTPHRIDSSD SEPRLFRTFT PGGSTTDIPA VETAVSDLGD VSEVPHGGLP
ARQPSGPLRT KLQLDLHTDL LKRLRNTNRM PEYILATESS RPSESPPRPS TPTEAMSTTT
SHVSEPSLTP ETPVQASPLA VETDSSPAES SRTVPETPDS SPRTPVSPTA TTSSDDTPCS
PTTTHSCVSE SHDLPESSPA VPNTAAGVAT EASGPSSSPE ASLTVSPESS SVPRPPFPTQ
TTATLLAEEA PQSPAESFQT VPDTPDSSPR PPVSPAATTS SDDTPCSPTT TTTTGSDTAE
ELSPAPERLP DLWSSETGVV FEALQWITIE AFHDEGARDT IARTGGLLAI VRAMETHSSH
APIQKAACQA LEKLALDIEN ERAISDVGGV EAILAAMMGH LNNVSVQEAA WSALQNLTCG
NAQGAMTIDT TGGMVSLVSA MRTHSTEPRV QASACGTFAN LCLDHEDRLT ALAQAGGFSA
MADALQLHWE NMEVRKEASR ALADLLEDV