Gene PHATR_43802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43802 
Symbol 
ID7203944 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp47141 
End bp50787 
Gene Length3647 bp 
Protein Length1189 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186268 
Protein GI219113369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.327125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCA ACCCCAACAA AACCTCCGCC AAGGAACGTG AGATTCTCGA TCGCCAAAGA 
CAACTAGCGG CAAAACTCAA GCCGCCAAAA CCGCCAGCTT CGTCCATTGG TAGTAGCACT
TCTCCACCCG TGCCCCCTTG CGCAAAGGGT GAGAGCCACC GCACGTCGGC TCCTCCCCCG
AACGTGATTG ATTTGACTGA ATCCACCGCG TCCCATCAAT TCGAATCCCG TTCCAATTCA
TCACGCAATT TTACATCCCA ATCACGTCCC CTTCCGGCCA AGCGGAAGCG TGTAGAGCCC
TCCAGGAAAG CCTCCTCAAC GAGACCGGCA ACCGACACCG GCTCTCAACA AGCCCCACCA
ACCAAGCCTT CATCCTCCTC TAATCACCCA GTGACTGTGC AAAAGCGCCC CACACTCAAA
CGCAAAGGCA ACATTCCTCC CGCGGCAGCG TTGATGGCCG CGGCAAGGGT CAAAGCTGGA
GCGAACGATC CCAAACAGTC TGTGCACGCT ACGACCACCG GAAGCAAATC GTCTCCACGG
CTTCAGCGCA AAACGGTATC CACTCGCAAC GCCAATGCGG CGAGTGGGTC CAAAACAGTC
ACCAGCGGTA GTCTAGCACA ACTGGTACAA AACGTATCTT CAACGCCGCT GGATGCTGCG
AACCTCAGCG GTGCCGCAAG TGGTGTTAAC GCAGTCCACG CCGACGACTT CTGGAAACAT
TTGCGCGAAT GGGATTTCGT ATCCCAGTAT GCGTCTTACC AGAGATCCCA ACGGCAGCAA
CTAGACACAA ACGATCAGAG TACCACAATG CAGAAAAAGC CGCTTCCCAA CGTCTTTTTG
AACGCGCGTC ACTACATGGC AGCCTGGGCT CCACTCTGCC TGGCCGAATG CCGGGCCCAG
CTACTGCAGG AGGCTGGACT CAACGCATCC GCACCTCTCG CTGTGCAGGT CCAAACCTCC
ACCAACGGAC CCCGAAGGTT TCGGGGGACT GGTGATATGT TTAACGCCTC CAGTGGCTGG
GATGAACACG ATACCGGAGG ATACGTGACG ATCCAACCGC AACAAAGGGG GACCGGGCGG
GGTATGAAAT TCTTCCCACA CGATCTCGTC TTGTTACTGA TTCCTCCATA CGAACATATC
TTGCGAGACT TGTCCCAAAG CCGCAAGACA CCACCAGCAC CACCTTTGGG ACAAGATCCC
AACGATCCTG CTGCCTACAA AGATGTCGGA CTGATTGGTC ACGTTGAAAT GAGTCGGGGC
GAAGTGGCTG GTTTAACTCT GAAAATTTCC AAGCGGTTGT GGGCCAAGCT GAGTACCAGG
AACGGTTCCG CGCCGAGATC ATCCAATGCA TCTCCCACCA CCACCAACAT GTTTCTCGTC
AAAATTGGGA GCAACGTTAC AGCGCTGCGT GAATTCACGG CGCTCTGTCA AGTTGACACG
CTCCCGGTGC AACGGTACCT TTTGGCCGAA CATCTCGCCA ACGCGCAGAA TCGTCGCAAA
TTGAGCCGGA ACCAAACAAC GGAACAGCTC CTGGAACGAA TGGGCGGCGC CAATGCGCTG
GGCAAGGGCT TTTTGGACTA CGCCGAACAC AAGTTTAACG CATCGCAGCT TACAGCCATT
GCTGCGTCGG CACACGAATA CGGAGAAGGT GGATTTACTC TCATCAAAGG ACCGCCAGGA
ACTGGAAGTA AGTAGCCCGA AGGACGTATT CGTAGCAGCA CGCACTGTCT CCAGCTGACC
TCCTCTTTTC TCGCTGTCGA GCAGAAACAA CCACGCTCGT GGCTGTTTTG AACTCCTTAC
ATATTCGTCA GTACAACAAA TACTACGAAT CGGTCCGACG TATTGCGACG CAACCCACCG
GCACGCGTCA GGCTGCTTTG GACATGGCCC GTCGCGCCAA ACCTCGATTG CTCGTTTGTG
CTCCATCGAA TGCAGCCGTG GATAACATAA TATTGAAAAT TATGGAGGAT GGCTTTGTGG
ATGGACGGGG TCAACGGTAC AATCCAAGCA TGATTCGTGT TGGCGTGGGT AAGGGTACTG
CAGTAAAACC TGTCGCTCTG GAAACCAAAG TAGACGCTAT TCTGGCGGAG AATATGGACG
CTGGCCGGCT GGAAACCTCG ATCGCGGGCT ATCGGATGGA ATTGACCAGA ATTTCGCAGG
ACATTGCCCG ACTGCGACGT CGAGTGCACG CCATGACGAA CGCCAGTGCG TGGCCGCTTT
CCAAGGATTG GGAAATCCGT ATCGATGAAG ATACCTTTGA CGAAACGGGA AAGGTGTATT
TTGTTAATCA CCGCGCCCAC TTAACCACGT ACGAAGCTCC TCCGCCACCA GAACCGGGAG
AGACGCACTT CCCAGCTACG GCAATGCCTG AGTATCGAGC ATTTATGAGT CGGATTGTGA
AGCTTGTGGA GAACTACTTT TCGGTAAAAG CGGAATTAGA ACGATGCACA ATAGTCAAGG
GATCGATGGA TAATGGTACC AATCATATTG AAGTTCGTCA AAACATGGAA ACACACGTCC
TAAATTCTGT ACACATGGTG ATGACAACTT TAGGGACGGC TGGCAACCGT GTCATGGAAG
CCGCCGACAA GTTTGAAGTT GTGGTCGTCG ATGAAGCTGC GCAAAGCGTG GAACCGGCAA
CTCTATCTGC GTTCCAATTG GGATCGAGAC ATGCTGTGCT AGTTGGCGAC CCCCAACAGC
TTCCAGCGAC CGTCTTTAAC ATTTCGGGAC GCCTTTCTAA ATACGATCGA TCCCTGTTTC
AGCGTTTGGA AGAAGCTGGG CAACCCGTGT ACATGTTGAA CGAGCAATAC CGAATGCACC
CCAGCATTTC TCACTTTCCT CGCCATATTT TTTATGGCGG CACTCTTTTG GATGGGCCAA
ATGTACGAAA ATCAGATTAT GGCAACCCAC TGCTTGGTAT GGTCACTCGG ACTCTTCCAA
GCTTCTCTCC CTTAATGATT CTCGACCTCG ATTCTAAGGA AGAACGTGGC GGCACAAGTT
TGTCCAACTC TGGAGAAGCT CAGCTGGCCG TCTACTTGTA CATGCGATTG AAAGGAATAA
GTCGAGGGTT GTCGGCCGAA ACCAAAGTTG CTGTTATTAC TCCCTATGCT CAACAAGCTC
GTATGCTTCG CGAGTATTTC GGGGATGCTT TAGGGCCGAA CTACGAGAAA TTCGTGGAGG
TGAATACGGT CGATGCCTTT CAGGGGCGAG AGGCCAACAT TGTAATCTTT TCGGCAGTCC
GTGCGGCGGG TAGTCACGGC ATTGGCTTCC TTTCCGACGT GCGTCGAATG AATGTCGCTC
TGACTCGCGC AAAGCATTTC TTATTTGTGA TTGCACGCTG CGATTCGATT GTGGTAAATC
CATACTGGAG CGATTTGGTT ACTCACGCCC GGAAAACTCA CGCTGTGCTG AAGGTTCCGA
TTTTTGGGGG CGGTCGGGCG CTGTCCTTTG GAGAGCTCAA CGAATGGCAG AAGGAAACTC
CGAAAATTAT AGACAATGCT CCGACTGGAC TGACAGCGAC CGAGCCTCGT GAGAGCAAGC
CGATCCCGCC TCCACCCTCT CGACCTCCAG ATCCTCGCAA AGCTCCGAAG GCACCGCCAC
CACCTGCTAC ACCGGCAGCG AACAGAGTGG ATCCTCGAAA ACGCTAG
 
Protein sequence
MSSNPNKTSA KEREILDRQR QLAAKLKPPK PPASSIGSST SPPVPPCAKG ESHRTSAPPP 
NVIDLTESTA SHQFESRSNS SRNFTSQSRP LPAKRKRVEP SRKASSTRPA TDTGSQQAPP
TKPSSSSNHP VTVQKRPTLK RKGNIPPAAA LMAAARVKAG ANDPKQSVHA TTTGSKSSPR
LQRKTVSTRN ANAASGSKTV TSGSLAQLVQ NVSSTPLDAA NLSGAASGVN AVHADDFWKH
LREWDFVSQY ASYQRSQRQQ LDTNDQSTTM QKKPLPNVFL NARHYMAAWA PLCLAECRAQ
LLQEAGLNAS APLAVQVQTS TNGPRRFRGT GDMFNASSGW DEHDTGGYVT IQPQQRGTGR
GMKFFPHDLV LLLIPPYEHI LRDLSQSRKT PPAPPLGQDP NDPAAYKDVG LIGHVEMSRG
EVAGLTLKIS KRLWAKLSTR NGSAPRSSNA SPTTTNMFLV KIGSNVTALR EFTALCQVDT
LPVQRYLLAE HLANAQNRRK LSRNQTTEQL LERMGGANAL GKGFLDYAEH KFNASQLTAI
AASAHEYGEG GFTLIKGPPG TGKTTTLVAV LNSLHIRQYN KYYESVRRIA TQPTGTRQAA
LDMARRAKPR LLVCAPSNAA VDNIILKIME DGFVDGRGQR YNPSMIRVGV GKGTAVKPVA
LETKVDAILA ENMDAGRLET SIAGYRMELT RISQDIARLR RRVHAMTNAS AWPLSKDWEI
RIDEDTFDET GKVYFVNHRA HLTTYEAPPP PEPGETHFPA TAMPEYRAFM SRIVKLVENY
FSVKAELERC TIVKGSMDNG TNHIEVRQNM ETHVLNSVHM VMTTLGTAGN RVMEAADKFE
VVVVDEAAQS VEPATLSAFQ LGSRHAVLVG DPQQLPATVF NISGRLSKYD RSLFQRLEEA
GQPVYMLNEQ YRMHPSISHF PRHIFYGGTL LDGPNVRKSD YGNPLLGMVT RTLPSFSPLM
ILDLDSKEER GGTSLSNSGE AQLAVYLYMR LKGISRGLSA ETKVAVITPY AQQARMLREY
FGDALGPNYE KFVEVNTVDA FQGREANIVI FSAVRAAGSH GIGFLSDVRR MNVALTRAKH
FLFVIARCDS IVVNPYWSDL VTHARKTHAV LKVPIFGGGR ALSFGELNEW QKETPKIIDN
APTGLTATEP RESKPIPPPP SRPPDPRKAP KAPPPPATPA ANRVDPRKR