Gene PHATRDRAFT_52368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52368 
SymbolATPase-P4 
ID7203524 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp516801 
End bp521911 
Gene Length5111 bp 
Protein Length1013 aa 
Translation table 
GC content50% 
IMG OID 
ProductP4, P type ATPase 
Protein accessionXP_002182702 
Protein GI219124839 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGCG TAAAGGTCGT GGCGCCGGAC CGCATTTTGG TCCCGGGGAA ATCCAACCTC 
GATCGATGTG ACAACAAGGT CGTTTCGGCG CGATACACTG CGCTTACCTT TTTTCCGGTG
GTACGTGTGT CCCTGCGTAG TATCCGAGCA GTTTTGCTGC TGTTGGTTTT TAGGACTCTG
TTGCGCCAAG GTTGCCGTTT ACTTGGTCTT TTGCGTAGTA TGGAATTGAA AAAGACGCGG
CAACGTATCG GCAGTAATGT CTTTTTTGGC TCTCGTTGTT CTGCACACTA TAAAAAGGAT
TCCGTCGACT CGGTGGAATG CGGGAGTGTT CCTCGTTTGT CTCATTTCGT TAGCCTGTAC
ATTCTCATTC ATACTTCCGG TCCGTTTTTC CTATAGGCAA TTTTGGAGCA GTTTCGTCGG
TTCGCCAATC TTTACTTTTT GATTGTGGGC TGCATCATGG CGTTGGGCGA GTACACGGAC
GCCTTTGAGA CGGCCATTTC CCCCTGGACG ACGCTGGGTC CGCTCGCCTT TGTCATTAGC
ATTTCTCTCC TTGTCGAAGG CTCGGCCGAT TACAAGCGTC ACAAGAACGA CGGAGACACC
AACAATGCCC CCTGTACGGT TATTCGACGC GCGGATGAAC TCGCCCTGGA AGAAGACGTG
GAACGCGACA CCAACGTCAT GAAAGGCAAG GATGTGTTGG TGAATCTCAA CAAGGCGTTT
TATGCCGAAG CTAGTGTGCC GACACCCCGG GTATTGGACC ACACCGAATC CAACGCCTCG
TCGGGGCCGG GAACACGCAC AGCCGAGGTC AAGATCGCTT TTCAGAAGGT GCGACGCATG
GATATTCGAC AAGGACAGTT TGTTCTCCTC AAGAATCGCA ACATGGTTCC CGCCGATATT
GTTCTTTTGG CAAGCTCTAA CGATCACGGG GGTGCCTACA TTGAGACTTC CTCAATCGAC
GGTGAAACCA ACCTCAAATT GCGTAATTCT CCTCGTTTAC CACCTTCCGT TCTTCAACAT
TTGAAGGAAG GGACGACAAT GGATAAGATT GAAGAGTCCG ACGATGAAGG GGAATCTAAA
AATCAAGTGA CGGAGGAAAT GGCCGACGTG CACGACTTTG AAACCATCAG CGAAGCGACC
AAGCGGATTA CACGCTTTTC CTCGTTGGGG CGACCCGGAG GTCGATGTGT TCTCGACCAT
CCCGAATGCG CCGTCTCGAC CGAAACAGCG GGAAATCCGA CAGCGGAACC GGAGAAAACC
GTGGGAGAAC GCTTCCATCG GTTCTCCGGT AAATTCATGA GGGGCATTCG GGGTGGGATT
GATGCCGTGA AGCGGGGTTC GGCGTTCGAT GCCGACAGCA GCCCTCCTAG TACTGGCTTT
GATGACGGCA AGTATATTGC GGCACTCGTC ACGGAGCCAC CAAATCCAAG TGTGCACACT
TTCAGCGGAA AGCTTACCCT GCCGCCATTC AAGACCGGAG AATCTCCTAT CGACATTCCG
CTGGGTGCAG ACAATGTCTT ACTTCGTGGG GCTGTTCTAC GAAATACGGA ATGGGCTATT
GGTGTAGCCT TCTTCACGGG AACCGACACT AAATTGATTC AAAACTCTTT CGAGACGCCG
TCGAAATTCA GTGTATTGGA TCGTCTCATG AACTACACTG TATTGGCAAT TCTTTGTATT
ATGGTCGGAT GCATTGCCTA CTTGGCCACG CAGGCGGTCC GTTCCAGCAA CGACCAATTC
GACAATTTGT GGTATGCCGG ATTCAACAAG GATGAGTCAG AGCCATGGCC TTACTTGCCG
AATCTTCCTC CCCCGGCGTG GGAGACTTCC TCGAATAACT GGCTTCAATT TTTCTTTCTG
TATGTGACGT TGTTGAACAA CTTTATTCCG TTGAGTCTAT ACGTTACGGT TGAGTTTATT
ACTTTTTGCA TGCTTTGGTT CATCTACGCC GATCGCGAAA TGTACGACGA CACTACTAAT
ACACGTGCTG TTGCTCGGAG CACTATTGTT ACCGATCTTG GTCGGGTTCA GTACATTTTC
TCCGATAAGA CCGGTACTTT GACTCAAAAC GTAATGCGCT TCAAACGCTG CTCAGTGGAC
GGAATGGCAT TCGGGGCGCC CATCCAGAAA GCGGCTCCCG GGGCCCAAAA CGATGCCGAC
GACGCTCCTT TCCTGCCTCT ACGTCAGCTT CTTGTGGGTC AATTCAAGTC TCAAAGGACA
GCTGGTCTGG AGGGACTGGG AGGCTCCACT TCTTCGGATG CGGCTCCTGT TGACAAGAAA
TTAACCTTCA ATGCCGAAAT GTTTCTCCGA GTACTAAGTC TGTGCCACAC TGTTGTTGTC
GAAAAAGATT TGGACAAGAA GGAAAACATC AGCAGCGGAG CCTCGGCCAT TTCTAGCGCA
TCCAATCAGA AAAGCCTCCA ACTTCGCGCG AGCAACATGG CCAAGAGCTT GTTTAGTCGG
AAGCGGAGTG ACACAAATGG GTCGGGCGCC GACTCTCCCT TGACGTTGGT CACTGATCCC
GATGGTCTTG GTCCTGGTAG CGTTCATGGG CTTCGAAACA GAGCCCGGAC CGTCAGTGTA
AATTCTATAC CCGAGGACAG AAACGGACGA GGGGACGAAC GGGAAAAAGG CCCGGATGGT
GCACCGTACG GCTATGCCTA TCAAGCAGAG TCTCCAGACG AAGGGGCCCT CGTTTCTGCA
GCAAGTACAA CGTATGGATT TCAGGTCATT GGTCGCGATG CGGCCGGTAT TCGGTTGCGT
GTTAGTGCTC CGTCTCATTT GGAAGAAACC AAAGTGACCG AAAGTTTGAA AAGCGGAAAA
ACTAGTCTCA AACGGCTTGC TGCGCAGTCC GCCAGCGAGC TCGACTACGA TGCGGTGACA
CCAGCCAAAG GGCTTTCGAA CAGCGTGCAT GAATACTTTG AACAAGAAGA AAAGGAGGAA
ACATGGACCA TTCTAGCCGT CAACAAGTTC GATTCAGACC GAAAACGCAT GTCGATTCTA
TTGCGCTCAC CGCCTGAGCT CGGCTCCCTT CCTATTCTGT TTTGCAAAGG AGCAGACTCG
GCAATGCTCG ACCCTGCAAT TGTTTCCAAT GTTGCAATGA TTACCGAAGC GGATAACGAG
AGTGGCCTCC ATTTTCCAAA GCCATTGTCT GTTCAAGATC GGCAGCGCGA CGTGTCAGCT
CTTTCTGCCG TCGACGAAGG AGAAGACGAA AACAAAGACG GCGTTGACAC TGAAGGCTGG
GAGATGGCCA ATATGTTGGG GCTACAAGCG CACCTTGGTG ATTTTGCTTC GGAAGGGCTG
AGAACTCTAG TACTGGGCAT GAGGGTTCTG ACGGAGGCGG AGTGCGAAGA GTGGCTCATA
GTTTACAAAG AAGCCGCTGT CGCACTTAAA GATCGATCGG AACTTCTCAC AAAAGCTGCA
CTCCAGATTG AGCGAAATAT TCATATTGTT GGTGCAACAG CAATTGAAGA CAAGCTTCAG
AAGGGTGTTC CAAAGACAAT TGCAACTCTT GGAGAAGCAG GCATCAAACT TTGGGTCTTG
ACCGGGGATA AGCGTGAGAC TGCAGTCGAA ATCGGATATT CTACCCATGT TCTTACTCCA
AGAATGCATT TGACTCAAGT TCCTGACAAT GGCAAATATC ATGTTCGGAC TCAGATGTCA
ATGGAGTTTA TACGGTTGGT CAAGATGGGT AAACTCCCTG ACTACCAGCA TTCGCAGCTC
AATGAAAGCG GGCCTACAAC TTTTATTCAT CGCTGGGAAA GCTTCCTGTT CCGTTTTCGT
CGTTGTTGGC GAAGCGTGAA GAGATTTTCT TGTCGTTCAT GGGGAATTTT GTACGGAATT
TTGGGCTTGG CAAAAGCCGC CGAAAAAAAA CGAGATGAAG TTAAGGATTC CGAGAAAGCT
GAAAAGCTTG TGCTGAGAAC TAGGGACCGT CGTCGCAGAG TCAGAAGACG TGCCGATGAA
AATATCAAGT GGTGGATGCA GTCCGACGAG GGGAGGGCGC AAAAAAAGAC ACGTGAGAAT
GTCAAGGACG ATACAGATGA CGATTTATCT TTGGCTTCAG AGGAAACGCC AATGGTATTC
AATAGAGCGA GCTCAGCAAG AGGGCTTCTG AATGATCTCC GCAGTTCGGG GAGACTATCG
CAGGCAGATA TTCGTCAGTT GTCCTTGGCA CACTTAACAG CACAGCAAGC GAGCAATGAA
GAAGAACCTT TGGTGGACGA GGATACGCTG TCACTAGACA GCTTTTTTCC AGGAAACGCC
ACCGATTTGA AAGGGGATTT TGATACAAAG AAACGCACTT TTCTGGAGAG GATGTTTGCG
ATTGATCGAC AGGTTCGGAA AGGACATTTG AAGAAGCACA TGCATTGGCA GCGACTTGCC
GCCATCGAGG AAGATGGGGG ATCAAGAAGA GCAGTTACGG TAGTGCCACC GAAAGCTAGT
GACGGGCCTA GAGCGTTAGT AATTGAAGGT GCTGCTTTGA AGCATTTGTT GGGTGATCCG
GAAATGGAAG AGATTTTGTT TTCCGTTGCA AGCAGCTGTG ATGCTGTCAT TGCCTGCCGT
GTAAGCCCGC AACAAAAGGC TCTTTTAGTG AAGCTGGTAC GATATAACGT AAAGCCTGAA
CCAATCTCGT TGGCAATTGG CGATGGAGCC AACGACGTCG GAATGATTCA AGAAGCTCAC
GTTGGAATTG GCATCTCGGG GAAGGAAGGC CAACAAGCTG TAAATGCTTC CGATTTTGCG
ATAGCTCAGT TCAGGTTTCT GGAGGAACTC GTTTTGATAC ACGGCCGGTG GAATTTTTTT
CGTCTTAGTA CGGTGGTTTT GTACTCATTT TACAAGAACG CTTTAATGGC TGGAATTCTT
ATCGTGTTTG CATCTCGGAC CGTCTACAGT GGCACGCCCT TGTTCGACGA GTGGTTGATT
GCGATGCTTA ACTTTGTAGC CGCGGCTCCA ATTGTGGCCC TCGGGCTTTT TGATCGCTGC
TTAAGCAAAG AATACGTTCG AAGCCATCCC GAAGTCTACA AGGCAACACG AGAGAACGAG
TTGATCACCT TTCGGACCTT GTTGCGATGG ATTGCTTTGA CATTTGTCCA TATTTTCACG
CTCTTCTTTT T
 
Protein sequence
MARVKVVAPD RILVPGKSNL DRCDNKVVSA RYTALTFFPV AILEQFRRFA NLYFLIVGCI 
MALGEYTDAF ETAISPWTTL GPLAFVISIS LLVEGSADYK RHKNDGDTNN APSEVKIAFQ
KVRRMDIRQG QFVLLKNRNM VPADIVLLAS SNDHGGAYIE TSSIDGETNL KLRNSPRLPP
SRGSAFDADS SPPSTGFDDG KYIAALVTEP PNPSVHTFSG KLTLPPFKTG ESPIDIPLGA
DNVLLRGAVL RNTEWAIGVA FFTGTDTKLI QNSFETPSKF SVLDRLMNYT VLAILCIMVG
CIAYLATQAD ESEPWPYLPN LPPPAWETSS NNWLQFFFLY VTLLNNFIPL SLYVTVEFIT
FCMLWFIYAD REMYDDTTNT RAVARSTIVT DLGRVQYIFS DKTGTLTQNV MRFKRCSVDG
MAFGAPIQKA APGAQNDADD APFLPLRQLL VGQFKSQRTA GLEGLGGSTS SDAAPVDKKL
TFNAEMFLRV LSLCHTVVDR NGRGDEREKG PDGAPYGYAY QAESPDEGAL VSAASTTYGF
QVIGRDAAGI RLRVKEKEET WTILAVNKFD SDRKRMSILL RSPPELGSLP ILFCKGADSA
MLDPAIVSNM ANMLGLQAHL GDFASEGLRT LVLGMRVLTE AECEEWLIVY KEAAVALKDR
SELLTKAALQ IERNIHIVGA TAIEDKLQKG VPKTIATLGE AGIKLWVLTG DKRETAVEIG
YSTHVLTPRM HLTQVPDNGK YHVRTQMMFA IDRQVRKGHL KKHMHWQRLA AIEEDGGSRR
AVTVVPPKAS DGPRALVIEG AALKHLLGDP EMEEILFSVA SSCDAVIACR VSPQQKALLV
KLVRYNVKPE PISLAIGDGA NDVGMIQEAH VGIGISGKEG QQAVNASDFA IAQFRFLEEL
VLIHGRWNFF RLSTVVLYSF YKNALMAGIL IVFASRTVYS GTPLFDEWLI AMLNFVAAAP
IVALGLFDRC LSKEYVRSHP EVYKATRENE LITFRTLLRW IALTFVHIFT LFF