Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_52368 |
Symbol | ATPase-P4 |
ID | 7203524 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 516801 |
End bp | 521911 |
Gene Length | 5111 bp |
Protein Length | 1013 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | P4, P type ATPase |
Protein accession | XP_002182702 |
Protein GI | 219124839 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGCG TAAAGGTCGT GGCGCCGGAC CGCATTTTGG TCCCGGGGAA ATCCAACCTC GATCGATGTG ACAACAAGGT CGTTTCGGCG CGATACACTG CGCTTACCTT TTTTCCGGTG GTACGTGTGT CCCTGCGTAG TATCCGAGCA GTTTTGCTGC TGTTGGTTTT TAGGACTCTG TTGCGCCAAG GTTGCCGTTT ACTTGGTCTT TTGCGTAGTA TGGAATTGAA AAAGACGCGG CAACGTATCG GCAGTAATGT CTTTTTTGGC TCTCGTTGTT CTGCACACTA TAAAAAGGAT TCCGTCGACT CGGTGGAATG CGGGAGTGTT CCTCGTTTGT CTCATTTCGT TAGCCTGTAC ATTCTCATTC ATACTTCCGG TCCGTTTTTC CTATAGGCAA TTTTGGAGCA GTTTCGTCGG TTCGCCAATC TTTACTTTTT GATTGTGGGC TGCATCATGG CGTTGGGCGA GTACACGGAC GCCTTTGAGA CGGCCATTTC CCCCTGGACG ACGCTGGGTC CGCTCGCCTT TGTCATTAGC ATTTCTCTCC TTGTCGAAGG CTCGGCCGAT TACAAGCGTC ACAAGAACGA CGGAGACACC AACAATGCCC CCTGTACGGT TATTCGACGC GCGGATGAAC TCGCCCTGGA AGAAGACGTG GAACGCGACA CCAACGTCAT GAAAGGCAAG GATGTGTTGG TGAATCTCAA CAAGGCGTTT TATGCCGAAG CTAGTGTGCC GACACCCCGG GTATTGGACC ACACCGAATC CAACGCCTCG TCGGGGCCGG GAACACGCAC AGCCGAGGTC AAGATCGCTT TTCAGAAGGT GCGACGCATG GATATTCGAC AAGGACAGTT TGTTCTCCTC AAGAATCGCA ACATGGTTCC CGCCGATATT GTTCTTTTGG CAAGCTCTAA CGATCACGGG GGTGCCTACA TTGAGACTTC CTCAATCGAC GGTGAAACCA ACCTCAAATT GCGTAATTCT CCTCGTTTAC CACCTTCCGT TCTTCAACAT TTGAAGGAAG GGACGACAAT GGATAAGATT GAAGAGTCCG ACGATGAAGG GGAATCTAAA AATCAAGTGA CGGAGGAAAT GGCCGACGTG CACGACTTTG AAACCATCAG CGAAGCGACC AAGCGGATTA CACGCTTTTC CTCGTTGGGG CGACCCGGAG GTCGATGTGT TCTCGACCAT CCCGAATGCG CCGTCTCGAC CGAAACAGCG GGAAATCCGA CAGCGGAACC GGAGAAAACC GTGGGAGAAC GCTTCCATCG GTTCTCCGGT AAATTCATGA GGGGCATTCG GGGTGGGATT GATGCCGTGA AGCGGGGTTC GGCGTTCGAT GCCGACAGCA GCCCTCCTAG TACTGGCTTT GATGACGGCA AGTATATTGC GGCACTCGTC ACGGAGCCAC CAAATCCAAG TGTGCACACT TTCAGCGGAA AGCTTACCCT GCCGCCATTC AAGACCGGAG AATCTCCTAT CGACATTCCG CTGGGTGCAG ACAATGTCTT ACTTCGTGGG GCTGTTCTAC GAAATACGGA ATGGGCTATT GGTGTAGCCT TCTTCACGGG AACCGACACT AAATTGATTC AAAACTCTTT CGAGACGCCG TCGAAATTCA GTGTATTGGA TCGTCTCATG AACTACACTG TATTGGCAAT TCTTTGTATT ATGGTCGGAT GCATTGCCTA CTTGGCCACG CAGGCGGTCC GTTCCAGCAA CGACCAATTC GACAATTTGT GGTATGCCGG ATTCAACAAG GATGAGTCAG AGCCATGGCC TTACTTGCCG AATCTTCCTC CCCCGGCGTG GGAGACTTCC TCGAATAACT GGCTTCAATT TTTCTTTCTG TATGTGACGT TGTTGAACAA CTTTATTCCG TTGAGTCTAT ACGTTACGGT TGAGTTTATT ACTTTTTGCA TGCTTTGGTT CATCTACGCC GATCGCGAAA TGTACGACGA CACTACTAAT ACACGTGCTG TTGCTCGGAG CACTATTGTT ACCGATCTTG GTCGGGTTCA GTACATTTTC TCCGATAAGA CCGGTACTTT GACTCAAAAC GTAATGCGCT TCAAACGCTG CTCAGTGGAC GGAATGGCAT TCGGGGCGCC CATCCAGAAA GCGGCTCCCG GGGCCCAAAA CGATGCCGAC GACGCTCCTT TCCTGCCTCT ACGTCAGCTT CTTGTGGGTC AATTCAAGTC TCAAAGGACA GCTGGTCTGG AGGGACTGGG AGGCTCCACT TCTTCGGATG CGGCTCCTGT TGACAAGAAA TTAACCTTCA ATGCCGAAAT GTTTCTCCGA GTACTAAGTC TGTGCCACAC TGTTGTTGTC GAAAAAGATT TGGACAAGAA GGAAAACATC AGCAGCGGAG CCTCGGCCAT TTCTAGCGCA TCCAATCAGA AAAGCCTCCA ACTTCGCGCG AGCAACATGG CCAAGAGCTT GTTTAGTCGG AAGCGGAGTG ACACAAATGG GTCGGGCGCC GACTCTCCCT TGACGTTGGT CACTGATCCC GATGGTCTTG GTCCTGGTAG CGTTCATGGG CTTCGAAACA GAGCCCGGAC CGTCAGTGTA AATTCTATAC CCGAGGACAG AAACGGACGA GGGGACGAAC GGGAAAAAGG CCCGGATGGT GCACCGTACG GCTATGCCTA TCAAGCAGAG TCTCCAGACG AAGGGGCCCT CGTTTCTGCA GCAAGTACAA CGTATGGATT TCAGGTCATT GGTCGCGATG CGGCCGGTAT TCGGTTGCGT GTTAGTGCTC CGTCTCATTT GGAAGAAACC AAAGTGACCG AAAGTTTGAA AAGCGGAAAA ACTAGTCTCA AACGGCTTGC TGCGCAGTCC GCCAGCGAGC TCGACTACGA TGCGGTGACA CCAGCCAAAG GGCTTTCGAA CAGCGTGCAT GAATACTTTG AACAAGAAGA AAAGGAGGAA ACATGGACCA TTCTAGCCGT CAACAAGTTC GATTCAGACC GAAAACGCAT GTCGATTCTA TTGCGCTCAC CGCCTGAGCT CGGCTCCCTT CCTATTCTGT TTTGCAAAGG AGCAGACTCG GCAATGCTCG ACCCTGCAAT TGTTTCCAAT GTTGCAATGA TTACCGAAGC GGATAACGAG AGTGGCCTCC ATTTTCCAAA GCCATTGTCT GTTCAAGATC GGCAGCGCGA CGTGTCAGCT CTTTCTGCCG TCGACGAAGG AGAAGACGAA AACAAAGACG GCGTTGACAC TGAAGGCTGG GAGATGGCCA ATATGTTGGG GCTACAAGCG CACCTTGGTG ATTTTGCTTC GGAAGGGCTG AGAACTCTAG TACTGGGCAT GAGGGTTCTG ACGGAGGCGG AGTGCGAAGA GTGGCTCATA GTTTACAAAG AAGCCGCTGT CGCACTTAAA GATCGATCGG AACTTCTCAC AAAAGCTGCA CTCCAGATTG AGCGAAATAT TCATATTGTT GGTGCAACAG CAATTGAAGA CAAGCTTCAG AAGGGTGTTC CAAAGACAAT TGCAACTCTT GGAGAAGCAG GCATCAAACT TTGGGTCTTG ACCGGGGATA AGCGTGAGAC TGCAGTCGAA ATCGGATATT CTACCCATGT TCTTACTCCA AGAATGCATT TGACTCAAGT TCCTGACAAT GGCAAATATC ATGTTCGGAC TCAGATGTCA ATGGAGTTTA TACGGTTGGT CAAGATGGGT AAACTCCCTG ACTACCAGCA TTCGCAGCTC AATGAAAGCG GGCCTACAAC TTTTATTCAT CGCTGGGAAA GCTTCCTGTT CCGTTTTCGT CGTTGTTGGC GAAGCGTGAA GAGATTTTCT TGTCGTTCAT GGGGAATTTT GTACGGAATT TTGGGCTTGG CAAAAGCCGC CGAAAAAAAA CGAGATGAAG TTAAGGATTC CGAGAAAGCT GAAAAGCTTG TGCTGAGAAC TAGGGACCGT CGTCGCAGAG TCAGAAGACG TGCCGATGAA AATATCAAGT GGTGGATGCA GTCCGACGAG GGGAGGGCGC AAAAAAAGAC ACGTGAGAAT GTCAAGGACG ATACAGATGA CGATTTATCT TTGGCTTCAG AGGAAACGCC AATGGTATTC AATAGAGCGA GCTCAGCAAG AGGGCTTCTG AATGATCTCC GCAGTTCGGG GAGACTATCG CAGGCAGATA TTCGTCAGTT GTCCTTGGCA CACTTAACAG CACAGCAAGC GAGCAATGAA GAAGAACCTT TGGTGGACGA GGATACGCTG TCACTAGACA GCTTTTTTCC AGGAAACGCC ACCGATTTGA AAGGGGATTT TGATACAAAG AAACGCACTT TTCTGGAGAG GATGTTTGCG ATTGATCGAC AGGTTCGGAA AGGACATTTG AAGAAGCACA TGCATTGGCA GCGACTTGCC GCCATCGAGG AAGATGGGGG ATCAAGAAGA GCAGTTACGG TAGTGCCACC GAAAGCTAGT GACGGGCCTA GAGCGTTAGT AATTGAAGGT GCTGCTTTGA AGCATTTGTT GGGTGATCCG GAAATGGAAG AGATTTTGTT TTCCGTTGCA AGCAGCTGTG ATGCTGTCAT TGCCTGCCGT GTAAGCCCGC AACAAAAGGC TCTTTTAGTG AAGCTGGTAC GATATAACGT AAAGCCTGAA CCAATCTCGT TGGCAATTGG CGATGGAGCC AACGACGTCG GAATGATTCA AGAAGCTCAC GTTGGAATTG GCATCTCGGG GAAGGAAGGC CAACAAGCTG TAAATGCTTC CGATTTTGCG ATAGCTCAGT TCAGGTTTCT GGAGGAACTC GTTTTGATAC ACGGCCGGTG GAATTTTTTT CGTCTTAGTA CGGTGGTTTT GTACTCATTT TACAAGAACG CTTTAATGGC TGGAATTCTT ATCGTGTTTG CATCTCGGAC CGTCTACAGT GGCACGCCCT TGTTCGACGA GTGGTTGATT GCGATGCTTA ACTTTGTAGC CGCGGCTCCA ATTGTGGCCC TCGGGCTTTT TGATCGCTGC TTAAGCAAAG AATACGTTCG AAGCCATCCC GAAGTCTACA AGGCAACACG AGAGAACGAG TTGATCACCT TTCGGACCTT GTTGCGATGG ATTGCTTTGA CATTTGTCCA TATTTTCACG CTCTTCTTTT T
|
Protein sequence | MARVKVVAPD RILVPGKSNL DRCDNKVVSA RYTALTFFPV AILEQFRRFA NLYFLIVGCI MALGEYTDAF ETAISPWTTL GPLAFVISIS LLVEGSADYK RHKNDGDTNN APSEVKIAFQ KVRRMDIRQG QFVLLKNRNM VPADIVLLAS SNDHGGAYIE TSSIDGETNL KLRNSPRLPP SRGSAFDADS SPPSTGFDDG KYIAALVTEP PNPSVHTFSG KLTLPPFKTG ESPIDIPLGA DNVLLRGAVL RNTEWAIGVA FFTGTDTKLI QNSFETPSKF SVLDRLMNYT VLAILCIMVG CIAYLATQAD ESEPWPYLPN LPPPAWETSS NNWLQFFFLY VTLLNNFIPL SLYVTVEFIT FCMLWFIYAD REMYDDTTNT RAVARSTIVT DLGRVQYIFS DKTGTLTQNV MRFKRCSVDG MAFGAPIQKA APGAQNDADD APFLPLRQLL VGQFKSQRTA GLEGLGGSTS SDAAPVDKKL TFNAEMFLRV LSLCHTVVDR NGRGDEREKG PDGAPYGYAY QAESPDEGAL VSAASTTYGF QVIGRDAAGI RLRVKEKEET WTILAVNKFD SDRKRMSILL RSPPELGSLP ILFCKGADSA MLDPAIVSNM ANMLGLQAHL GDFASEGLRT LVLGMRVLTE AECEEWLIVY KEAAVALKDR SELLTKAALQ IERNIHIVGA TAIEDKLQKG VPKTIATLGE AGIKLWVLTG DKRETAVEIG YSTHVLTPRM HLTQVPDNGK YHVRTQMMFA IDRQVRKGHL KKHMHWQRLA AIEEDGGSRR AVTVVPPKAS DGPRALVIEG AALKHLLGDP EMEEILFSVA SSCDAVIACR VSPQQKALLV KLVRYNVKPE PISLAIGDGA NDVGMIQEAH VGIGISGKEG QQAVNASDFA IAQFRFLEEL VLIHGRWNFF RLSTVVLYSF YKNALMAGIL IVFASRTVYS GTPLFDEWLI AMLNFVAAAP IVALGLFDRC LSKEYVRSHP EVYKATRENE LITFRTLLRW IALTFVHIFT LFF
|
| |