Gene Information |
Locus tag | PHATRDRAFT_55200 |
Symbol | ATPase2-3A |
ID | 7199258 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 90455 |
End bp | 93721 |
Gene Length | 3267 bp |
Protein Length | 969 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | P3A, P type ATPase |
Protein accession | XP_002185425 |
Protein GI | 219130548 |
COG category | |
COG ID | |
TIGRFAM ID | |
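The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though one the reported gene length agrees with) and that the coding sequence ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 90455
END_BP = 93721
GENE_LENGTH_BP = 3267
PROTEIN_LENGTH_AA = 969


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


computed_span = span_length(START_BP, END_BP)
cds_bases = coding_length(PROTEIN_LENGTH_AA)

# Bases in the gene span not accounted for by the codons above.
# For a spliced gene this remainder would include introns; the record
# itself does not break it down further.
remainder = computed_span - cds_bases

print(f"span from coordinates: {computed_span} bp")
print(f"coding bases (with stop): {cds_bases} bp")
print(f"remaining bases in span: {remainder} bp")
```

The span computed from the coordinates matches the reported Gene Length, which supports reading the coordinates as inclusive.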
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0530284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGATC TGGAAGAAGG AGGCCGTCTC ACCGAAACCG CCGCCATTGG GGGCGGTACC CACCCCACGA CGAGCACGAT CCCCCAACAG CCCACGGAAG GTGAATATGA CTGGGTACCG ACGGATCCCA CATCCGGATT AACCTCGGAA CAAGTCGCCC AAGCCTTGGC TCGCTACGGT CCCAACGAAA TACCCGTCCC CGACACACCC CTCTATCTAC TCTTCGTCCG ACAATTCGTC GGATTCTTGC CCTTTCTCAT TGAACTCGCC GCCATCGTCT CCCTCGCCGT CCAAGATTAC ATCGATTTCG GGATCATTCT AGGCATTCTA CTCGTCAACG GCTGTCTCGG ATTCCGGGAA GAGTACCACG CCAAGAAGAG TCTACAAGCC GTCAGCGCCA GTTTAGATTC CGAAATCGCC GTCCGCCGGG ACGGCCTCAC AGCCAGCCTT CTCGTCAAGC AGCTCGTCCC GGGTGATATT GTCTTTTTAG TGGGCGGGAC CATTGTCCCC GCCGACGTTC TCTGGATTTC CGGTGACGTT GTCCAGCTCG ATACCGCCGC ACTCACCGGC GAACCTTTGC CCCGCAAGTA TCCCAGTGCC GAACACGGAC GGACACTCTT GTCCGGAACC ACCGTCACGG CCGGTGAATG CTACGGACAA GTCCTCCGTA TCGGTACGGC CACCGAAATT GGACAAGCGC AGGTAGACAT TCTACAGGAC AAGTCCGTGC GGATCGTTTC CGTCTTTCAG CAAAAAATTA TGAAGGTCGT TCAGATGCTC ATTGCCGGTT CACTCATTGT CGTACTCGCC GTCCTACTCG TCAAGGGAAT CGTCTACGAT GGCTTTGACG ACAACGTCAA GGAAACGATT CTGGACGCCT TGTCCATCCT CATTGCGTCC ATCCCCGTCG CCCTGCCCCT GGTCGTACAG GTAAACCTGG CTCTGGGAGC CTCCTTCCTC GCCAAGGAAC ATCACGCCAT TGTCACGTCC ATTCCCGCTC TACAAGATAT TGCCTCCATG TCCATGTTGT GCAGGTACGT TGGCGTGTGT GTATGTGTGT ATGCAAGTGT ATATATGTAA TAACAGCACG TTCAATACTC ACGTTTCGGT TTATTATTTG CTGGGAATAC TCCAGTGACA AAACAGGGAC ACTCACCACG GCCAACATGT CCGTGATTCC GGAACAAGTC TTTGCGGCCG AGGGCTTTAC CACGGAGCAA GTCCTGTTGT ACGCCTACCT GTGTTCCAAT CCCGACAAAA AAGACGATCC GATTGATCGA GCCGTGGTTG CCGCCTTTTT ACAGTCGGCC AAAGCCAACG AAAAAGACGA TTACGTGCAA ACCGAAATCA TCGGGTTCAA CCCCACCGTG AAACGCGTGG TGGCCTTTGT CGGACACGGC AACGAAACCA TCACGATTGC CAAGGGCTTG CCGGCCAAAA TTGTCAATAC CCAAGCTGGA GGCGAGGACG ATCACGAACT TCAATGGCAG GTCAACCGGG CGGCCGATCG GGACTTTCTC GATCGCGTGG GAAACGTCGA TACGGGTTTG AGCAAGGCGG GGTACAAGAC GATCGGGATT GGCGTCTGTT TCGGTAACGC TCGGACAATG AAGAATCCGG TCTGGAAATT TGCCGGACTC GTACCAATGT TGGATCCTCC ACGGGAAGAT ACCCGGGCGA CAATTGAATC GCTCCATCAC GCCAACATTT CGATCAAAAT GATCACGGGC GATCATCAGA ATGGTATGTA TCTCTTTCGA AAGGACCGGT ATGCTAGTAG TTCCCCTTCG GGCTTTCTCT 
TTTGACACCG CTGCATTTTG ACTACAGTGG GAAAAGAGAC CGCTCGTTTG ATCGGTCTGG GAACGGACAT TCGAACCGGA GAAGAGATTC GTCATGCATC TAGTCAGGAT AAGAAACGAC TTGTATGGGA GGCGGACGGC TTCGCGGCAG TTCTACCGAG TGACAAACGT GAGGTTGTCA TGATTCTCCG CAACGAATAC GGGATTGTGA CAGGCATGAC TGGGGATGTA AGTGATGATG TTGGTGTTGC CAGGATGTGG TCGTTTCATC TTACTCATTT GAAACCAATT TTGGTCACCG TATCGTCTAG GGTGTTAACG ATGCTCCAGC CCTCTCGGCC GCTCAGGTGG GAATTGCCGT TGAAGGAGCC ACCGACGCCG CCAAGAATGC GGCAGACCTC ATTCTTACCG AGCCCGGTCT CAGTCCGATT TATGGTGCGG TTCTGGAGTC CCGCCGTATC TTTCTACGCA TTAAAGGATA CGTGATTTAT CGCGTAGCAG CGTCGATTAT CATGGTCCTG ACTCTCTCCA TCATCATATT TGCGTCGGGC TGCGCAGTTG ACTCATTGTT GGTCATCATA TTGGCATTGT TGAATGATAT TTCTATGATT CCTGTGGCAT ACGACAATGC GTCCGCCACA ACGAAACCAC AGCTTCCTCG AGCCAGCAAG TTGGTGCTAA TGTCTCTCTA CTACGGCATT TGCCAGACTG CGTTGGGTCT TTCTTTTATC TTCATCATGG ATCATGCTAA AGATTTGGAC GGGCCAATTG CACTCAATAG AGCGTGTTCG TCGGAAACGC GAGGTTTTAT TTGGTTTCAT TTGACCCTGG TTACGGAACT TATGATTTTC TCGGTGCGTG CCCCCGGATC CATGCTGTAC TCGACACCTT CCATATTTTT GATTATTTCA GTGTTGGGCA CTTGTGCCGG TAGTGCATTC ATTGCAATGT ACGGGAGTGA GTTGTCGGGG TTAAATGTGG TTTGGATTCT TCTTTTCAAC CTCGGAACAT TGGTATTGGT TGATTTCGGC AAAATCATGT TTCGTGCACT TATTGGTGAG GAGCCCGGCG ACATCATTGC AAGTGATGAG CTTTTGTCCG TTTCTCCCAT CAAGACAGAA ACGGCGAAGA ACTTGGAAAA GAAACTGCGA TATGTTGTAC ACAACGAATC GCTGTTAGAA CCAGAGGATC GTCAGCACAT GGTCCAAGTG CGACGTCGGC GCCGTATGAT GTCGGAAGGG TTTTTCTCGG GAGACGGGGG CGGCTGGAAT CAAGGCTTTG TTGATCGGCG TCGGGTTAAC AGCATCTTGG CGCAGCAGGG ATCCGCTTTT TTTCAGAATC GAGGGGATCG CTCGAAGCAG ACTACCATGC CTTGGTAGGA ATTACAAGTC TACGACAGAG AGTGAAGAAG CAATCGTCAA CAAAAAGTGA TTTCTACCGT AATTGTTAAA GGAAAGTCGT TTGCGTT
|
Protein sequence | MPDLEEGGRL TETAAIGGGT HPTTSTIPQQ PTEGEYDWVP TDPTSGLTSE QVAQALARYG PNEIPVPDTP LYLLFVRQFV GFLPFLIELA AIVSLAVQDY IDFGIILGIL LVNGCLGFRE EYHAKKSLQA VSASLDSEIA VRRDGLTASL LVKQLVPGDI VFLVGGTIVP ADVLWISGDV VQLDTAALTG EPLPRKYPSA EHGRTLLSGT TVTAGECYGQ VLRIGTATEI GQAQVDILQD KSVRIVSVFQ QKIMKVVQML IAGSLIVVLA VLLVKGIVYD GFDDNVKETI LDALSILIAS IPVALPLVVQ VNLALGASFL AKEHHAIVTS IPALQDIASM SMLCSDKTGT LTTANMSVIP EQVFAAEGFT TEQVLLYAYL CSNPDKKDDP IDRAVVAAFL QSAKANEKDD YVQTEIIGFN PTVKRVVAFV GHGNETITIA KGLPAKIVNT QAGGEDDHEL QWQVNRAADR DFLDRVGNVD TGLSKAGYKT IGIGVCFGNA RTMKNPVWKF AGLVPMLDPP REDTRATIES LHHANISIKM ITGDHQNVGK ETARLIGLGT DIRTGEEIRH ASSQDKKRLV WEADGFAAVL PSDKREVVMI LRNEYGIVTG MTGDGVNDAP ALSAAQVGIA VEGATDAAKN AADLILTEPG LSPIYGAVLE SRRIFLRIKG YVIYRVAASI IMVLTLSIII FASGCAVDSL LVIILALLND ISMIPVAYDN ASATTKPQLP RASKLVLMSL YYGICQTALG LSFIFIMDHA KDLDGPIALN RACSSETRGF IWFHLTLVTE LMIFSVRAPG SMLYSTPSIF LIISVLGTCA GSAFIAMYGS ELSGLNVVWI LLFNLGTLVL VDFGKIMFRA LIGEEPGDII ASDELLSVSP IKTETAKNLE KKLRYVVHNE SLLEPEDRQH MVQVRRRRRM MSEGFFSGDG GGWNQGFVDR RRVNSILAQQ GSAFFQNRGD RSKQTTMPW
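The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display could be turned back into a contiguous string, and how a GC fraction could be computed from it, is shown below. The helper names are hypothetical, and the sample uses only the first three blocks of the gene sequence, so its GC fraction is not the value reported for the whole gene.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above.
sample_blocks = "ATGCCGGATC TGGAAGAAGG AGGCCGTCTC"

sample = normalize_sequence(sample_blocks)
print(sample)
print(len(sample), "bases")
print(f"GC fraction of sample: {gc_fraction(sample):.2f}")
```

Applying the same two functions to the full gene sequence would give a figure to compare with the GC content field, bearing in mind that the record reports that value rounded to a whole percent.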