Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44474 |
Symbol | ATPase1-2B |
ID | 7197757 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 678687 |
End bp | 682138 |
Gene Length | 3452 bp |
Protein Length | 1089 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | P2B, P type ATPase |
Protein accession | XP_002178277 |
Protein GI | 219114963 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGA AAACAGCGAG TCATAGTTCC ACCGCGAATC CACACGGCAA GTCACGGGTC GCCGAGACGA TCCGCATGTT GGAAGCCGAA TCTTCCGATC TACGGGGTTT ACTGCAGTCT ATTAACGAAG CTCAATCCTT GGAATCGAAC CGTCACAACC TCGAGACTGT CCTCGAAGGC CCTCAGGGTC TGGCACGTCG CTTGGGAACG GACCCCAAAG CCGGTCTCGA CCGAGAAACC ATCGAAACCC GCCGAGCCTG TTTCGGAGCG AATCGCCTGC CCTCCGCCCC ACGCAAGACC TTTGGACAGC TCTTTCTCGA CACCTTCGAC GACGCTACGC TGCAGATTCT TATCGTGGCC GCACTCGTGT CGCTCGCCGT TGGCTTGTAC GACGACCCCG CCACGGGCTA CGTCGAAGGC TGCGCCATTC TCGCCGCCGT CCTCGTCGTA TCCTTCGTGA CGGCCGTCAA CGACTTTCAA AAGGAATCGC AGTTTCGCGA ATTGTCCGCC GCCAACGATG CGGTAGACGT TCTTGTAGTC CGGAACAACG TACACTGGCA AATACCGGTG GATGAATTAG TCGTGGGGGA CGTTGTCTGC GTCGAAGCCG GCGACCAAAT ACCCTGCGAC GGAGTTCTCC TTGTCGCCGA TGATGTCCAA GTGGACGAAT CTGCCTTGAC GGGAGAACCG ACTGACGTGG ATAAAAGTTT ACAAAATGAC CCTTTCGTCC TCTCCGGATG CACCATGGAA GCCGGGACGG CCAGATTCCT GGCCATTGCC GTCGGCAAAG ACTCGCAGTG GGGTATCATC AAAGCGCATT TAGACAAAGA ACACTCGCAG ACGCCGCTAC AGGAGAAGCT GGACGATATG GCGGCCATGA TCGGATACAT TGGTATGGCG GCCGCCGCCG CTACTTTTTT AGCCATGATG TTCATCAAGG TCGTTCTCAA ACCGTCCTAC CTGGCTCATA TTTCCGTCTT TAATTACGCG CTGGAAGCCT TTATTATCGG CGTTACCATT GTGGTAGTGG CCGTCCCGGA AGGTTTGCCT CTGGCGGTGA CCATTTCCCT TGCCTTTTCT ACCAAGAAAA TGTTGGCCGA CAAGAACCTC ATTCGTCACT TGTCTGCTTG TGAGACCATG GGCAACGCCA CAAACATCTG CTCCGACAAG ACGGGAACCT TGACCGAAAA TCGTATGACG GTGGTGAAAG GAATCTTTGC CGACACGAGG TGTGACGACA CCATCAATCG GGTCCCGGTC TTGATCAACA AAAAGGCACT CGAGGTTATT TTGGAAGGAA TTGCCTGTTG TTCTACCGCC AAAGTGATAC CAGCGCAAGC GGCCGTTGCC AACGAACACG GTATTGACGA TCTGCATCTC GTCGACGATC GCCCCCACAT TATTGGCAAC AAGACCGAAG CCGCCTTGCT TATCCTGGCG CGTTCATCCT GGACCCCACA CGACGATACC GACCAACGTC GCGTCGACGC CAACTTTGGA GCGGAAGGTG GCTCTCGTCT CTTTCCTTTT TCGTCGTCCC GGAAGTGCAT GACGGTGTTC GTTACCAAGG ACGAAGCTGC AGTCTCGGAT ACCAGCATTC GTACTCGTCG TGCGACCAAA AATGTCCAAT CTTATACATT GTACCATAAA GGAGCGGCCG AAATTGTGTT GGATAAATGC ACCAAATACC TGGACATTGA CGGTACCGAA AAGGAAATGT CGGACCAGAA GCGAGAAGAA TTTGCCAAGC TCATTCGCGA ATTTGCTTCG CAGGCACTGC GGTGTGTGGC ACTAGCTCAT CGTCGGGACA TACAGAACGT GGTCGATCCG CAAACGGTCA CACAACAGGA TTGTGAAAAA AAGTTGGAAA AGGAAATGTG CCTCGACGCC ATTGCGGGTA TCATGGACCC CCTGCGGCCG GACGTTGTTG AAGCCGTGGC TATTTGTCAA CGTGCGGGCA TCTTTGTTCG TATGGTGACG GGGGATAACC TAGACACGGC GGAAGCGATT GCTAGACAAG CTGGTATTCT GACGGAAGGT ACGTTGTGTC GGATCGAAGC TTGCACCTAC GTTAAGACCT GTCTTATTTT CGATCAACAT GATCGATCTT AGTTGTCTAA CTTCAGTCAA TCTAACCCTT TGACTTACTC TCGTTGCTTC CTGCAGGCGG TATTTCCATG ATTGGCGAAA AATTCCGCAA GCTAACGCCC GCCCAACTCG ACGAAATTCT TCCTCGTTTG CAAGTGTTGG CCCGCTCAAG TCCGGAAGAC AAGCATACAC TCGTCCAACG TTTGAATGGT GCAGCCATCC CGTCGACCGA ATCAGAATGG TGTGAAGCGC ACCCGAACAA AGACTTTGCG ACCCAACGTA ATTTGTTACT TCCGGGCTAC AAGGACGAAT GGGCCAAGAG CCGATTTGGC GTCGGGGAGG TCGTTGGAGT CACAGGAGAT GGAACGAACG ATGCTCCCGC ACTCAAAGCG GCTGATGTCG GTTTGTCCAT GGGATTAAGC GGAACGGATG TAGCCAAAAA AGCTTCAGAT ATTATTATTA TGGACGACAA CTTTGCTTCC ATCGTCCGCG CCGTACTTTG GGGACGCTCG GTCTTCGACA ACATTCGCAA GTTCCTTCAG TTTCAGTTGA CGGTCAACGT TGTTGCCTTG ACAATTACCT TTTTGGCTGC CGTCGTGGGG TACCAACCTC CCCTCAACGC GGTCATGATG CTGTGGGTCA ATCTGATCAT GGACACCATG GGTGCACTCG CTCTCGGTAC CGAGCCACCT CTCAAGGAGC TTTTGGACCG CCGTCCATAC CGCCGCGATT CCAGCTTAAT TAGCCGTCCA ATGTGGCGCA ACATTTTGTG CCAAGCCGTC TTTCAGCTTT CACTCCTGGT CTTTTTGCTA AACAAGGGGC CCGCCATGTT TGAATGCGAA GACGGTTCCA GACATCACTT TACTATTCTT TTCAACGCTT TTGTGTTTTG CCAGGTGTTC AACGAGTTCA ATGCACGTGA AATTGGAGAT CGCTTTGATC CACTTCGTTC CTTGTCCGAG AGTCCCATGT TTTTACTAGT AATTGTCTTT ACCATGGTGG CACAATGGGC AATCGTGGAG TTCGGAGGTG ACTTCACACA GACGTATCCG TTGAGTTGGG AAGAGTGGAA GATCACCGTT GGTCTCGGAG CGATATCTTT GCCCGTCGGT TTCTTCATGC GATTGATCCC CGTTTCAGAA GATCCCTCTA CGTTCGCCGG TATTGAGCGA AAGAACGGCA AGCCCAAAGA GATCTGGTTG TTGACTCCTC TTCTGATGAT TCTGGTTCCA GTCTTCGTGG CCATTGTCTA TCAATTGGCT TACGAAATTG ACGAGTTTGC CCATGAGACC GAGCATCATT TACCATAGTT GGAGAATATT AGGAATTCAT ACTAAGAAGT TAAACGGAGC TTTGTCACCC TG
|
Protein sequence | MPSKTASHSS TANPHGKSRV AETIRMLEAE SSDLRGLLQS INEAQSLESN RHNLETVLEG PQGLARRLGT DPKAGLDRET IETRRACFGA NRLPSAPRKT FGQLFLDTFD DATLQILIVA ALVSLAVGLY DDPATGYVEG CAILAAVLVV SFVTAVNDFQ KESQFRELSA ANDAVDVLVV RNNVHWQIPV DELVVGDVVC VEAGDQIPCD GVLLVADDVQ VDESALTGEP TDVDKSLQND PFVLSGCTME AGTARFLAIA VGKDSQWGII KAHLDKEHSQ TPLQEKLDDM AAMIGYIGMA AAAATFLAMM FIKVVLKPSY LAHISVFNYA LEAFIIGVTI VVVAVPEGLP LAVTISLAFS TKKMLADKNL IRHLSACETM GNATNICSDK TGTLTENRMT VVKGIFADTR CDDTINRVPV LINKKALEVI LEGIACCSTA KVIPAQAAVA NEHGIDDLHL VDDRPHIIGN KTEAALLILA RSSWTPHDDT DQRRVDANFG AEGGSRLFPF SSSRKCMTVF VTKDEAAVSD TSIRTRRATK NVQSYTLYHK GAAEIVLDKC TKYLDIDGTE KEMSDQKREE FAKLIREFAS QALRCVALAH RRDIQNVVDP QTVTQQDCEK KLEKEMCLDA IAGIMDPLRP DVVEAVAICQ RAGIFVRMVT GDNLDTAEAI ARQAGILTEG GISMIGEKFR KLTPAQLDEI LPRLQVLARS SPEDKHTLVQ RLNGAAIPST ESEWCEAHPN KDFATQRNLL LPGYKDEWAK SRFGVGEVVG VTGDGTNDAP ALKAADVGLS MGLSGTDVAK KASDIIIMDD NFASIVRAVL WGRSVFDNIR KFLQFQLTVN VVALTITFLA AVVGYQPPLN AVMMLWVNLI MDTMGALALG TEPPLKELLD RRPYRRDSSL ISRPMWRNIL CQAVFQLSLL VFLLNKGPAM FECEDGSRHH FTILFNAFVF CQVFNEFNAR EIGDRFDPLR SLSESPMFLL VIVFTMVAQW AIVEFGGDFT QTYPLSWEEW KITVGLGAIS LPVGFFMRLI PVSEDPSTFA GIERKNGKPK EIWLLTPLLM ILVPVFVAIV YQLAYEIDEF AHETEHHLP
|
| |