Gene PHATRDRAFT_44474 details


Gene Information

Locus tag: PHATRDRAFT_44474
Symbol: ATPase1-2B
ID: 7197757
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 678687
End bp: 682138
Gene Length: 3452 bp
Protein Length: 1089 aa
Translation table:
GC content: 52%
IMG OID:
Product: P2B, P type ATPase
Protein accession: XP_002178277
Protein GI: 219114963
COG category:
COG ID:
TIGRFAM ID:
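
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here (the record does not state it), and the helper name `span_length` is purely illustrative. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(678687, 682138)
print(gene_length)  # 3452, matching the Gene Length field
```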


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTCGA AAACAGCGAG TCATAGTTCC ACCGCGAATC CACACGGCAA GTCACGGGTC 
GCCGAGACGA TCCGCATGTT GGAAGCCGAA TCTTCCGATC TACGGGGTTT ACTGCAGTCT
ATTAACGAAG CTCAATCCTT GGAATCGAAC CGTCACAACC TCGAGACTGT CCTCGAAGGC
CCTCAGGGTC TGGCACGTCG CTTGGGAACG GACCCCAAAG CCGGTCTCGA CCGAGAAACC
ATCGAAACCC GCCGAGCCTG TTTCGGAGCG AATCGCCTGC CCTCCGCCCC ACGCAAGACC
TTTGGACAGC TCTTTCTCGA CACCTTCGAC GACGCTACGC TGCAGATTCT TATCGTGGCC
GCACTCGTGT CGCTCGCCGT TGGCTTGTAC GACGACCCCG CCACGGGCTA CGTCGAAGGC
TGCGCCATTC TCGCCGCCGT CCTCGTCGTA TCCTTCGTGA CGGCCGTCAA CGACTTTCAA
AAGGAATCGC AGTTTCGCGA ATTGTCCGCC GCCAACGATG CGGTAGACGT TCTTGTAGTC
CGGAACAACG TACACTGGCA AATACCGGTG GATGAATTAG TCGTGGGGGA CGTTGTCTGC
GTCGAAGCCG GCGACCAAAT ACCCTGCGAC GGAGTTCTCC TTGTCGCCGA TGATGTCCAA
GTGGACGAAT CTGCCTTGAC GGGAGAACCG ACTGACGTGG ATAAAAGTTT ACAAAATGAC
CCTTTCGTCC TCTCCGGATG CACCATGGAA GCCGGGACGG CCAGATTCCT GGCCATTGCC
GTCGGCAAAG ACTCGCAGTG GGGTATCATC AAAGCGCATT TAGACAAAGA ACACTCGCAG
ACGCCGCTAC AGGAGAAGCT GGACGATATG GCGGCCATGA TCGGATACAT TGGTATGGCG
GCCGCCGCCG CTACTTTTTT AGCCATGATG TTCATCAAGG TCGTTCTCAA ACCGTCCTAC
CTGGCTCATA TTTCCGTCTT TAATTACGCG CTGGAAGCCT TTATTATCGG CGTTACCATT
GTGGTAGTGG CCGTCCCGGA AGGTTTGCCT CTGGCGGTGA CCATTTCCCT TGCCTTTTCT
ACCAAGAAAA TGTTGGCCGA CAAGAACCTC ATTCGTCACT TGTCTGCTTG TGAGACCATG
GGCAACGCCA CAAACATCTG CTCCGACAAG ACGGGAACCT TGACCGAAAA TCGTATGACG
GTGGTGAAAG GAATCTTTGC CGACACGAGG TGTGACGACA CCATCAATCG GGTCCCGGTC
TTGATCAACA AAAAGGCACT CGAGGTTATT TTGGAAGGAA TTGCCTGTTG TTCTACCGCC
AAAGTGATAC CAGCGCAAGC GGCCGTTGCC AACGAACACG GTATTGACGA TCTGCATCTC
GTCGACGATC GCCCCCACAT TATTGGCAAC AAGACCGAAG CCGCCTTGCT TATCCTGGCG
CGTTCATCCT GGACCCCACA CGACGATACC GACCAACGTC GCGTCGACGC CAACTTTGGA
GCGGAAGGTG GCTCTCGTCT CTTTCCTTTT TCGTCGTCCC GGAAGTGCAT GACGGTGTTC
GTTACCAAGG ACGAAGCTGC AGTCTCGGAT ACCAGCATTC GTACTCGTCG TGCGACCAAA
AATGTCCAAT CTTATACATT GTACCATAAA GGAGCGGCCG AAATTGTGTT GGATAAATGC
ACCAAATACC TGGACATTGA CGGTACCGAA AAGGAAATGT CGGACCAGAA GCGAGAAGAA
TTTGCCAAGC TCATTCGCGA ATTTGCTTCG CAGGCACTGC GGTGTGTGGC ACTAGCTCAT
CGTCGGGACA TACAGAACGT GGTCGATCCG CAAACGGTCA CACAACAGGA TTGTGAAAAA
AAGTTGGAAA AGGAAATGTG CCTCGACGCC ATTGCGGGTA TCATGGACCC CCTGCGGCCG
GACGTTGTTG AAGCCGTGGC TATTTGTCAA CGTGCGGGCA TCTTTGTTCG TATGGTGACG
GGGGATAACC TAGACACGGC GGAAGCGATT GCTAGACAAG CTGGTATTCT GACGGAAGGT
ACGTTGTGTC GGATCGAAGC TTGCACCTAC GTTAAGACCT GTCTTATTTT CGATCAACAT
GATCGATCTT AGTTGTCTAA CTTCAGTCAA TCTAACCCTT TGACTTACTC TCGTTGCTTC
CTGCAGGCGG TATTTCCATG ATTGGCGAAA AATTCCGCAA GCTAACGCCC GCCCAACTCG
ACGAAATTCT TCCTCGTTTG CAAGTGTTGG CCCGCTCAAG TCCGGAAGAC AAGCATACAC
TCGTCCAACG TTTGAATGGT GCAGCCATCC CGTCGACCGA ATCAGAATGG TGTGAAGCGC
ACCCGAACAA AGACTTTGCG ACCCAACGTA ATTTGTTACT TCCGGGCTAC AAGGACGAAT
GGGCCAAGAG CCGATTTGGC GTCGGGGAGG TCGTTGGAGT CACAGGAGAT GGAACGAACG
ATGCTCCCGC ACTCAAAGCG GCTGATGTCG GTTTGTCCAT GGGATTAAGC GGAACGGATG
TAGCCAAAAA AGCTTCAGAT ATTATTATTA TGGACGACAA CTTTGCTTCC ATCGTCCGCG
CCGTACTTTG GGGACGCTCG GTCTTCGACA ACATTCGCAA GTTCCTTCAG TTTCAGTTGA
CGGTCAACGT TGTTGCCTTG ACAATTACCT TTTTGGCTGC CGTCGTGGGG TACCAACCTC
CCCTCAACGC GGTCATGATG CTGTGGGTCA ATCTGATCAT GGACACCATG GGTGCACTCG
CTCTCGGTAC CGAGCCACCT CTCAAGGAGC TTTTGGACCG CCGTCCATAC CGCCGCGATT
CCAGCTTAAT TAGCCGTCCA ATGTGGCGCA ACATTTTGTG CCAAGCCGTC TTTCAGCTTT
CACTCCTGGT CTTTTTGCTA AACAAGGGGC CCGCCATGTT TGAATGCGAA GACGGTTCCA
GACATCACTT TACTATTCTT TTCAACGCTT TTGTGTTTTG CCAGGTGTTC AACGAGTTCA
ATGCACGTGA AATTGGAGAT CGCTTTGATC CACTTCGTTC CTTGTCCGAG AGTCCCATGT
TTTTACTAGT AATTGTCTTT ACCATGGTGG CACAATGGGC AATCGTGGAG TTCGGAGGTG
ACTTCACACA GACGTATCCG TTGAGTTGGG AAGAGTGGAA GATCACCGTT GGTCTCGGAG
CGATATCTTT GCCCGTCGGT TTCTTCATGC GATTGATCCC CGTTTCAGAA GATCCCTCTA
CGTTCGCCGG TATTGAGCGA AAGAACGGCA AGCCCAAAGA GATCTGGTTG TTGACTCCTC
TTCTGATGAT TCTGGTTCCA GTCTTCGTGG CCATTGTCTA TCAATTGGCT TACGAAATTG
ACGAGTTTGC CCATGAGACC GAGCATCATT TACCATAGTT GGAGAATATT AGGAATTCAT
ACTAAGAAGT TAAACGGAGC TTTGTCACCC TG
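
The sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below strips that layout and computes a GC fraction using the conventional definition (G plus C over all A/C/G/T bases). The record does not say how its GC content value was derived, so this definition is an assumption, and the function names `clean_sequence` and `gc_fraction` are illustrative only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; 0.0 if there are none."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCGTCGA AAACAGCGAG TCATAGTTCC ACCGCGAATC CACACGGCAA GTCACGGGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(round(gc_fraction(seq) * 100))  # GC percentage of this line only
```

Passing the full gene sequence through `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick check that nothing was lost in copying.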
 
Protein sequence
MPSKTASHSS TANPHGKSRV AETIRMLEAE SSDLRGLLQS INEAQSLESN RHNLETVLEG 
PQGLARRLGT DPKAGLDRET IETRRACFGA NRLPSAPRKT FGQLFLDTFD DATLQILIVA
ALVSLAVGLY DDPATGYVEG CAILAAVLVV SFVTAVNDFQ KESQFRELSA ANDAVDVLVV
RNNVHWQIPV DELVVGDVVC VEAGDQIPCD GVLLVADDVQ VDESALTGEP TDVDKSLQND
PFVLSGCTME AGTARFLAIA VGKDSQWGII KAHLDKEHSQ TPLQEKLDDM AAMIGYIGMA
AAAATFLAMM FIKVVLKPSY LAHISVFNYA LEAFIIGVTI VVVAVPEGLP LAVTISLAFS
TKKMLADKNL IRHLSACETM GNATNICSDK TGTLTENRMT VVKGIFADTR CDDTINRVPV
LINKKALEVI LEGIACCSTA KVIPAQAAVA NEHGIDDLHL VDDRPHIIGN KTEAALLILA
RSSWTPHDDT DQRRVDANFG AEGGSRLFPF SSSRKCMTVF VTKDEAAVSD TSIRTRRATK
NVQSYTLYHK GAAEIVLDKC TKYLDIDGTE KEMSDQKREE FAKLIREFAS QALRCVALAH
RRDIQNVVDP QTVTQQDCEK KLEKEMCLDA IAGIMDPLRP DVVEAVAICQ RAGIFVRMVT
GDNLDTAEAI ARQAGILTEG GISMIGEKFR KLTPAQLDEI LPRLQVLARS SPEDKHTLVQ
RLNGAAIPST ESEWCEAHPN KDFATQRNLL LPGYKDEWAK SRFGVGEVVG VTGDGTNDAP
ALKAADVGLS MGLSGTDVAK KASDIIIMDD NFASIVRAVL WGRSVFDNIR KFLQFQLTVN
VVALTITFLA AVVGYQPPLN AVMMLWVNLI MDTMGALALG TEPPLKELLD RRPYRRDSSL
ISRPMWRNIL CQAVFQLSLL VFLLNKGPAM FECEDGSRHH FTILFNAFVF CQVFNEFNAR
EIGDRFDPLR SLSESPMFLL VIVFTMVAQW AIVEFGGDFT QTYPLSWEEW KITVGLGAIS
LPVGFFMRLI PVSEDPSTFA GIERKNGKPK EIWLLTPLLM ILVPVFVAIV YQLAYEIDEF
AHETEHHLP