Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54805 |
Symbol | |
ID | 7203062 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 161199 |
End bp | 164883 |
Gene Length | 3685 bp |
Protein Length | 1181 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | atpase2-p5 |
Protein accession | XP_002182337 |
Protein GI | 219124074 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGTGC ACCTAAGTGT CGTTTTACTG TCGCAGTGGA ACGTAATTTT CCAGGCAACG ATCGGGTATC AATTGGCGGA GCGATCAAGG GATAAAATAG CGTCGTGGAC GCACGCATTG GTACAATCTA CCCACTCTGG TCTTAGTAAT ATAGGGGATC AAGATGCCGG TATTGTCGTA GTACAAAAAG ACGAAAATGA CATAGTTCAA ATTGTGTTCC ATGATACCAC CTTCCGATGC CGAGTAGAAG ACGTGGACTG GGATATACTA CTCTGGCAAT CAACGGAGAT AACGGCATGT CACGCCACGA AAACATCGCC AGTCCCCCAA TTTCGTTTGC TACGCTATCC AGTCGACTTG CCACGGCAAT TTTACGCGTC CTGGAATGGA CACTCTAGTC TCGAACAGGT CCGAATTGCC AGTCAAGTAT ACGGTTCCAA TCAAACGCTG TTACAATTGC CTACGTTTCA GCAGCTGCTT GGTGAACAAC TGGTCGCACC TTTTTTCCTC TTTCAAATCT TTTGCGTTGT CCTGTGGTCG TTGGACGAGT ATTGGTACTA CGCTATATTC ACATTGTTCG CGCTACTCAT GTTCGAATCT ACCGTCGCCT ACAATCGGCT CCAATCCCTG CAACGACTGC ATCGAGCCGG CCACAAGGGC GATCAGCGAA TTTGGGTACA ACGGGGCATC GCACCGACAA CGGCAGCAAC AGACAAAACC AATTTGAGAC TGCAATGGAT GTACGTACCG ACCAAAGAAT TGGTACCGGG CGATATGGTA TCGCTGAGTG TGGCTCAGGA CGGGACACCT ACCAACGTTC CGGCCGATTT GTTGCTGGTT AAGGGCACTG CCGTTTGTGA CGAGGCCTTG TTGACTGGAG AATCCGTCCC TCAGCTCAAA CAAGCATTGG ATGTGAGTAA GGGCAATTCA TCAATGCGAC TGGATCTACA AGACAATGCC TGCAAAGAAT CGATTTTGTT CGGCGGTACC AATTTGCTGG TCGGATCATC TTCGACCGAG GAAGCTCTAG ATGAAAAGGG TACGATTACT CCAGATAAAG GCGTCAAGTG CATTGTTCTT CGGACTGGCT TTGAAACTGC CCAAGGTAGC CTGCTGCGGA CGATGGCGCA TTCTTCTAGA AGTGCCGATG GTGTTCACAC CTGGGATACT TTTGTTTTTA TCTTGATGCT CATTATATGT GCCATTGGCG CCGCGACGTG GGTGCTCAAC GAGGGATGGT ACGACGAGCG TCGCAATAGA TTTCGCCTTG TACTGCATGT TGTAATTATC GTGACGTCAG TGGTTCCGCC CGAGCTGCCT ATGGAGTTGT CTCTTGCTGT TACTAACAGC GTAGCGGCTT TAATACAACG CTCACAAGTC CATTGTACGG AACTCTTTCG CATTCCGTGG GCTGGTGAAG TCGACGTTTG CTGCTTCGAC AAAACAGGGA CGTTGACTAG CGATGAGATG CGTTTGCGAG GCGTTCGCCT TTTTGAAAGC AATGGCAACA CCACCAAGGA TGAGGAAACT GGATTGGTGC ATCCGGATGA TACTGACCTG CCGTGGCCAG TGACGCGAGT CATGGCTGCT TGTCACTCTC TGGCACTGGC CGGATTTCAA CGAGGAAACA AGCTGCCACG TGTGGTTGGT GATCCTCTGG AACAAGCTGT GCTGTCTCAC ACGGGATATC GTCTGGTAGG GAACAATGTC ATTACGCATG TTGATCCTAC GAGCTCACCA ATCATTTGCA AATCGATGAC GATCCTCCAC CGATTCTCGT TCTCTTCAAA GCTCAAGCGT ATGACAGTTC TGGTCTCCGA AGAGGGAGGG GAAGGTGCTG TGTGGGCCCT CAGTAAGGGA GCACCGGAGA CAATCAAACA GCTCCTATCG CCCGATGCAA TTCCATCCAA CTACGACGAA GTTTCTTTCT ACCATATGAG TAGAGGTCGA CGTGTTCTGG CTATGGCGTA CCGAGAAGCC GGAACCATTC ACAAGTTGCA AGCATTGAAG AACCTTGGTA GAGATAGTGT CGAACGAAGA CTTCTCTTTG CTGGATTCTT GGTGCTCGAT TGTCCGCTAA AGCCGGATTC GAAGTCGGTT GTGGCTGAGC TACAAGCAAG TGGGCACAAG GTCGTGATGA TTACGGGTGA CGCGATTTTG ACTGCCGGCG AAGTTGCTAG ACAAGTGGGA ATTGTACCAG GAGAGTCCTC TAGAAAGGAA CACTTATACC GTATAAGAGA ACGAAAAGAA AAGCCTACGC GCTCGCCCGA TGTACTCACA GCTTTTGAAT GTGTAGCCCT AAGAGAGAAG GATGGCGATT TCCATCCAAT TATTTTGTCG AAGGAAAAGA CTAGAACTTT TGCCGAAAAA AGTAAGCTGA ATGGCGCTTC TTTTTGTATT TCGGGGGATG TTCTGACCAA GATCGCCGAG GCGGCGCTGC AGTCAGAAGG CCCTCCAAAT TTATCAAACA GTTCGACGGC CGATGAAAAA CAAATTCTTC TTCACCCCGT AGCACAAGCC GTATTGAAGG ATCTAGTGCC GTTGATTTCT GTGTTTGCCC GACACGCGCC GCACCAAAAG GAAGCCGTTG TTGCGGCTTT CAATCACGGT GGCTACCACA CACTCATGTG CGGGGACGGG ACGAACGATG TTGGTGCTCT GAAACGCGCT CACGTGGGCA TTTCCATCAT CAGTGCGCCT GAAGTGGAGG CCAAGCAGCG TAAGGCGGCT AAAAAAATGT CCAAACTGAA AAAGAGTGCA AAAAGGAACG GTACTGCAAC TCGAGCACGT CCGACAAACA ATGCCTGGGA GGAATCGTTG CATCAATTGC AGGAAGCCCA AGAGGAACTC GACAACGTCG AGCTTGGTGA TGCGTCAGTT GCAGCACCCT TTACTTCCCG TGCCGTAAGT ATAAAGTGCT GCAAGGATGT CATTCAACAA GGGCGTTGTA CTTTGGTGAC CATGCTACAG ATTTACAAAA TTCTGGGTGT GAATTGCTTG GTGAACGCCA TGGTACTGAG CAAGCTGTTC CTACACGGCG TCAAACAAGG CGATCGGCAG TTGACAGTCC TGGGGTTGGG CGTGGCGGCA CTCTTTTTCT TCGTTACGCG CGCCGAGCCG CTACCAACGC TGTCCCACAC CCGCCCTCCC GTGTCGGTCT TGTCCAGGCA AGCACTCTTG TCAATTGGGT TGCAGTTTGC CGTACACATT GTCGCAATCC TGTTGGCCAC GGAAACGTCG TTACGATTGG TGGATCCGTA CGATCCTTCG CTCGTCCCCG ATGGTCCATT CAACCCTAAC GTACTCAATA CGTGCACCTT TCTATTGACG TGCGTGTCAA CAATCAACAC ATTCGCCGTC AACTACCGTG GTCGTCCGTT TATGCAAGAT TTGCGGGAGA ACCGAATGTT GTACCGTTCG CTGCAGCTTA GTTACTTGAT ACTAGCTTTG AGCGTATGGG AAGTGTTTCC ACCCTTGAAT GATTTATTGC AGTTGACGGC GTTACCCAAC GTGACTGAAT TGTTAACCCT GCAGGAGGGT GGCAGCAATA GTGTCGGGGT GCCGTTGCCG TGGATGCCAC TGATCCAAAC GGTAGGATTT CCGGCATTCC TTAGTGGTCT CATGGTCGTT GATACGGTAC TAGCCTTTCA AGTGGAATCC ATGGTACTCC GCTACGTTCC GGATT
|
Protein sequence | MLVHLSVVLL SQWNVIFQAT IGYQLAERSR DKIASWTHAL VQSTHSGLSN IGDQDAGIVV VQKDENDIVQ IVFHDTTFRC RVEDVDWDIL LWQSTEITAC HATKTSPVPQ FRLLRYPVDL PRQFYASWNG HSSLEQVRIA SQVYGSNQTL LQLPTFQQLL GEQLVAPFFL FQIFCVVLWS LDEYWYYAIF TLFALLMFES TVAYNRLQSL QRLHRAGHKG DQRIWVQRGI APTTAATDKT NLRLQWMYVP TKELVPGDMV SLSVAQDGTP TNVPADLLLV KGTAVCDEAL LTGESVPQLK QALDVSKGNS SMRLDLQDNA CKESILFGGT NLLVGSSSTE EALDEKGTIT PDKGVKCIVL RTGFETAQGS LLRTMAHSSR SADGVHTWDT FVFILMLIIC AIGAATWVLN EGWYDERRNR FRLVLHVVII VTSVVPPELP MELSLAVTNS VAALIQRSQV HCTELFRIPW AGEVDVCCFD KTGTLTSDEM RLRGVRLFES NGNTTKDEET GLVHPDDTDL PWPVTRVMAA CHSLALAGFQ RGNKLPRVVG DPLEQAVLSH TGYRLVGNNV ITHVDPTSSP IICKSMTILH RFSFSSKLKR MTVLVSEEGG EGAVWALSKG APETIKQLLS PDAIPSNYDE VSFYHMSRGR RVLAMAYREA GTIHKLQALK NLGRDSVERR LLFAGFLVLD CPLKPDSKSV VAELQASGHK VVMITGDAIL TAGEVARQVG IVPGESSRKE HLYRIRERKE KPTRSPDVLT AFECVALREK DGDFHPIILS KEKTRTFAEK TQAVLKDLVP LISVFARHAP HQKEAVVAAF NHGGYHTLMC GDGTNDVGAL KRAHVGISII SAPEVEAKQR KAAKKMSKLK KSAKRNGTAT RARPTNNAWE ESLHQLQEAQ EELDNVELGD ASVAAPFTSR AVSIKCCKDV IQQGRCTLVT MLQIYKILGV NCLVNAMVLS KLFLHGVKQG DRQLTVLGLG VAALFFFVTR AEPLPTLSHT RPPVSVLSRQ ALLSIGLQFA VHIVAILLAT ETSLRLVDPY DPSLVPDGPF NPNVLNTCTF LLTCVSTINT FAVNYRGRPF MQDLRENRML YRSLQLSYLI LALSVWEVFP PLNDLLQLTA LPNVTELLTL QEGGSNSVGV PLPWMPLIQT VGFPAFLSGL MVVDTVLAFQ VESMVLRYVP D
|
| |