Gene Information |
Locus tag | PHATRDRAFT_45673 |
Symbol | |
ID | 7200457 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 895981 |
End bp | 899142 |
Gene Length | 3162 bp |
Protein Length | 951 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | beta-galactosidase |
Protein accession | XP_002179741 |
Protein GI | 219117911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
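The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and assuming one stop codon follows the 951 coding codons. The function names `span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the Gene Information table.
START_BP = 895981
END_BP = 899142
PROTEIN_AA = 951

gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_AA)

print("gene span (bp):", gene_len)
print("coding bases incl. stop (bp):", cds_len)
print("non-coding bases within span (bp):", gene_len - cds_len)
```

Under these assumptions the span matches the listed gene length of 3162 bp, and the span exceeds the bases needed to encode 951 residues. That is what one would expect for a gene marked as spliced, although the record itself does not break the remainder down into introns and untranslated sequence.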
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGTTTCCT GTTGTGTTCG AAAGAAGAAA TTTGTGGCTC GACACACCTA TCAATATTGT GTAGTCTTTC TGTGCTTTAT TGCTTCTGTC ATCGAATAGG CCCGTGCATG CACCATGAAG ACTATGCCAT CCCTCGAGTT TCGGCAATCT TTGAAAGGGG GACGACCAAT CCCAGCCGAT GAGCTGCGTC AAAGCCTTTC ACACCAAGAA GTCGAATACG GCTCTTTTCT CCCAAAATCT TCAACTTTCC ATGAAACAGA ATTACGGCCC TGCTCGGAAT TGCAAAAACG CCGTTACCAA CGAGATTTGA TTTTGCTCGC ATGCGCAGCA CTCATCATCT TGTACGTCAG TGGTTTTCCG CCAAGAATTA AGAATGCTGG TATCTTCAAG AAAACTTCCT CAATTATTCC AGAGGGTACT ACATACGCTG CAACAACAGC AGAACTGAAA AACGAGATTC CGGCCTGCGG AGACGAACCA TGTTTTCATC CTGACCGCGT TCAAGTACGC CGAGATCGAC CATATTTCCC ATCGTTTTGG AATTACAATG GTAACCTAAG TGTTTCGTAT GATGAACGTG CAATACGTAT CAATGACAAG CGTGTTTTAC TCTTGTCTGG TAGCATGCAT CCGGTACGCG CGACTCGCGG TACCTGGGAG CATGCTTTGG ACGAAGCTGT CTACAATGGT CTCAATATGA TTACGGTATA TATTTTTTGG GGTGCGCATC AATCCTTTCG TGACGAACCG TTAAACTGGT CCTTGGACGG ATCTAGTATA GGTCCAAAAG AATCTCAATG GGAATTGGCA GACGCTCTCC GGTCGGCGGC CAATCGAGGT CTTTTCATCC ATGTTCGGAT TGGACCATAC GCTTGTGGTG AATATACTTA CGGAGGCATT CCCGAATGGC TCCCTTTGCA AAGTTCGACG ATGCGTATGA GAAGGTTGAA TCGGCCCTGG TTAGACGCTA TGGAGGGTTT CGTAGCTGCT ACAATCACCT ATTTGTCTTC TTTCAATCTT TGGGCTCATC AGGGAGGACC GATTCTCATC GCTCAGATTG AAAATGAACT CGGAAGTGGC GTTGATGGCT CTGCGGCCGC AAATTACGTT GTACTTGAAC GTGATGAGTT CAATGACGAC AAACACGAAG ACTCTCATCT TCTCCAACTT GATCGATACG GGCACATTTT GGAAAATGCA TCGTCTCGCG GTATGGATTC TGAGTTGCGC AATGCAACTG TCCAGGACTA TGCGGACTGG TGCGGCAACC TAGTGGCACG ATTGGCTCCG AACGTCATCT GGACAATGTG TAACGGTCTT TCGGCGGAGA ACACAATTTC GACTTTCAAT GGAAACAATG GGATCGACTG GTTAGAAAAA TATGGAGATT CGGGTCGTAT ACAGGTAGAC CAACCTGCGA TTTGGACTGA AGATGAAGGT ACGTTAATTT TTGCTACCAT CAGACACTTG TACAGCTGTC GATCACACAA CTACCGGTAT CAGTTCTCCA ACATTTTTTC TCCAATTTCA GGTGGATTTC AGCTGTGGGG TGACCAACCC TCGAAACCTA GCGATTACTT CTGGGGTCGC ACATCTCGTG CCATGGCCAC TGATGCTTTG CAATGGTTTG CACGCGGTGG GACGCATCTG AACTATTATA TGTGGTGGGG TGGGTACAAC CGCGGTCGCT CATCAGCTGC CGGGATTATG AATGCGTATG CTACGGATGC TTTCTTGTGC TCATCTGGCC AGCGGCGACA TCCCAAGTAT GATCACTTTC TTGCGCTGCA TTTGGTTATT 
GCTGACATCG CAGCAATTTT GCTACACGCC CCCACGTCAT TGCTCAAAAA TGCTTCGGTA GAGATAATGG ACGGCGACGA TTGGATTGTT GGTGACAATC AACGACAGTT CCTCTACCAA GTTCTGGACA CACACGATTC GAAACAAGTA ATATTTTTGG AAAACGATGC CAACACAACT GAGATGGCTC GACTCACAGG GGCGAAAGCA GACGACTCAT TGGTGTTTGT AATGAAACCA TACTCATCGC AAATTGTAAT CGATGGCATT GTAGCTTTCG ATTCATCCAC TATTTCAACT AAAGCGATGT CTTTCCGGAG GACATTGCAT TATGAACCAG CAGTGCTCCT CCACCTCACA TCCTGGTCGG AGCCAATTGC GGGTGCGGAT ACTGACCAAA ATGCTCATGT CAGTACCGAG CCTCTCGAGC AAACAAATTT GAATTCAAAG GCGTCTATAT CGAGTGACTA TGCATGGTAT GGGACGGATG TGAAGATCGA CGTCGTCCTT TCTCAGGTGA AGTTGTACAT CGGTACGGAA AAGGCTACGG CACTGGCTGT CTTCATAGAT GGGGCGTTCA TAGGAGAAGC AAACAATCAC CAACATGCTG AAGGTCCTAC TGTTTTGTCC ATAGAAATCG AGTCGTTGGC AGCAGGGACG CACCGACTGG CGATTCTTTG CGAATCGCTA GGTTATCACA ATCTAATTGG GCGATGGGGG GCTATCACCA CAGCAAAGCC GAAAGGCATT ACAGGGAATG TTCTCATCGG TTCCCCACTG CTATCGGAAA ATATCAGTCT CGTCGACGGG AGACAAATGT GGTGGTCACT TCCAGGCTTA TCTGTTGAAC GAAAAGCTGC GAGACATGGT CTTCGTAGAG AGAGTTTTGA AGATGCTGCT CAAGCTGAAG CAGGCCTTCA TCCTTTGTGG TCCTCGGTTT TGTTTACTTC GCCGCAATTC GACTCTACAG TGCACTCTTT GTTTCTTGAT TTGACGTCAG GCAGAGGCCA TCTTTGGTTA AATGGCAAAG ATTTAGGCAG GTACTGGAAC ATTACCCGCG GTAATTCTTG GAACGACTAC TCTCAGCGCT ACTACTTTTT GCCTGCCGAC TTTCTTCACC TGGATGGCCA ATTGAACGAG CTTATCTTGT TCGACATGCT TGGTGGGGAT CACTCTGCCG CTAGACTTCT GCTGAGCTCC ATAGAAGAGT CCGAAACGTC CAAATTTTCT GACGAAGTGG ACTTTGCACT TGCGTGTATA TAAGAAAGTG CGTCTACTAG CTCGCGAAGA AAGTACACTC TACTTTTGTT TATTAAAGAA ACTACTCGGT AAGGAATTTA GATAAAGGAT TGACTCTATG AC
|
Protein sequence | MKTMPSLEFR QSLKGGRPIP ADELRQSLSH QEVEYGSFLP KSSTFHETEL RPCSELQKRR YQRDLILLAC AALIILYVSG FPPRIKNAGI FKKTSSIIPE GTTYAATTAE LKNEIPACGD EPCFHPDRVQ VRRDRPYFPS FWNYNGNLSV SYDERAIRIN DKRVLLLSGS MHPVRATRGT WEHALDEAVY NGLNMITVYI FWGAHQSFRD EPLNWSLDGS SIGPKESQWE LADALRSAAN RGLFIHVRIG PYACGEYTYG GIPEWLPLQS STMRMRRLNR PWLDAMEGFV AATITYLSSF NLWAHQGGPI LIAQIENELG SGVDGSAAAN YVVLERDEFN DDKHEDSHLL QLDRYGHILE NASSRGMDSE LRNATVQDYA DWCGNLVARL APNVIWTMCN GLSAENTIST FNGNNGIDWL EKYGDSGRIQ VDQPAIWTED EGGFQLWGDQ PSKPSDYFWG RTSRAMATDA LQWFARGGTH LNYYMWWGGY NRGRSSAAGI MNAYATDAFL CSSGQRRHPK YDHFLALHLV IADIAAILLH APTSLLKNAS VEIMDGDDWI VGDNQRQFLY QVLDTHDSKQ VIFLENDANT TEMARLTGAK ADDSLVFVMK PYSSQIVIDG IVAFDSSTIS TKAMSFRRTL HYEPAVLLHL TSWSEPIAGA DTDQNAHVST EPLEQTNLNS KASISSDYAW YGTDVKIDVV LSQVKLYIGT EKATALAVFI DGAFIGEANN HQHAEGPTVL SIEIESLAAG THRLAILCES LGYHNLIGRW GAITTAKPKG ITGNVLIGSP LLSENISLVD GRQMWWSLPG LSVERKAARH GLRRESFEDA AQAEAGLHPL WSSVLFTSPQ FDSTVHSLFL DLTSGRGHLW LNGKDLGRYW NITRGNSWND YSQRYYFLPA DFLHLDGQLN ELILFDMLGG DHSAARLLLS SIEESETSKF SDEVDFALAC I
|
| |