Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50575 |
Symbol | |
ID | 7199360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 167551 |
End bp | 170054 |
Gene Length | 2504 bp |
Protein Length | 717 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185535 |
Protein GI | 219130780 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTC GTCTTTTTCG CAAACAAAAG AGTGCCGTGA GTTCCAAACA GAGCAGTAGT ACTCGCAGTA GTACCCAAAA GAGTCTCCCA CCCACATCGC GGGAAACGCC CGAGTCTCCG AGAGCGAAAC ACTCTTCCTC AGCATCCGCC GCTGTTTCCG CACTCTCCTC GCTTCCACCC TACGATGAGT CCGGCAAGAA ATCGAGCATG ATCTTGAATC CGCGGACAGG CGGCTACGAG GCTCCTTCCC AAGTAGCCTC GTATGAAGAT AAGCAATTGG ATGATGATGA TACCTTGCTG GATGCCTCCA TTTACACCGA CGGGTACACG ATTGAAGCAC CTTCTTCGCA GAAAAAGTCC AAATCTGCTC TCACGCCCAG GCAGCGGCTG GAAGCCTATC ATGAAACATC GAACGTCGCG TCTTGGGACC GTGCGTCGAC GGCTACTCCA ACATTCGCCC CTTCGACATT GCAGAGGAAG TCTCCAACCA GCCCATACAG TCCTCGGACT CGTACCTCGA CGGCTTCCAC TTCTCCCAGT GGTAGCTCTG TCGCAACCTC GCCCATCACT GCCGAAGGCA ATCAACCCCG GTGGCGGACG ATGACCAATC ACCGTGGAGA ACAAAGTTCA CTAACGCAAA AGGATATCCG GCGCTTGAAT GTGACCGGCC TAGCTGCGTC GACCTTTGAA AGCATTTCCA CCAACGGTTC TTTCGACACA GCCACAACCT CTCCGCAGCG TAATGTGAGC CCAGACAACA GCTCCATTCT GACACCGTTG GTGGGATTGG ATTGCCTGAT GACTACCTGC AGTGAAATGG ACGACGAAAA TTCCATCCTT ACCTTGGAAA ACCGGCAATG TCTCTACCCC TCCTGTCCGT CCTTTCCACA TGAAAGTACA TTGGCAGGGG ATTCGCCACC ACAACGACAG CACCAAACAT ATTCACTCGA TAGACAAAAA GGCAGCGAGA GCGGCATTGT AGATCCTGTT GCCGATGGGA CAGATGACTC CATATATACG CTGGAACACA ATGGAGTTTA TCAACGAGGA ACAGATCAGA ATACGATCAA GCCACGGGTC AAAAGGAAAG ACAAAGTGTC ACCTAAACGA CCGTCGGCGG CTCGTTCTAC TTCTATCCAG AATGATGCCG TCAAGACTAC CCCTCGCGAA AAGAAAAAAG GTCGATGTCG GACACTTCCT GTCGAGATTG CAGGGATGGA AGATGACAAT GAGGACGAGG GCACGTGCAC CATTCTCTTT GTTCCGACAA CGGAGGACAC AGGCTCAGAC CAAGTTGATC CTCTGTTTTC GTCACATGGA GAACAATTGT GTTCCATATC ACTGCAGCCT ACGGAAGAAA GAGACTCTTC CCTGAGAGGA CCGGTCGATT TTGATCGCGT CAGCAAGGCT ACCGTCGTAG ACACAAATTT GCGGAAACCT AATAAATCAG TCAAGATGCG ATTGACAAGG GAAGATTTGC AACGTCACGA ACGCAAATCG CTCAAGCAAG CGCTGGCGCA AAAGAAAAAT GCTGGTATAG AAGAAGATAC CCCTATGGAA GATGACTTTG TTTTGTGGAA GCGCGAAGAG CAAATTAAAA AATTGTACCA GCAGAAGCAA GGGGATCGCA ACATGCTATT GAAAGATAGA AAATCTTTGA CTTCCAGTCC TAACGCGAAA GAAAATGGTT CGAAAAAGCG AAGGAAGGAT CGAAGAAAGG ATGGTATCCT TTCGAGAGTT TTTACTGCGT CGCGCAAAGA ACCAGACCTC GGCGATTTTT CTTTGGCTCC GATGACAGCC ATGGAACATT TGAAGCTACA CGAAACAGTT GACCAAGCGC CTCCGACACG TGGATTTTTT CTCTTTCGCA GACGTAGCTC TGGGAATGTA TCCACAACCG CCCAAGCTCT GTTAGAGAAG GAGCGTTTGG CAGCTAAGGA ACTTCATAGG AAAAAGCTGA ATCGAAAGCA GCAAGACAAC CACAACGCGA GCAAAACGCA TCGAAATATT GTCCATGAGC AAAAAGTCAA AGCATTGGCA TTGAGCCCAG CAGCTCCGCG GGTCAACGAG GAAACTCCAA AGCGGAGCGG ACGATCCCAC ACTGACGCGA GAAAATCATC TGTTGCGAGT CGTAGCCTGC CACGCAATCG TTGACGATAT CCAGGACGCG CCATACTTCT TACGATCTAT CAACTCTTTC GAGGTCAATA TCAATACAGA CTTGTGTCCT GTGTCGCGCG TCGTCGCGCA CCCACATTGC CACTTGTTGC ATGCAATTGT CATTTTGCAA AGACTGTGCA GAAGCGTTGC AAGTTTCAAA GACCTGCCCT GTGTGCTCGG AAACAAATGT CAGATTTGTT TCTGTGGCGA TCAAAAGGAT TGTATTTATA GGGACGCCGA GACATAACAG TCCCACTTGC AGCTTTACCC TATTTAAAAG CTACATGTAT AGGTGCAGTT CTTTAACTCG ATTGTTGCAG TAATCATTTC TCATGTAATG ACCT
|
Protein sequence | MKLRLFRKQK SAVSSKQSSS TRSSTQKSLP PTSRETPESP RAKHSSSASA AVSALSSLPP YDESGKKSSM ILNPRTGGYE APSQVASYED KQLDDDDTLL DASIYTDGYT IEAPSSQKKS KSALTPRQRL EAYHETSNVA SWDRASTATP TFAPSTLQRK SPTSPYSPRT RTSTASTSPS GSSVATSPIT AEGNQPRWRT MTNHRGEQSS LTQKDIRRLN VTGLAASTFE SISTNGSFDT ATTSPQRNVS PDNSSILTPL VGLDCLMTTC SEMDDENSIL TLENRQCLYP SCPSFPHEST LAGDSPPQRQ HQTYSLDRQK GSESGIVDPV ADGTDDSIYT LEHNGVYQRG TDQNTIKPRV KRKDKVSPKR PSAARSTSIQ NDAVKTTPRE KKKGRCRTLP VEIAGMEDDN EDEGTCTILF VPTTEDTGSD QVDPLFSSHG EQLCSISLQP TEERDSSLRG PVDFDRVSKA TVVDTNLRKP NKSVKMRLTR EDLQRHERKS LKQALAQKKN AGIEEDTPME DDFVLWKREE QIKKLYQQKQ GDRNMLLKDR KSLTSSPNAK ENGSKKRRKD RRKDGILSRV FTASRKEPDL GDFSLAPMTA MEHLKLHETV DQAPPTRGFF LFRRRSSGNV STTAQALLEK ERLAAKELHR KKLNRKQQDN HNASKTHRNI VHEQKVKALA LSPAAPRVNE ETPKRSGRSH TDARKSSVAS RSLPRNR
|
| |