Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47724 |
Symbol | |
ID | 7202902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 641412 |
End bp | 643378 |
Gene Length | 1967 bp |
Protein Length | 646 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181946 |
Protein GI | 219123260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.642125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACGG CTCTGGGAAA GGAAGACGAG GGTCAAATCC CACGTCAACT GGATGCCGTT ACCGAGTCGG AGCAGACCGA TCAACGAAAG AGCAATGCGG TCGCGGCAGA AAGCGAGGAG TTTCATCGAG CACCGCAACG CCAGACGCTA CATATGCGGA AGGAATCCAA AGAGTTTCTG AACACTATGC ATTCCTACTT GGATTGCAAA CGAAAATTGG AAGCCTTGGC AGGCAAAGTG GACCCCTGCC GAACACTGCA CGGCATGACG CTATATACGC ACTCGGGCGG AATACGCGAG GCGGACGGAA CCTACAAAGC GACTTATCGG CAGCAAAACC AATCTGGAAA CCCGGTTCGA GACTCGGAGG AACTCCTGCA AGCCGCCACG GTGACACGAC CTCTCTTTTT GGCGCTCATG CAACAACTAG CATCCCAAAT AATGCAAAAA CGACAAGATC TGCGCCGGGA CGGTGCGCAA TTCAAGGCAC TGGTCGAACT TCCCCTCAAA GCCTGGTCGA GGATTGTGGA AAAGTCACGG GACGACTACG CGAAACGGAC ACCGGGTCCA CCGGAAGCAT GGCTCTACGA TCTCAACCGC GCTTCCGTTC TTTGCCCTGA CGTTGCCACC ATGGACGCCG TCGTTACGTG GCTTTACAAA AATACATATA TTGTTACGGC CAAGAATCGC TTTCATCGGC CCACTATCAG CGGATACCGT GACTTGCTCT TTGTTTTACG GATACCCGTG AGACCCACGG ATCAACAAGG CGGTACCGGT TCACCCGCCA ATCCTGTTCT ATTTTTTCAC ATGTGCGAAC TGCAAGTGCA TCACGAACAA ATTTGGCAAC TTAGCAAAAC ACTGGAGACG CACAGGCTAT ATCGATACTT TCGCTCCTAC TTTTGCGGCA ACGACGAAGC CAGTGTCGGG AAGCTGATGA CAGAACTGGT TGAAATTGAA GATTCTGGCA AACTCGATAC GATGGTTGTA CAGCGAGTGC TGGAATCGGA CGACGTCCCG CGGATACGCA AGCTCACCGA GCTCTTTCAA CATCTGCCAG AGCACGATTT TGCCCTACTC TTGTCCCAAC GCGCTCTCGA CATTCACGTG GCGGCCGAAC CGGATGCTCC TAGCCATAGC AAGCATGAAA CCCACCACAG TCATCGTATC AACGTTGCCG AATCGTACAA CCATATTGGC GACATACTTT TGGCTAAGGG AGAGTATCTT AGTGCCTTGA CGCACTATCG ACAGGCACTC GATATTTACC TACAGACGCT TGGCCCACAG CACGTCCAGA CGGCAGCGAC ACACAACGCG ATTGGTCTAG TGCTATCCAA TCAAGCATCG TACGACGAGG CCTTGATTCA TTTCGAAAAA GCACTCGCAA TTTTTTCAGC GCCCGGGGGA GACAGCAGCA GCAACAATAC CGACCAGAGT ATCGCCAAAA ATGCCCATCC CCAGGCCGCC ACCGCTTGGA TTTACATTGG TCACATTGCC CAGGCTCGTG GAAATCTGGA GCAAGCTCGG GAGAATTACG AAAAAGCGCG TGCAATCGCT TCGGCGCTGG AGGAACAGCT ATACGGAACG AAACAGACTC TACTCGCCAC GACAGAAACG AATTTGGGAA TTGTAAGCTA CCATCAGGGG GACCTGGACG ATGCCTTGGT GCACATGCAA CAGGCATTGG CTACCCGTCA AGCTGTTCTG GGACGTAAGC ATCGTCAAAC TGGTCTAGTC CACGAGGCAT TGGGAACAAT TTGGCGTGAT CTGGGAAATT TGGAAACCGC CGCTGAACAC TTCTGTCAAG CGAACAGCAT TGATCAAGGA ACAACATGTG ACTGGGAACG CTCGCTATTG AAGAAACGGT TGCATTGGAG CACAGGGGGA CTCTCTATCG ATGAGAAAGT GAGCCGGACT TCTACAGAGG AGTTACCGTA GCTCTCAGGG ATAGGTGCAC GACTGAT
|
Protein sequence | MGTALGKEDE GQIPRQLDAV TESEQTDQRK SNAVAAESEE FHRAPQRQTL HMRKESKEFL NTMHSYLDCK RKLEALAGKV DPCRTLHGMT LYTHSGGIRE ADGTYKATYR QQNQSGNPVR DSEELLQAAT VTRPLFLALM QQLASQIMQK RQDLRRDGAQ FKALVELPLK AWSRIVEKSR DDYAKRTPGP PEAWLYDLNR ASVLCPDVAT MDAVVTWLYK NTYIVTAKNR FHRPTISGYR DLLFVLRIPV RPTDQQGGTG SPANPVLFFH MCELQVHHEQ IWQLSKTLET HRLYRYFRSY FCGNDEASVG KLMTELVEIE DSGKLDTMVV QRVLESDDVP RIRKLTELFQ HLPEHDFALL LSQRALDIHV AAEPDAPSHS KHETHHSHRI NVAESYNHIG DILLAKGEYL SALTHYRQAL DIYLQTLGPQ HVQTAATHNA IGLVLSNQAS YDEALIHFEK ALAIFSAPGG DSSSNNTDQS IAKNAHPQAA TAWIYIGHIA QARGNLEQAR ENYEKARAIA SALEEQLYGT KQTLLATTET NLGIVSYHQG DLDDALVHMQ QALATRQAVL GRKHRQTGLV HEALGTIWRD LGNLETAAEH FCQANSIDQG TTCDWERSLL KKRLHWSTGG LSIDEKVSRT STEELP
|
| |