Gene Information |
Locus tag | PHATRDRAFT_44530 |
Symbol | |
ID | 7197786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 817591 |
End bp | 819648 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178311 |
Protein GI | 219115031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
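The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 817591
end_bp = 819648
protein_length_aa = 685

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as the record says) and the last
# codon is a stop codon, which is not counted in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp)   # 2058
print(coding_codons)    # 685
```

Under these assumptions the computed values match the reported Gene Length (2058 bp) and Protein Length (685 aa).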
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCGG CCAAGCTGTC ACGAGGGAAG AAATGGCGTT GGCGTCTAGT TGTGATGGCT ATTCTGGCAA GCCTGGCGCT TTCGGTGGTG AGGCGGCTCT TTTTTCAAGA ACGTCAAACG GTACAGTGGC TTCCACCTTT GGCCATCGAG AATCCTACAT ACTTTTCCGC ATTCGAGGAA GAACGCATGC CTACCTCATC CAGCGATGAA GCATGTCAAA CTCCACTTGT AGAGGACACG AAAGAGGATT CCCGGAGCGA ACTGCTGGTC GACGGCTCGG ACTTTGCTGC CGCAGCTCGC GCCCGCACAT CCTCCCACCC CTTCTGCCAA AGATTCCGGT ACGAACAGTC AGCAAGTGCG CTCTGGTCCT CGCATCGAGA AGCGATTATG AATAAAACCA ATCATGCCAA CGATGAGGAC GGGCGACATC GACCATGGAT AGAGAAACTC TTTGACACGG TGAGTCCCTT CTTGTTGCAA CGCGGACTTC GCAATCCACC GAATGCGGCG GATACGAAAC GCGTTTTAAA TATTGTGGAA ACCAAGCTAG CCAATTCATC GGCGCCGCCC TTGTATGTCG CTGTATTTGG AGGATCAGTG GTGGAAGGGA CTAATTGCGA TTTTATTCCC CCGGCTGCAC TCGAACTAGT GACGAACCGT TCTTTACTCA GAGAAAGAAA GAGAAATATG TACGATACTG TTATAAAGGG AAGAAGCTGC ACTTGGCCCA ATCGATTGCA AGCCCTAGTC GATTACGCAC TGGGCGAAGG AGTCGTGAAG ATTTACAACC TCGGTGTCGG AGGCACCAAT TCACAACTGG CGGTCCCGAT TGTCAAATAC CGTCTCTACT CGGGTGCGGC GGATCTAGTG AATATAGGTG GACCCGATGT TGTCATCAAT GGATATGCCG TCAACGATAA TGCCTATTTT TCGAGCACTG CCGCGACGGC AACGTACACT CACTTTAACG CATCGCTCGC TCGCGCGGAG GACTTCATTC GCGCGGTTTG GAAGTCGCGC CCCTGTCACG ATCCACCAAT GGTTTGGTTT TTTGACGAGC ACTTTGGTAA TTACATCGAA AGTCTCCTCG GTGAAGACAT ACAAAAGGAT GCGGTCCGTC TCTTGGCGGA CTACTATGAG CTTGGTTACG TCAGTTCGTC CTTTTCCGTG CGCTCTTTCT TCCTGTCTGA CCCGGACGAA ACCTTGTTTT CTCCTGATTG GGAAGATCCC GTCAAAAAAT CTCGCATCGT GGACGGGCAT TTTGGAATGC CCGGGCATGT TCACGCATCG TGGACCTTTG CATACGCAGC ATTGCAAACC GTACTCGACT ACTGCGCCGA TCATGCCTTC ACGGATAGTA TATCTACAAT ACGGTTACAG GACGGCCAGT TTACCTCGCA GCTCGCCGAC ATGCAAACGT TGCTGGATCG TACCTACCTG CCGCCACCAC TCGGTCAGGA CACATTGTCG CTTGTGAATA TTGCGAAAAC TTGGCGAGCA ACTCAGCAAC AGCAGTATCG TAGCGAAAAA CTCCTTTGCG AAGACGAAAT GAATACGCAA GGCACATTGT GTCCCCTGGC CTTCGTGGCA ACACCCGTGG GTACCACTCG TCAAGCCCGT GCTGTGGATG ATTACTTAAA ACCCTTTGTG ACAGTCAACA CTGGCTGGTA TGGCAGAAAT GATATCCGCA ACGGCTGGCA AAATAAGATC GGTCTCATGC CCACCGGTAT CGGTGCTTCA ATTATCTTGT CACTCGAGCA CGTCACAACT CTCATTCAGA TGATTTCCTT GCAGACCTTA 
AGAAGCTACG GTGATCCCTG GGAGGGATCG GAAGCATTGT TTGATTTGAC TATTATCCGA GGCAATCCAA ATTCCACCGA CTTCCGGGAC AATTTTACCA TTCCTGCCTA CCACGACGCA AACATGAGCG TTTCCTATTT GTACGAGCAC GACTTAGGAA AAAATCGAGC CGTTGCTGGT GACAGTCTTA CTCTGAATAT AACGCTGGTT GCTGGTTCTG CTTTCAAACT CATCGCATTA ATGATGTGCA GCGGCTAG
|
Protein sequence | MASAKLSRGK KWRWRLVVMA ILASLALSVV RRLFFQERQT VQWLPPLAIE NPTYFSAFEE ERMPTSSSDE ACQTPLVEDT KEDSRSELLV DGSDFAAAAR ARTSSHPFCQ RFRYEQSASA LWSSHREAIM NKTNHANDED GRHRPWIEKL FDTVSPFLLQ RGLRNPPNAA DTKRVLNIVE TKLANSSAPP LYVAVFGGSV VEGTNCDFIP PAALELVTNR SLLRERKRNM YDTVIKGRSC TWPNRLQALV DYALGEGVVK IYNLGVGGTN SQLAVPIVKY RLYSGAADLV NIGGPDVVIN GYAVNDNAYF SSTAATATYT HFNASLARAE DFIRAVWKSR PCHDPPMVWF FDEHFGNYIE SLLGEDIQKD AVRLLADYYE LGYVSSSFSV RSFFLSDPDE TLFSPDWEDP VKKSRIVDGH FGMPGHVHAS WTFAYAALQT VLDYCADHAF TDSISTIRLQ DGQFTSQLAD MQTLLDRTYL PPPLGQDTLS LVNIAKTWRA TQQQQYRSEK LLCEDEMNTQ GTLCPLAFVA TPVGTTRQAR AVDDYLKPFV TVNTGWYGRN DIRNGWQNKI GLMPTGIGAS IILSLEHVTT LIQMISLQTL RSYGDPWEGS EALFDLTIIR GNPNSTDFRD NFTIPAYHDA NMSVSYLYEH DLGKNRAVAG DSLTLNITLV AGSAFKLIAL MMCSG
|
| |
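The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
head = "ATGGCTTCGG CCAAGCTGTC ACGAGGGAAG"
print(translate(head))  # MASAKLSRGK

# Last six codons of the gene sequence, ending in the TAG stop codon.
tail = "ATGATGTGCA GCGGCTAG"
print(translate(tail))  # MMCSG
```

The two translated fragments agree with the start (MASAKLSRGK) and end (MMCSG) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.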