Gene Information |
Locus tag | PHATR_33633 |
Symbol | |
ID | 7204073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1422369 |
End bp | 1424825 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186250 |
Protein GI | 219113333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0303682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATACG CGCTTTCGAT CATCATGGGT CGAGCCATTC CCGACGCACG CGACGGTCTC AAGCCCGTAC ATCGCCGAAT ATTGTATGCC ATGGACCAGC TTTCCCTCTA CCCAAATACA GGACACCGAA AATGCGCGCG TGTTGTTGGG GAAGTCCTGG GCAAGTTTCA TCCGCACGGA GACATGGCTG TCTACGATGC TTTGGTTCGC TTGGCGCAAC ACTTTAGTAC GGCCTATCCA TTGATCGACG GACACGGAAA CTTTGGGTCC ATCGACGCGG ACCCCGCTGC TGCTATGCGT TACACCGAAT GTCGCCTGAC AAAATTGTCG CAAGCGGCTT TACTAGAAGA TTTACAAGAT GATACGGTAG ACTTTTTACC CAATTTTGAC GGAAACGAAA TAGAACCCGC AGTTTTACCG GCCAAGCTCC CGATTCTGCT ACTCAACGGT TCGTCGGGCA TTGCGGTTGG CATGGCTACC AACATTCCAC CGCACAACCT CAACGAGATT ATGACGGCCT GTACCGCCTT GGTAAAGGCG CGCCAAGGGG GCGCGGCAGT GACGGACAAG AAGCTCTTGC AAATGGTCCC TGGACCTGAC TTTCCCACCG GGGCGTCGAT TCTTAGTACC GGCGGTACTG AGAAATTGTA CACAACAGGC AATGGTGGGA TTGTCATGCG GGCAGTGACC AAAATCGAAA AAATTACGAC CGGCCGCAAA AGTTCCATCA CACGAACGGC CATAATTGTT ACCGAGCTAC CGTACCAGGT CAACAAGGCG GCATTGTTGG AAAAGATTGC GGGACTAGTC AATGAAAAAA AGCTGGACGG TATTGCGGAT CTACGCGATG AATCCGATCG TGATGGAATT CGGGTCGTCA TTGAGCTCAA ACGAGACGCC GTCGCGGCCG TAGTCTTGAA CAACTTGTAC AAAAAGACAC CCTTGCAGAC AACTTTCTCT GGGAACTTTT TGGCGCTGAT GACGGCGAAT CGTGACAGTA GCAGCAGCCT AGTACCACAA CGATTCACTC TTCGCCAAGC CATGGACTGC TTTCTTGACT TTCGCTTTGA AACGATTCGT CGCAAGTCAC AATTCCAATT GACCAAAGTC AACGCCCGTT CCCATATTGT TGCAGGCTTG TTGATGGCAC TGGACAAGGT CGATATGGTC ATTCAGATTG TTCGTGCTTC TGCGGATCAG CAGGCTGCTC GAGAAGCTTT ATACATTGAG CTCGGCACTT CATCAGAACA AACTGACGCT ATTTTAAAAT TGCAACTTGG GCAACTTACT CGATTGAACA AGGGCAAGCT CGAGTCGGAA AAGGCCGACC TCGAAGAATC TCGTGAAAGT TTAACACATC TCCTGGAAGT TGATGATGCG GTTTACGATG TCATGAGGGA AGAGTTCATT GACATGATGA AGAGATTTGG CGGTGAACGA AAGTCAAGCA TCATTGTTGA AGATGACGGC GATTTTTGCG ATATGGACCT TATCGAAAAC TCGAGATCGG TTATTGTTGT CACTCGTGGT GGCTACATCA AACGCATGCC GTTGAAGACA TTCGAGAGTC AAGGAAGAGG CACTCGCGGC AAACGCGGCA CTTCCGATGG TGGAGAGTCA GCTGACAGTG AAGTGGCACA TTGCTTCACG TGCAACGACC ACGACACGCT CTTAATGGTT ACCCAGAATG GTATTGCCTA CGGGTTACGG GCGTACCAAG TTCCTATCGC GGGCCGTACG GCAAAGGGAC AGCCCATTCC ATCCGTATTG CCAGTTCGCG GTGACGAAGT TATTACGGCG 
ATCCTCCCGG TGTCCGAGTT CTCTGACGAA GAATACGTTG TTCTGACAAC AGAACAAGGC TGGATTAAAC GAACACCGTT GGATGCTTTC GAAAAATTGT CGAGCCGTGG CTTGACCATC GCTACTTTGG AAGATGGTGA TCGCTTGAAG TGGTGTCATC GAGTTCGAAA CGAGGATGAC ATCTTAATCG GCACTGTAGG TGGGATGGCG ACTCGATTCG GAGCTGCCAA GCTACGACCC ACTGGTCGAA CGAGTCGAGG CGTGAGGGCG ATGAAGCTCC GGGAGGGCGA CACAATTGCA GATATGAATG TACTCAGTGG CAAGAACAAG GAGTACATTT TAACAGTAAC TGCACAAGGC TACGGAAAGC GTATTGCGAC GAGCGAGTTC CGGGCCCAGG CTCGTGGCGG AGTTGGTGTA ATTGCCATTA AGTTTAAAAG AGGGCAGGAG GAGGACAAGG TAAGCTGCCT CCGAATTGTA AAGGACGACG AGGAAATATT GGTTATTACA GCAAGAGGGA TAATGGTCCG ACAGAAAGCG TCCGATATTC CGTCACAAGG TCGATCTGCG ACTGGCGTTA TGGTACAGCG CGTGGACGAT GGAGACCACA TATCTAGTGT GAGCATCGTA CCACAATACG AAGAAATTGA CGGCTAA
|
Protein sequence | MQYALSIIMG RAIPDARDGL KPVHRRILYA MDQLSLYPNT GHRKCARVVG EVLGKFHPHG DMAVYDALVR LAQHFSTAYP LIDGHGNFGS IDADPAAAMR YTECRLTKLS QAALLEDLQD DTVDFLPNFD GNEIEPAVLP AKLPILLLNG SSGIAVGMAT NIPPHNLNEI MTACTALVKA RQGGAAVTDK KLLQMVPGPD FPTGASILST GGTEKLYTTG NGGIVMRAVT KIEKITTGRK SSITRTAIIV TELPYQVNKA ALLEKIAGLV NEKKLDGIAD LRDESDRDGI RVVIELKRDA VAAVVLNNLY KKTPLQTTFS GNFLALMTAN RDSSSSLVPQ RFTLRQAMDC FLDFRFETIR RKSQFQLTKV NARSHIVAGL LMALDKVDMV IQIVRASADQ QAAREALYIE LGTSSEQTDA ILKLQLGQLT RLNKGKLESE KADLEESRES LTHLLEVDDA VYDVMREEFI DMMKRFGGER KSSIIVEDDG DFCDMDLIEN SRSVIVVTRG GYIKRMPLKT FESQGRGTRG KRGTSDGGES ADSEVAHCFT CNDHDTLLMV TQNGIAYGLR AYQVPIAGRT AKGQPIPSVL PVRGDEVITA ILPVSEFSDE EYVVLTTEQG WIKRTPLDAF EKLSSRGLTI ATLEDGDRLK WCHRVRNEDD ILIGTVGGMA TRFGAAKLRP TGRTSRGVRA MKLREGDTIA DMNVLSGKNK EYILTVTAQG YGKRIATSEF RAQARGGVGV IAIKFKRGQE EDKVSCLRIV KDDEEILVIT ARGIMVRQKA SDIPSQGRSA TGVMVQRVDD GDHISSVSIV PQYEEIDG
|
| |
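The coordinate and length fields in the record above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced (the record lists "Is gene spliced | No"), and that the gene sequence includes a terminal stop codon (the listed sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for PHATR_33633.
length_bp = gene_length(1422369, 1424825)
print(length_bp)                  # 2457
print(protein_length(length_bp))  # 818
```

Both results match the Gene Length (2457 bp) and Protein Length (818 aa) fields listed in the record.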