Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44774 |
Symbol | |
ID | 7199739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 206949 |
End bp | 208643 |
Gene Length | 1695 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178951 |
Protein GI | 219116312 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0253106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGTCAGA AAATAGCCTA CAGAAGGCCG ATGGACCGGT ATTTGGCCTT TCCACATCAG TCAACAGGCC CAGGTAGGCT TTTCACCTTC GTTTCGCTCG GCATTTTTCC AAACACCCTT CCAAGCGCAG ATAGTGTTGG ACTCTCTCTA GCCGACACAG ACCAACATGA AGGGGGAGAC GATCTACGAC GTTTGTATTG TTGGAGCAGG GCCTGCCGGA CTGGCTTGTT TGTCGGCTGT CCAGGAGCCC TTTACTCTGG ATACTTTGAG CGACGATCAA GTGCACCGAG CGTTGCGGGC CTTACCCGCA AAGAAAACCG CGCATCGCCG AGTTTGCGTG GTTGATCCAA ATCCAACATT TATGTGTGGT TGGAAACGCC AATTCGACGT GCTTCAGATT AAGTTTCTCC GTTCTCCGGC ACTGGCGCAT CCCGATTTTT TTGACCAGAA TGCGCTGTTG GCTTACGCAC GACAGCACGG TCGCGAAGAC GAGCTTTTGG AATCGGGTTG TGCTCGTCTG CGTTCATTAT TGCCTTTGGG GCAATCGCAG ATCGGTTTGT GGAAACTGCC GTCGACGCAA CTGTTCCAAG ACTTTTGTCG AGACCTGCTG CCCACTCTGC GTCACGAGTA CCGGCAAGGA ACGATTGTTG ACCTACAGTC TACGGATGGC AATATGTTCT GCTTGACCAC CGCTGTGGGG CAGCAGCTTC ACGCCCGGAC CGTGATTCTG GCCATGGGCG CGATTGGCAA GCCCGTCATA CCTGCGGCTC TTCAGAAGGA CATTCCGCAG GTGCATTCGT GGATGGAGAT TGACCGTCTC GAAGCGCAAT GGGCGTCCAA ATTCTCGGTG GACAACGAAA CCGTCATGGT GGTGGGAGGA GGTTTAACCG CGGTGCAAGC GGCATTGCGA CTGGCACGCA TTCCGACGGG CGGTAGCCAA CGTCGTCGTC GTGTGATACT CAGCAGTCGC CGACCACTGG TGGAGCGGCA CTTTGATCTG GGAGTCGAAT GGTTCGATCG ACGGACGGCG ACCAAGTGTG TCGCTGACTT TTATCACCAG CCCGTATCGG AACGGTTGGC ACAATTGCGA GAGAGTCGCG GAGGAGGAAG CGTGCCACCC ATTTACATGG CCGAATTGCG TAAGTGGGAA CGACAGGGTC GGATCGAATG CGTGGTGGGC TCTATCGATT CCGCCGCAGC ACAAAGTCAG GCGTCCGGAG ACGCACGCGT CCAAGTCACG ATCGAGTCCC AGAAGTACCG GGTGGACGCC ATTGTGCTTG CATGCGGCAA TCAACCCGAT TGTCTCGCAC ATTCGCTGCT GCTGCGTCGT ATACAGGAAC AGTGGCCGGT GCCCATGCAG GGTGGCTTCC CGTGCGTAAC GGAAGATTTG CGATGGAGCG CCGCTTGTTC CGGTTTGTAC GTAGTGGGTG GCTTGGCGAG TCTGAACGTG GGACCCGACG CGTCCAATTT GATGGGCATT CGACGAGCGG CGTCCATTGT CTCGTACCAC GCACTGGCGT GTCGTAGTTG GTTGCGTGAA GCTCAGTCAC CGGTCCTGGC CAACCGATTC CACGCACTGT GGTCGGACAG TGAGAGTGAG TGCGAGTTCG AATGCACGTG TGAGAGTGGT GATACCGACA GTGAAACCGA CGATTCCTGC ATGTTGTTGG TTCCTGCATA GTATGTGGAA TCACA
|
Protein sequence | MKGETIYDVC IVGAGPAGLA CLSAVQEPFT LDTLSDDQVH RALRALPAKK TAHRRVCVVD PNPTFMCGWK RQFDVLQIKF LRSPALAHPD FFDQNALLAY ARQHGREDEL LESGCARLRS LLPLGQSQIG LWKLPSTQLF QDFCRDLLPT LRHEYRQGTI VDLQSTDGNM FCLTTAVGQQ LHARTVILAM GAIGKPVIPA ALQKDIPQVH SWMEIDRLEA QWASKFSVDN ETVMVVGGGL TAVQAALRLA RIPTGGSQRR RRVILSSRRP LVERHFDLGV EWFDRRTATK CVADFYHQPV SERLAQLRES RGGGSVPPIY MAELRKWERQ GRIECVVGSI DSAAAQSQAS GDARVQVTIE SQKYRVDAIV LACGNQPDCL AHSLLLRRIQ EQWPVPMQGG FPCVTEDLRW SAACSGLYVV GGLASLNVGP DASNLMGIRR AASIVSYHAL ACRSWLREAQ SPVLANRFHA LWSDSESECE FECTCESGDT DSETDDSCML LVPA
|
| |