Gene Information |
Locus tag | PHATRDRAFT_48580 |
Symbol | |
ID | 7194740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 260238 |
End bp | 261668 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183060 |
Protein GI | 219125592 |
COG category | |
COG ID | |
TIGRFAM ID | |
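Note on the coordinate fields: the gene length and protein length listed above are consistent with the start and end positions if the coordinates are read as 1-based and inclusive, and if the unspliced CDS ends in a single stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check, with hypothetical helper names:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(260238, 261668)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)
```

With the values from this record the sketch gives 1431 bp and 476 aa, matching the Gene Length and Protein Length fields.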
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000300421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCTG CGTCCGCCCT CAAATGCGCA TCCTATGAGG GAAGGAGTGG CGTAGCTGCC GCGGTCCGGC TGGTGTCGTC AATGCGTACT CGCAACTCCG AGACGATCGG TGCATTGTCG CCGTACTGCT ACGATCACAC CCACTTGCCA TTGTCTACAA CAAGGGCTCA GGCTTTCTCG GCTGCTTCCG CCGTGAGAGA CTTTGCAGAT CCGCAGGGGG CGGCCCGCAA ACACCGTCCG ATTTACATCG CGGCGACGAG GCAGCACGTG GGCAAGACCT CGGTCAGTCT CGCACTGATG AAAGGATTGC AGCGCCGTGT TCCCAAAGTA GGTTTTCTAA AACCCGTTGG ACAGCATTCC GTACGAATTG CCGAGCTCGA CGGCAGTGTC GTGACGGTCG ACAAGGACAC CGCTTTGATC GTTCAACACT TTGGACTTAC ACGGCATCAG ACATTGCAAG ATGCCAGTCC AGTCCTCATT CCGCCGGGGT ACACAAAAGA TTACGTTGAC GGGAAAATTA CGCTCGATAC TCAACGTGCC TCCATCGGAA AAAGTTTTCA ACGCGTCGCT TCCTTTGCCG ATATTGTCCT CTGCGAAGGA ACCGGACATT GCGCCGTCGG TAGCATCGTA GACGCCAGTA ACGCCGCCGT CGCGTCTTGG CTCGGCGCCC GGATGGTCCT TGTGGCCAAC GGTGGTCTGG GGAATTCTGT GGACGAACTT GAACTCAATA AGGCCTTGTG CGACAAACAT GGGGTTGAGA TTGCCGGCGT TATCATCAAC AAGGTCTTGC CCGAAAAGTA CGAACAGACA AAATACTATC TGGAGAAAGC ATTGCACGAT CGGTGGGGTA TTCCCTTGCT GGGATGCGTT CCGGATCGGG CGTTTTTAGG ATGTCCCGCC TTGGCCGATC TAGAGCGTCT CTTCCCCGGC GCGATGTTAG TTTCTGGGCT CGATCATCGA CTGCGACATT ATACGGTGCA AGATTTGAAC CTCGTCGCCA CATCGCTCGA AGTCTTTTTG CGCAATCTGC GAACCGATCC CTCCCGCACG CTTTATGTTT GTCACGCTTC CCGAAACGAT ATCTTGCTCG GCTTCCTTAT GGAAAGTCAG CAACGGCCGG ACTGGGAAGC CGCGTTAGTT GTCACAGGCT GTCACGATTA TCCCGTCAGT GACCAGGTTT TGCAAATCAT CACTTCCATG CCTTCGGCCC CACCGGTGCT CTTGGCATCG CCACCGACGC GACAAGTCAT GCACGATATA CACCACTTTA CCCCAAAATT GAATTTTGAG GATGGACACC GCGTCGAAGC CGCTGCCGCT CACTACGAAC CCTACATTGA CTTCGATCTT CTTTTGTCGA GGGTCGGAAC GACGTCTACT GGCTCCTCGA AATCTACGTC GAAAGCCGGC CTTGCAGTCG CAGTGCCGTA G
|
Protein sequence | MIAASALKCA SYEGRSGVAA AVRLVSSMRT RNSETIGALS PYCYDHTHLP LSTTRAQAFS AASAVRDFAD PQGAARKHRP IYIAATRQHV GKTSVSLALM KGLQRRVPKV GFLKPVGQHS VRIAELDGSV VTVDKDTALI VQHFGLTRHQ TLQDASPVLI PPGYTKDYVD GKITLDTQRA SIGKSFQRVA SFADIVLCEG TGHCAVGSIV DASNAAVASW LGARMVLVAN GGLGNSVDEL ELNKALCDKH GVEIAGVIIN KVLPEKYEQT KYYLEKALHD RWGIPLLGCV PDRAFLGCPA LADLERLFPG AMLVSGLDHR LRHYTVQDLN LVATSLEVFL RNLRTDPSRT LYVCHASRND ILLGFLMESQ QRPDWEAALV VTGCHDYPVS DQVLQIITSM PSAPPVLLAS PPTRQVMHDI HHFTPKLNFE DGHRVEAAAA HYEPYIDFDL LLSRVGTTST GSSKSTSKAG LAVAVP
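Note on the sequence fields: both sequences are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the stop-codon set assumes the standard genetic code, since the Translation table field in this record is empty.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def clean_sequence(blocked: str) -> str:
    """Strip the whitespace used to display the sequence in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon, and is in frame."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input for illustration only; paste the Gene sequence field to check the real record.
toy = clean_sequence("ATGGCC TAG")
print(len(toy), round(gc_fraction(toy), 2), looks_like_complete_cds(toy))
```

Applied to the Gene sequence field, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.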