Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42003 |
Symbol | Cbs |
ID | 7201255 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 487016 |
End bp | 488548 |
Gene Length | 1533 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180648 |
Protein GI | 219119791 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGCA TCTGCGACGA TATTCTTGAA GCGATTGGAG GTACACCGCT GGTACGCTTG AATCATGTTG GAGCCGATTT ACCATGTGAA TTGCTTGCCA AGTGCGAGTT CTTCAACGCT GGTGGATCTG TCAAGGACCG AATCGGCCGA CAAATGGTTT TGGACGCAGA AAAGGCCGGT AAAATTAAAC CTGGCGACAC CTTGATTGAG CCTACGTCCG GTAATACTGG AATTGGGCTA GCCTTAACAG CTGCCGTTCG GGGATACCGC TGTATTATTA CAATGCCCGA AAAAATGTCG AAAGAAAAGG TAGATGTTCT GAAGGCCCTG GGGGCCGAAA TTATCCGAAC GCCGACCGAA GCCGCGTACG ACGCACCGGA TTCACACATA TCCGTCGCAC GTCGTCTACA ATCTGAAATC CCCAATTCAC ACATTCTGGA TCAGTATTCG AATCCATCGA ATCCGAATGC ACACTACTAC GGGACTGCGG AAGAAATTCT ACGCCAAACG GGTGGCAAAG TTGACATGTT GGTGGCTGGA GCCGGTACAG GCGGGACGCT AACCGGTATT GCCAAGCGTT TAAAAGAACA CAATCCGGAT ATTCAAATTA TTGGTGTCGA CCCAGAAGGT AGCATCTTGG CCATTCCGGA TTCTCTCAAT GACAAACGTC GTTTGGAATC GTATCACGTC GAAGGCATTG GCTACGATTT CATTCCCAAC GTTCTTGATC GCAGCGTTGT CGATCATTGG TACAAGTCCA ATGATGCCGA AAGTTTTGTT GCAATGCGGC GCTTAATTCG AGAAGAAGGT TTACTCTGTG GGGGTAGTTG TGGTGCGGCC GTCGCGGGCG CGCTCAAGGC TGCGCGAAGC TTGAAAGCTG GACAACGCTG CGTCATCATT TTGCCCGACT CTGTCCGCAA TTACATGAGC AAAGGTCTCA ATGACGATTG GGTCCGTGAC AATGGATTCG CGGATGGAAA AATTATCAAG GCCAAGTCAT ATTCTTCTTG GTGGGCCACG AAAAGAGTTT GTGATTTAAA TCTCAGCATT CCTTTGACAA TCACCAGCGA TGTCAGTTGC AAGGATGCTA TTTTGCTGTT AAAACGAGAG GGTTTCGATA TGGTGCCAGT CTTAGACGAC GGGAATGTCG TGGGTGTTGT GACGGAGGGT AACATGACGA GCAAGTTGCT ATTAGGACGA TGCGATCCCG ATACGTCGGT AGCGGATGCA GGTGTCATCT ACCACACGTT TCATAAGTTC AGCATGAGCG ATACGTTGGA TGAGTTGGCT CAAGCTCTGG ACCACGATCC GTTTGCTTTG ATAGTGACGG AGCAACGTTG TTTCTCGGTG GCCTCAAAGA AACGGAAACC GACTGTGAAT GGAGATGGAA ATGTAGAAGC TTTGTCGGAG GAGAGTTCGA CAAAGGCAAA TCATTCCGAA GTTGTGACTC GCAGCGTAGT CAGTGGCATC GTTTCCCGGA TAGACTTGCT GGACTTCATT AGCTCAGACG CCAAGCATGA ACTTGAAAAA TAG
|
Protein sequence | MDRICDDILE AIGGTPLVRL NHVGADLPCE LLAKCEFFNA GGSVKDRIGR QMVLDAEKAG KIKPGDTLIE PTSGNTGIGL ALTAAVRGYR CIITMPEKMS KEKVDVLKAL GAEIIRTPTE AAYDAPDSHI SVARRLQSEI PNSHILDQYS NPSNPNAHYY GTAEEILRQT GGKVDMLVAG AGTGGTLTGI AKRLKEHNPD IQIIGVDPEG SILAIPDSLN DKRRLESYHV EGIGYDFIPN VLDRSVVDHW YKSNDAESFV AMRRLIREEG LLCGGSCGAA VAGALKAARS LKAGQRCVII LPDSVRNYMS KGLNDDWVRD NGFADGKIIK AKSYSSWWAT KRVCDLNLSI PLTITSDVSC KDAILLLKRE GFDMVPVLDD GNVVGVVTEG NMTSKLLLGR CDPDTSVADA GVIYHTFHKF SMSDTLDELA QALDHDPFAL IVTEQLSGIV SRIDLLDFIS SDAKHELEK
|
| |