Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33901 |
Symbol | |
ID | 7197747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 626345 |
End bp | 628135 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178265 |
Protein GI | 219114939 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.530565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACCTC TCACCACTCA TACCACCCAT ACTGGTACCT TCGCCAACGG CACCGACGAC GGCGACAACA ACAACATGGT ACCCACCGTG GCAGATCGTG GGACGTGCGT TGCCATTCTC ACGGACGATC CCGACGCATC GAAGCCGTGG TATCAGCGTC TCGCATGGTG GTGCACGTCT CTGCCGTGGG TTTGGGCCAC AACGACGGTG TTGGTCTTGG TGGTGGTGGG TGTCACGCTG TTGACGATCC AGGTCCGTGC CGACTCGTCC ACCACTCCCG TGTATGATCC GCGTGCACAC GGAGAGGCCT ACAACCGTAC CGCCTACTAC GTCGCACTCC GGGATGTCCT CCTGTCGTCC TCCGACGATG CACCGGTCGT GTGGACGACT CCGGGATCGC CGATGGAACG GGCCTTGGCT TGGATGACGT TGGACGATCC CTTGGCACCC TTGCCCTTTT TCGAATCCCA CGCCACGTCC GACGAACAAG CAAAAGAATC CACCGCCACC ACGCTATACG CGTACGAACG CACGCGACTC CACCAACGCT TCGCCTTGTG TGTCTTGTAC TACACATGGG CCGGTCCAAC CTGGACTTTG GAACCGTCCC ACGGCTGGTT GCACCGACAT TCCGGAATCG AGTCCTCCGG ACGACTCGCG GACCAGTCGA TCGATGCGAC GCACGAATGT CTCTGGTTGG GGGTAACCTG CACGAGCGAC AACCGCACCA GTGACGATCA CCGCGTGGTC ACCGGTTTGG ACTTTGGAAC CTCGAATGCC GCACTCAAAG CGTATGGCAC CATTCCGGAG ATTGTGGGAC GCTTGACGCA TCTCCAAAAT CTTCTCGTCT TTGATCAGCA ACTGCAAGGA CCGTTGCCTA CGACACTCTT CCTACTCACC AATTTGCAAG CGTTGGACGT CAACACCAAT CGCCTCACGG CCATACCGGA AGCCTTGGGG GATCACCTCG TGCATCTCCA TACACTCCAC TTGTACGGCA ACGAATTCCG CGGGACCCTT CCCGCTTCTT TTACCCAACT CACACTTTTG GAAAATTTGC GGTTGGACGA CAATCCGGCT TTGGTCCAGG ACGATTTTTG GTCCACCATG CTCCCCTCCT GGCCTCTTTT GCGGACCGTC GTCACGTCCT CCACCGGGTT GGGCGGGAGT CTACCCACGG AAATTGGGAC GCTCCGTCAA CTGGCTACCG TCTCGAGCAA CTTTGCACCC ATTTCGGGAA CTCTCCCCAC CGAACTCGGG CTCTGTACCG GCATGGTGCA GTTCAACGTG AATCAACCCC AGGCGATGGC CGCGACCACG CTCGCTGGTG GTTTCCAGGG TACCTTACCA ACCGAACTGG GTCGATGGAG CAATCTCCGT TTTTTGGCCT TGCGGGGACA CGCCAATCTG GTGTCCACGT TGCCGTGGGA ATTGGCGTCC TGGACGAATG TGCAACTGCT CGATCTGGAC CAGACGGCGG TCCGGGGGAC TCTGCCCGCC TACGTGAGTC GCTGGTCGCA ATTGAATCGA CTAGTATTGT CGTCGACGGA TTTGACCGGC ACGATTCCGA GCGAATTGGG AATGTTGTCG GATACCTTGA TGTCAATGGA ACTGCAAGAC ACGGACCTGG TCGGGACCGT GCCGGTGGGT TTGTGCAACG GGGGTGGCGG CGTCGAATTC GTCATTTCGT GTGACGGCAG TGCGGGTCCG AACGGGAGGG ACGGGAACCA AACGACGACG ACGGCAGCAA AGGCGTTCCT CGTGTGCGAT TGCTGTCGGT GTTTGGAATA A
|
Protein sequence | MLPLTTHTTH TGTFANGTDD GDNNNMVPTV ADRGTCVAIL TDDPDASKPW YQRLAWWCTS LPWVWATTTV LVLVVVGVTL LTIQVRADSS TTPVYDPRAH GEAYNRTAYY VALRDVLLSS SDDAPVVWTT PGSPMERALA WMTLDDPLAP LPFFESHATS DEQAKESTAT TLYAYERTRL HQRFALCVLY YTWAGPTWTL EPSHGWLHRH SGIESSGRLA DQSIDATHEC LWLGVTCTSD NRTSDDHRVV TGLDFGTSNA ALKAYGTIPE IVGRLTHLQN LLVFDQQLQG PLPTTLFLLT NLQALDVNTN RLTAIPEALG DHLVHLHTLH LYGNEFRGTL PASFTQLTLL ENLRLDDNPA LVQDDFWSTM LPSWPLLRTV VTSSTGLGGS LPTEIGTLRQ LATVSSNFAP ISGTLPTELG LCTGMVQFNV NQPQAMAATT LAGGFQGTLP TELGRWSNLR FLALRGHANL VSTLPWELAS WTNVQLLDLD QTAVRGTLPA YVSRWSQLNR LVLSSTDLTG TIPSELGMLS DTLMSMELQD TDLVGTVPVG LCNGGGGVEF VISCDGSAGP NGRDGNQTTT TAAKAFLVCD CCRCLE
|
| |