Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18036 |
Symbol | |
ID | 7197081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 103947 |
End bp | 105235 |
Gene Length | 1289 bp |
Protein Length | 385 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177864 |
Protein GI | 219112225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCATACAAT CACATCCCGT AGTCTCGGGC AAGGAACGCA AGGATAGCAA ATCCTTTACC AGTACCGACA GCTCCCACAT CAACAACACC GTACGTGCAG TAGCAGTCAC AAAGGGTCGT TACAGTACAG CATGGTAGTA CGAACCCCGC GCCAGGGCGC GTTTGTACGT CTATGGCAAA CGTGCATCGT CTTTTTGTTG CTAGGCAGCC ACGTGTTGCT CGTACCGGTA GTGCAGGCTG CATCTCGCGA CTACTACCAA ATTTTGGGAG TATCACGCGA CGCCACCATC AAGGAAATCA AAAAGGCCTA CCGGCAAAAG TCGCTCGAGT TTCATCCCGA CAAGAACAAG GACGAAGGTG CGTCGGAGAA GTTTGCCGAA GTGGCTCGGG CCTACGAAGT GCTTTCGGAC GACGAATTGA AAGCCGTCTA CGATCGACAC GGGGAAGACG GTTTGAAGCA ACGCGAACAG CGTGGTGGGG GAGGAGGAGG AGGAGGCTTT GAAGATCTTT TCTCGCAGTT TGGCTTTGAC TTTGGCGGTG GACGACAGCA GCGCGATCAA GAACAGCGCA CGCCGGATGT CGAAATTCCA CTCTACGTGT CACTCAAACA GTTATATCTC GGTGAAACCA TCGATGTCGA CTACGTCCGT CAGGTACTCT GTTTGCAGTG GGAAATGTGC GTCAAGAGTG CGCCCGATTG TCAAGGGCCG GGCGTCCGGG TACGCCGACA ACAACTCGCC CCAGGATTTG TACAACAGGT CCAACAAAGG GACGACCGCT GTGTGGCCCG GGGTAAGCAA TGGCTGGATA AGTGTCGCGA ATGTCCCCGC CAGACGGAAA CGGAACGAAT CCAAGTGACT ATTGAAATCC AACCAGGATT CCGTGCGGGA GAAAGGGTTA GCTTCGAAGG CGTGACGGAC GAAAAACCCG GCTTCAAACC GGGCGATTTG CATTTTGTAC TCATGGAAGA ACCGCACGAT GTGTATCACC GGGATCGGGA TGACTTGTAC AAGACTATGG AAGTCCCATT GGTGGATGCG TTGACGGGAT TCTCCGTCAC GCTCAAGCAT TTGGACGATC ACGAGTACAC GGTGACGGTG GAGGATGTGA CGGATTGTGA TCACGTCTTG CGCGTGCCGG GAAAGGGAAT GCCGCGACGC AGCGGGCGTG GCTTTGGTGA CCTGTATCTC ACCTTTGAAG TCGACTTCCC CGATACACTG ACTCGTGAAC AAAAGGACGC CATTCGCAGT ATTCTGGCTC CGGGAGAAGA AGCGAAGCAA GAATTGTAG
|
Protein sequence | MVVRTPRQGA FVRLWQTCIV FLLLGSHVLL VPVVQAASRD YYQILGVSRD ATIKEIKKAY RQKSLEFHPD KNKDEGASEK FAEVARAYEV LSDDELKAVY DRHGEDGLKQ REQRGGGGGG GGFEDLFSQF GFDFGGGRQQ RDQEQRTPDV EIPLYVSLKQ LYLGETIDVD YVRQVLCLQW EMCVKSAPDC QGPGVRVRRQ QLAPGFVQQV QQRDDRCVAR GKQWLDKCRE CPRQTETERI QVTIEIQPGF RAGERVSFEG VTDEKPGFKP GDLHFVLMEE PHDVYHRDRD DLYKTMEVPL VDALTGFSVT LKHLDDHEYT VTVEDVTDCD HVLRVPGKGM PRRSGRGFGD LYLTFEVDFP DTLTREQKDA IRSILAPGEE AKQEL
|
| |