Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50318 |
Symbol | |
ID | 7199063 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 284929 |
End bp | 286430 |
Gene Length | 1502 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185248 |
Protein GI | 219130177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGAACGAAG CTGACAAACG CCAAGCCCAG TCCAATCGAA CATCTACTAC TGGGAGGCCT CTGCGAACTA CGAGAGATTA GAAAGAACCG CTCCAGCAGC AATTCGGGAT CGTCTGGACC AAAACAAATC ATGGTTCACC GTCAGCCTAC GGTTCCGACC TACGATGCGC AACCCTTGAC GGACGCTTTT ACGGCCAATC GGCGTTTGCA GTACGCAGTG CTGCGCAATT TACGAGAACT CGCGCTTCAA AAGGCAAATA ATCGTGCGCA AGCCGCCAAA TTGACGGCAC AGGCGAGTCG TCGGCGAATG GAGGCGTTAT GCGTTCCCGC CGTTATAGGA AACCCTTACC GGCACAGTAT CTGGAAGAAT CGAGGTTTCT TCACGGGACC GGATGGAAGT ACCCCTCCAC CAAATCCAGA CACACTGAAG AGGAGAAAGC TGGAACAGGA ATCCTTCTAC TATCACTTGC AGCCTCCGTG GTCGAGTAAG GAATCCAATA CGCTTTCTTC AATCGTCCAA CATCTGCGGA TTGAAAGCAC CACTGAAGAG GCGATTATCG AGGGACAAGA GCAACAAATT GACTTTTTGC AAGTAGCGGA GCAGCTACAA CAGCAACGCG TTAAAAATGT ATCGCTTAGC TCGTCGCTTC CGAAAACGGC ACTTCCGAGA ACCGCAGAGG AATGCCGTGT GCACTATGAG CAGCTCATCA GAAAACGAGA TATGATAACC AAAGCGGAAT TGCAACGCAT TGTCGAGCAA GTGGAAGCTG CATCGAAAAC ACCTGATTGG TTTGCCATAG GAAAGGAGTT ATCCACTGCC ACAAAACAAA GGACGGGATG GGAATGCTTT CTTGCGTACC AAAATATGCT ACGCCAATCA TGCCAATCGT CCACTATAGT GTTTACCGCC GAGCAGGACG AACTTCTACT GAAGTACGTA GCAGCCATGG GTCCTCAACT AGTCCTGGAT GGATCGCAAG TAGCTTACAT GGCTGCGAAT ATTGCGCCCG ACAAACCCAG ATCAAAAATA TTCAAACGAC TCAATACCTC CCTGCTCAAC CCTAAATTAA AGCACGACGC ATGGAGTGAC GAAGAAGAGC GTAAATTGGC CATTGTGATG AAGATGTACA AAAGCTCACC CGGAAATGAT TTGCACCAGC TCGTTTATCA TCTGCCGGGC CGGAGCATGA AGTCCGTGGT TGATAAATGG CATAGAACTT TGAATCCGGT TTACTCAACA ATACCGTTCA CGGCGGCAGA AGATGAAGCT CTTTTGCAGG CTGCACGACA AGAGGAATCA AGAGGTAGGG GAATTCAGTG GGTCAATTTC GCACACGAAA AGTTTCCAAA ACGACATCCT CAACGTCTTC AACGACGTTA CCTAAACTTG AATGAAAGAG CGAAGGAGGC ATTGAGAAGA GAATATCAAG AATCTACGGA ATTCTAAATT CTGTTTGGAC AAACTTAGTG TCTCTTGCTC TAAAGTACAG CTGGCTCCCC TT
|
Protein sequence | MVHRQPTVPT YDAQPLTDAF TANRRLQYAV LRNLRELALQ KANNRAQAAK LTAQASRRRM EALCVPAVIG NPYRHSIWKN RGFFTGPDGS TPPPNPDTLK RRKLEQESFY YHLQPPWSSK ESNTLSSIVQ HLRIESTTEE AIIEGQEQQI DFLQVAEQLQ QQRVKNVSLS SSLPKTALPR TAEECRVHYE QLIRKRDMIT KAELQRIVEQ VEAASKTPDW FAIGKELSTA TKQRTGWECF LAYQNMLRQS CQSSTIVFTA EQDELLLKYV AAMGPQLVLD GSQVAYMAAN IAPDKPRSKI FKRLNTSLLN PKLKHDAWSD EEERKLAIVM KMYKSSPGND LHQLVYHLPG RSMKSVVDKW HRTLNPVYST IPFTAAEDEA LLQAARQEES RERRRH
|
| |