Gene Information |
Locus tag | PHATRDRAFT_38705 |
Symbol | |
ID | 7203403 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 739124 |
End bp | 740643 |
Gene Length | 1520 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182752 |
Protein GI | 219124943 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.010548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAGG GCCGCTCCTC CCGATCCTTT GGCAAGTTCT CCATTGGCGA CCCTTTTACG CCAGGCGGCA TGACCTCCCA TATTACGGAC ATTCAATTCC TGTATTCATT GCAATGGAGT CGCTACGGCC AGCGGTAGAA AGTGCGAATT GACTGTAAAT TAATGTAATT GTAGATGTGG GGTTTTCTCG GATGTGCAGC GTGGACAACG TAGGCTTGTC GAAGGGGATC ATGTGTCAGA TGGTTTAAGC TGGATATAGT GACGACATGT AAAGACCAAT CCAAATACGT AGATAGGTCA GGTGTCTGGC CTTCATGCGT TTCCAGGAAG AGCGTGACGT AGGCAACGTA GTTCTTTGCT CACTTTCTCT TCGTTTTGGA TCTGGCTTTC ACTTTTGGCA TTTCTTTAGA AGTTCCGCCG CCATCTGCTA TCCGTTCTAG TAACATTGCG CCATACTCAC CGGAAGGACT CTCAAACTGT ACAAACTCGT CGCCGACCGT CCGTACGGGG AATATGCATG TTGCGTCGCG GATGTTGTCC TTTGTGTTTT GGGCAATCCT GTCTCTTTTG CTGGTGCTGA CAGCGAATGC ACTGGGATGT GTTGGTTTCG GTATGCAAGC GTCTCCGCTT TCTCAGTGCA CGAGTCGTCG TCGGTTCGCA CAAAAGAAGG GATCCCACAG TCACCTCTCG TCTCACCGCG CAATGACGCT CAAATTGGCG AAAGCTCTTT CCGACGATGA CACGAGTGTG ACGATTGTGA CAAAGTCCGC GGAGTCCTAC TTTGTCGACG CAGAATTCTT CGAATTGCAA ATATCACCCC ACAGACCCCT TGGTTGCACG GTAGAAGAAA GCCTCGGCGT AGGACGACAC GTATTCGTCA GCAAAGTCGT TCCGGATGGG AATGCCGCTA AAGCTGGAAT CGCAGTGGGA GACGTCCTAA TAGGGGTCAC GGCGGTCACT GGTGATCAAA TAATGGATGT CTCCGGTCTC GGGATCGAGA CGATGTAAGT GTGCGATTGC GTCAAGGACG GTGGAAAAAA TACGATCCGT ACGCTCAAAG ACTCTCTCGT TTCCTTGTTC TTTTCAGTAA AGGACTAGTC GCATCTCGAC CGGAAAATGA GTCCCTATCT CTAAAGCTTG CGAGGGGAAC AACCGTCGTC GAGGATCACG AGCAAGCCAT TGTAGATCTG TGTGGCAATG AGGACCAGAG CGAGTCGGAA GCTGAGCAAT GCGTCTTGGA TTTTTTGAAG AGTGGCTACG ACTACGCCAA TGATTCTGAT GACAGCATGG AGGCTGTTGA TGATGTTGAC GCCGCAGACA GTGCTGCGGA GGAAGAGGAC TTGGTAGGAA ATATGTACAG TATGTGGAAC GAAGACATGC CCGCCGCATC GCCTAAACCG GAACCCATAC CTGCATCCGA GGCGGCTAGT GTAGTCAAAC CATGGTCTTC GCGATCAAGC CCATCCGGAA CTTTCATCCG AGATCCAACT ACTGGAAAGA TGAAAAATAT AGACGCTTAA
|
Protein sequence | MSKGRSSRSF GKFSIGDPFT PGGMTSHITD IQFLYSLQWS RYGQRCGVFS DVQRGQQVPP PSAIRSSNIA PYSPEGLSNC TNSSPTVRTG NMHVASRMLS FVFWAILSLL LVLTANALGC VGFGMQASPL SQCTSRRRFA QKKGSHSHLS SHRAMTLKLA KALSDDDTSV TIVTKSAESY FVDAEFFELQ ISPHRPLGCT VEESLGVGRH VFVSKVVPDG NAAKAGIAVG DVLIGVTAVT GDQIMDVSGL GIETIKGLVA SRPENESLSL KLARGTTVVE DHEQAIVDLC GNEDQSESEA EQCVLDFLKS GYDYANDSDD SMEAVDDVDA ADSAAEEEDL VGNMYSMWNE DMPAASPKPE PIPASEAASV VKPWSSRSSP SGTFIRDPTT GKMKNIDA
|
| |
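The Gene Length and GC content fields in the record above can be recomputed from the coordinates and the sequence string. The following is a minimal sketch, not the database's own method: the function names `gene_length` and `gc_percent` are hypothetical, it assumes Start bp and End bp are 1-based inclusive coordinates, and it assumes the spaces in the sequence are only display grouping.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the record: Start bp 739124, End bp 740643.
    print(gene_length(739124, 740643))
    # Short illustrative fragment only; paste the full Gene sequence field to check the record.
    print(gc_percent("ATGTCTAAGG GCCGCTCCTC"))
```

With the record's coordinates, `gene_length(739124, 740643)` gives 1520, which agrees with the listed Gene Length of 1520 bp. The strand does not affect the length calculation.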