Gene Information |
Locus tag | PHATRDRAFT_38038 |
Symbol | |
ID | 7202918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 722174 |
End bp | 724435 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181963 |
Protein GI | 219123296 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
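The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; neither assumption is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 722174
end_bp = 724435
protein_length_aa = 753

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 2262
print(remainder)            # 0
print(expected_protein_aa)  # 753
```

Under these assumptions the computed values agree with the Gene Length (2262 bp) and Protein Length (753 aa) fields.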
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TACGGTGCTT TGAGGACCAC CAAGGTCGTT GTGGGGACTG TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAACAC TCTGAACATC CGTAGCGTCA AACTCTTCAA ACCGGACCAG TCGACAGTAC CATCCAGTCC CGCAGCACCA ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG GAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT ACACCAATTG GCGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG CAATATTTTC TACTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TCGATTCTTT GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGACG ATTTTGTTGA AAGATACAAC AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAGCAG AAGATGATGG GGACTACAAT GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACGTACGCTA TCGTTGGAGA CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCAAAGTCGT CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC 
CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGGCAAAT TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACGGGAG AACCAAGAAG TGGCCTGTAC GCACACCTAA CACCAACCAA AAAAAGAAGA AAGAACAAAG ACGGTAGTTT TAGCAGCAAT AGACTACAAG GACGATGCTT GGTGTGTTCC AAGAAGACAA CATATGTTTG CTCAGTGTGC AAAGATGAAG AAACACCTCA CTCCAGAGAA CCGTGGGTTT GTTATACCAC CAAGGGGAAG CTGTGCTATG CAAACCACAT GGCTACCTGT CACGGCGCCT AA
|
Protein sequence | MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTVVEVNNT RKAPNNRVST FITADFDIGG GSVKRNTLNI RSVKLFKPDQ STVPSSPAAP IPAVDNADTD LAVPEQEEGE AVLQETSPDE ELEFPAQPMM EIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QYFLLMFPPD QLSAMCQLTN VQLAQQNKHC MSTGELLRFF GILILATKFE FSSRSQLWST TAPSKYIPAP AFGKTGMSRQ RFDDLWRNIR WSNQCPERPE GMSSHTFRWQ LVDDFVERYN NHRANTFKPS HLICVDESMS RWYGQGGEWI NHGLPNYVAI DRKPENGCEI QNAACGCSGI MLRLKVVKGK TAAEDDGDYN EQLLHGTKIL KELVLPWWWT DRIVCADSYF SSVGTAMELQ RHGLRFIGVV KTATKQYPMR YLSTLELNQR GERRGLVMRD VDTNYSTLLA FVWMDRDRRY FVSSASSLDA GKPYVRYRWR QIDQSPDADP ERLEIIIPQP KAAELYYSAC GMIDRHNQSR QDTLMLERKL GTTNWSTRVN LSIFGMIVVD TWLAYSLCTG IGRANGREEK QKDFYTALAE ELVDNQYDNV GSRRVFVEAN LDNDSPALSR TTGEPRSGLY AHLTPTKKRR KNKDGSFSSN RLQGRCLVCS KKTTYVCSVC KDEETPHSRE PWVCYTTKGK LCYANHMATC HGA
|
| |
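The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is blank. It uses only the first 30 and last 12 bases, copied from the Gene sequence field; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon, trailing partial codons are ignored."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


first_30 = "ATGTCAGAAG GGGATTCCCG CCGAAAGATT"  # start of the gene sequence
last_12 = "CACGGCGCCT AA"                     # end of the gene sequence

print(translate(first_30))  # MSEGDSRRKI
print(translate(last_12))   # HGA*
```

The two outputs match the first ten and last three residues of the Protein sequence field, followed by the stop codon. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content field; the 30-base excerpt alone is not representative of the whole gene.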