Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43460 |
Symbol | |
ID | 7197169 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 517514 |
End bp | 519529 |
Gene Length | 2016 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177951 |
Protein GI | 219112399 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCCCA TCCTGGAATG TTCGGCGCGG CGGTCGACCC GTGAACGAAC AACCGAACAT CCTTCCGTAG CACGTGGTGG AGACGAGCGG AGAAAGGGGT CAAATTCCAA ACGAAACGCT CGTTTGTTTA AGAGGGGGCA TGCCTACACC TCTCAGCTTC ACAATTTATG TATGTGTATG TGAATGCGGA TTCAAGATCA CGACACTGCT CGATTTTCAG GGTACGCCCC ACACCACTCA CAATCGGCAT CCGCAGTGAT CGCATCCCTA ACAACACTGC CAACTCCCGA TACCCGCATC GATTGATACC ATAGCCCAAC AGCAATAGAA AATGATCAAG TCCTTTGCGT GTACTGCCAC AGCCTTGCTC AGCCTGCTGA CGGTCGTTCA TTCGATAATA GACCCCGGTA CGTGTCTCTA GATTGGATTG GACTGGACTC AATCACAAGA ACACAGATTG CTTTCGATTG CTGCAATAGT CTCTCACATA CTTCGTCGTT CTCTTCACCA GCAACGTTCG ACCCGGATAA ATGTTCTTCT TCGACACTGA ATTTCAACTT TTTCCACGCA TACGATCAAG TCACGGACGA GACCTTGGCG GCCTACGGCT TCGCCTATTT GTCGATTCGC ACAAGCTCCG ACAGCAGCAA CTGCGAGACT CCGTTGGCTC GGGTCTTTGA CACGGCCCGT CCTACTTGCG GTGACGGCGA CTTGACGGGT GCGCCAGACG ACGGCAAGGC CATTATCGTC CAGGAAAAGT CCCGCTGCGG TGGAGCCGAT GACTGTGCGG ACGGTGGAAC GATCGAGTTC GACTGGAACG GAGCTCTGGT CAATCTACGC AGCATGCGCA TTCTGGATAT GGACGAAGCC GTCGAAGTTT CCGTGAAGAT ACACACCCAT CCGGATTGGA CCGTTCTGCC CGCCCCTAGT CATCCCGGCA ACGGCCAACA CGTCACGTAC GACTTTGGGG GAGGCGTGGA TAGTGTCACC CAGCTTCGGG TACACTTTGT GGGATCGGGT GGGATTCCGT CACTCACCTA CACCAAGTGT GAGCCCATCG TTCACGGGGA TCCCCATTTC AAGACCTGGG CGGGACACAA GTTCGATTAC CACGGACAGT GTGATTTGCT GCTCGTGCAC GCACCGCACT TTCAACAAGG TCAGGGACTC GATCTGCAAG TACGCACCGA GCAACGTAGC TTCTTCTCAT TCGTCAGTCG GGTGGCAATC AAGATCGGCA ACGACATTTT GGAAGTCGGC TACCGAGATC TGCTTTTGAA CGGGGCGCTG CACGACAATC TCCCCGTGAA CGGCTCCTTG GACTTGTCCG GTTACCCCGT CACCTACACG GACGAGCCCT TCCCCAACGG ACGAGCCCAA AAGGTCTACA CGATACAGAT CAATTCTATG GAAAGCATTC GGATCAGTGT GTTCAACCAC TTTATGGCGA TCCGCTTCCT CCATATCAAT CCCCGCAACT ACCGAGACGC TACGGGCCTC TACGGGGACT ACAATTCGCT ACGCATGCTC GCCCGCGATG GAACCACGGT CCTACAGCAC GATCCCGACC AATACGGTGC CGAATGGCAA GTCAACGATC AAGACGCCCA GCTTTTTGCG CAGGCCCAGG CGCCGCAGTA CCCCCAAGCC TGTCGACCCG CACCGTCAAT TGCGGACGGC AGTCGTCATT TGCGCCACGG TATTACCAAG GCTCAGGCGC GGGATGCGTG CCAGCGGGGC ATGGCCGCGG ATATTGGTGA TTGCGTCTTT GACGTAATGG CCACGGGAGA TCTCGGTATG GTCCACGCGC ATTTCTTTTA ACTCAAGTGC AAGATCCAAT CCATGAGATG GGTGGTGAAG ACCATTGAAG TAACGGGGGA CGCGATCGAC GCTGGAATCG GGGGGACGAC GACCGTGGGG TCGGACGTGG CGGCCGCTAC GAATTGGTGG GTGGGGAATG TAGCAAACGT CACCAATCCA GGTATTGATA TTGAGGAGCT GTTCTTTTTA CACTAA
|
Protein sequence | MIKSFACTAT ALLSLLTVVH SIIDPATFDP DKCSSSTLNF NFFHAYDQVT DETLAAYGFA YLSIRTSSDS SNCETPLARV FDTARPTCGD GDLTGAPDDG KAIIVQEKSR CGGADDCADG GTIEFDWNGA LVNLRSMRIL DMDEAVEVSV KIHTHPDWTV LPAPSHPGNG QHVTYDFGGG VDSVTQLRVH FVGSGGIPSL TYTKCEPIVH GDPHFKTWAG HKFDYHGQCD LLLVHAPHFQ QGQGLDLQVR TEQRSFFSFV SRVAIKIGND ILEVGYRDLL LNGALHDNLP VNGSLDLSGY PVTYTDEPFP NGRAQKVYTI QINSMESIRI SVFNHFMAIR FLHINPRNYR DATGLYGDYN SLRMLARDGT TVLQHDPDQY GAEWQVNDQD AQLFAQAQAP QYPQACRPAP SIADGSRHLR HGITKAQARD ACQRGMAADI GDCVFDVMAT GDLGMIQSMR WVVKTIEVTG DAIDAGIGGT TTVGSDVAAA TNWWVGNVAN VTNPGIDIEE LFFLH
|
| |