Gene Information |
Locus tag | PHATRDRAFT_14145 |
Symbol | |
ID | 7202311 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 134691 |
End bp | 136574 |
Gene Length | 1884 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181664 |
Protein GI | 219122670 |
COG category | |
COG ID | |
TIGRFAM ID | |
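As a quick consistency check on the fields above, the following minimal sketch recomputes the gene length from the start and end coordinates and compares it with the bases needed for the reported protein length. It assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1884 bp), and the helper names `span_length` and `codon_bases` are hypothetical, introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_bases(protein_aa: int) -> int:
    """Bases needed to encode the residues alone (no stop codon counted)."""
    return 3 * protein_aa


gene_bp = span_length(134691, 136574)   # Start bp / End bp from the record
coding_bp = codon_bases(516)            # Protein Length from the record
print(gene_bp, coding_bp, gene_bp - coding_bp)  # prints: 1884 1548 336
```

The 336 bases not accounted for by the 516 codons are consistent with the record marking the gene as spliced, since a spliced gene span includes intronic sequence in addition to coding sequence.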
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.472302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAGC GGCAAAAGCG AGAACATGCC GGATTAGAAG CTAGCTTTAT CGGGAGAAGC AAGTGCCTTA AGCTGCTACA AATTAGCTTG AAAGACTTTC GGCGTCTCTG CATTTTGAAA GGCATTTACC CTCGTGAACC GCTTGGCCGC ACTCCTGGAA ACAAAAAGGG ACAGTCGTAC TACCATATCA AAGACGTCCG TGCTATTGCT CATGAACCGG TTTTAGAAAA ATTTCGCGAC TTTCGAGCGT ATATGAAGAA GGTGAGTGAC ATTTAAAAGA GGACATGCTT CCGGCTGTAC GCTGTTGGAA AGTCTAAACC TGTACATTCA TAGGTCCGCC GGGCTGCTGG TCGAAACGAA AAAGACGAAG CGATTCGAAA AAATGCACTG GTTCCTACTT ACACAATTCA CCATCTCGTC AAAGAAAGAT ACCCTCGCTT CTCAGATGCT CTTTCTGATC TGGATGATGC ACTGACTCTT TCGTACTTAT TTGCTGCGTT GCCTGCCGAA AAGAACATCA AATCAAAAGT TGCCGGCAAG GCAAAAACAC TGGTAGCTGC ATGGGGCGCC TATTGCGCAA CTACGGGATC CATCTCCAAG TCCTTTATTT CTGTGAAAGG AGTATATTTG GAGGCGACTG TTCAGGGCTC TCAAATCCGA TGGGTGGTCC CACATTCTTT TACTCAATAC ATGCCAGAGG ATGTGGACTA CCGTGTTATG CAGACATTTT TTGAGTTCTA CGAAACCCTT CTGAATTTTG TGTTGTTCAA ATTGTTTAAT GTTATTGGAG TGCGGTATCC CTTTCCCGTG AAGCAACTGG GTGACCAAGT TGTAGGCAGT ACCAGTGCTA TCTTGGGGGC AAACTTGCGT GCCCTCACAA ACTCACTAAA TTCTTCTAAC GGGACAATCA GCACGGTTGT TGACGCCACC TTAAACAAGA ACCCCTTCGA GCCAGCGAAA AAGTCGAAGA GCTCAAAGAA AGACAAGGTT CTTATTTCGT CTGTCAATGA TACACTTAAC CAGCTGCCTG AAGAGAACAG TGAAGATGAA ATGAGAGACG ACGACGACAA CAGCGTTGAT GTGTCCGGTC CGCTTCAAGC AGCTTTAGAA AGCATGGCCC AGGAGCAGAT AAAGCACGCT ATTCCAGGAG GTTCAGTGGA TCTAGATGAT GATGCAATCA TGAGAAAACG TATGTTTGAA GGTCTTACAT TCTTCTTGTC TCGCGAGGTA CCCCGAGGAT ATTTGGAGCT CATCTGTCTA GCGTATGGAG GCAAAGTAGG CTGGGAAGGT CAAGATAGCC CTATTTCTGT AAAGGATTCG ACTATCACAC ACCACATTAT CGATCGCCCG AAACTTCCGG TCAGCTACGA GTCATTGCCG AAATCAAGGG AATACGTGCA ACCTCAATGG ATCCTAGACA GCGCAAACTT TATGTTTCTG TTGCCGATTG GCAGATACGC AGTTGGTGCT GAACTACCAC CTCACCTGTC GCCCTGGGTT GACGATGAGG AAGAAGGATA CAAGCCTGTG TATGCCGAAG AAATTGAACG TTTGAAGAAC GGTGAACCAG CTCGCCCCGT CGATGTAGAA ACGGAGCCCG AGCTTGCGGA AGAATCTCCC GATTCCACTC CAGAGGATAT ACAAGCACCG GACGAAGATG AAGAGGACGA AGGGGCGGAT CTAGAATCAA AAATGGATAA CGAGGATGTA CTTAAGCGGC AGCAAAAGAA GCGAAAGAAG GAAGAAGATG AAGCCCATGA ACTTGCGAAG ACGATGATGA GCAGAAAGGC CAGTCATCTG 
TACAATCGAA TGCAGCATGG CATCGCGCAG AAGCAGGCTA AAGTGGATAT GCTTAGCCAG CGTCGTAAAG ATCTAGGACA AAGT
|
Protein sequence | MGKRQKREHA GLEASFIGRS KCLKLLQISL KDFRRLCILK GIYPREPLGR TPGNKKGQSY YHIKDVRAIA HEPVLEKFRD FRAYMKKVRR AAGRNEKDEA IRKNALVPTY TIHHLVKERY PRFSDALSDL DDALTLSYLF AALPAEKNIK SKVAGKAKTL VAAWGAYCAT TGSISKSFIS VKGVYLEATV QGSQIRWVVP HSFTQYMPED VDYRVMQTFF EFYETLLNFV LFKLFNVIGV RYPFPVKQLG DQVVGTLESM AQEQIKHAIP GGSVDLDDDA IMRKRMFEGL TFFLSREVPR GYLELICLAY GGKVGWEGQD SPISVKDSTI THHIIDRPKL PVSYESLPKS REYVQPQWIL DSANFMFLLP IGRYAVGAEL PPHLSPWVDD EEEGYKPVYA EEIERLKNGE PARPVDVETE PELAEESPDS TPEDIQAPDE DEEDEGADLE SKMDNEDVLK RQQKKRKKEE DEAHELAKTM MSRKASHLYN RMQHGIAQKQ AKVDMLSQRR KDLGQS
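The sequences above are printed in space-separated blocks of ten characters, so they need to be rejoined before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage for a nucleotide string. It is an illustration only: the function name `gc_percent` is hypothetical, and counting every remaining character in the denominator is an assumption rather than a statement of how the reported GC content was derived.

```python
def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    # Remove the spaces and line breaks used for display blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    # Assumption: every remaining character counts toward the total.
    return 100.0 * gc / len(bases)


# First two display blocks of the gene sequence, used as a short example.
example = "ATGGGAAAGC GGCAAAAGCG"
print(round(gc_percent(example), 1))
```

Passing the full gene sequence string in place of `example` gives a value that can be compared with the GC content field of the record.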
|
| |