Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50848 |
Symbol | |
ID | 7199533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 1042944 |
End bp | 1044719 |
Gene Length | 1776 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179112 |
Protein GI | 219116634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.104715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACATCGCT GTGGTTGACA ATGGAGAGAG CAAGGTCCAC TACTGTCAGG CAGCCGCATT CATTCCGCTC TGCTCTATTG AAATTACAAA TTAATTATCG GTGTTACTTC ACAAACGTTC TGAAAACGTC ACGCGCTCAA CATGACGTCG CAATCGTCCC ACAAATCGAA TTACGCGCTG GCCATTTCCT TTCTGGTTCA AGAGCTGATG GATGCGTACG ACAACGGTGA TACCGTCAAT TTGACGCAAC TCAAAGGCAA GGCCTCGCGC AAGTTCAAGT TGAAGGGAAT TCCCAAAATG AGTGACATAC TGCAGGGTCT ACCGATCAAT TACCGCAGCA AGCTATGGCC GTACCTGCAG ACCAAGCCGG TCCGTACGGC CTCTGGCGTT GCGGTCGTCG CCGTTATGAG CAAGCCACAC CGGTGTCCGC ATATTGCCTA CACTGGAAAC GTTTGCGTTT ACTGCCCCGG TGGACCGGAT AGTGACTTTG AATATAGCAC GCAGGCGTAC ACTGGATACG AACCAACCTC CATGCGCGCC ATTCGAGCCC GTTACGACCC CTACAGTCAA GTCAAAGGAC GTGTCGCGCA ACTGCGAGCC ATTGGACATA CGGTAGACAA GGTAGAATTC ATTGTTATGG GAGGGACCTT TCTGAGTTTG GATAAGGAGT ACAAAGATTA TTTCATTCGT AACCTACACG ATGCTTTGTC GGGATATCAT TCGCAAACAG TGGAAGAATC AGTTCGATAC TCAGAGCAAG CTGTTACCAA GTGCATTGGA ATTACTATTG AAACCAGGCC TGATTACTGC TTAAAGCCAC ACTTGGAAGA GATGCTTTCG TACGGCTGTA CGCGAATCGA AATCGGTGTA CAAAGTATCT ACGAATCCGT GGCACGAGAA ACTAATCGCG GACATACGGT GGCGGCTGTT TCTCACTCGT TTCAGCTAGC AAAGGATTGC GGCTTCAAAG TTGTCACGCA CATGATGCCG GATCTACCAA ATATGGGCTA CGAACGAGAC TTGGAGGGCT TCAAGGAGTA CTTTGAAAAT CCAATGTTCC GAAGCGACGG TATGAAGCTA TACCCGACTC TAGTCATCCG AGGAACGGGC TTGTACGAAT TGTGGAAAAC AGGTCGATAC CAGAATTATA CTCCAGATCA ATTGGTGGAA CTAACCGCGC AAGTTTTGAG CCTCATTCCA CCGTGGACAC GTTTGTATCG AGTCCAGCGT GATATTCCGA TGCCACTCGT CTCATCTGGT GTTGAGCACG GCAACCTCCG TGAACTCGCC TTGCAAAAAA TGCGGGAGCA AGATTTGCCG TGTCTGGATA TTCGATCGCG AGAAGTTGGG ATGAAGCAGA TTCATCACTC CGTCACGCCT GATCAGGTCG AACTAGTGCG GCGCGACTAC GTTGCCAATG GCGGATGGGA AACTTTTCTT AGCTACGAAG ATCCGACACA GGACATTCTG ATCGGGCTGC TGCGATTGCG CAAAACATCG CCTGCTGCAT GGTTGAAGGA AGTTGCCGAA TATCCTTCAA GTATTGTACG AGAATTGCAC GTATACGGCA CTGCTGTTGC CGTCTCGGCT CGCGACCCGA CTCGTTTTCA GCATCAAGGC TTTGGTATTC TGCTAATGGA AGAAGCCGAG CATATTGCCC GGGACGAGCA CGGGTCCAAA AAGTTACTCG TCATTGCGGG GGTCGGAACG CGGCACTATT ACCGCAAGAT GGGGTACCAC CTGGACGGAC CGTATATGAG CAAAATGTTG CTGTAA
|
Protein sequence | MTSQSSHKSN YALAISFLVQ ELMDAYDNGD TVNLTQLKGK ASRKFKLKGI PKMSDILQGL PINYRSKLWP YLQTKPVRTA SGVAVVAVMS KPHRCPHIAY TGNVCVYCPG GPDSDFEYST QAYTGYEPTS MRAIRARYDP YSQVKGRVAQ LRAIGHTVDK VEFIVMGGTF LSLDKEYKDY FIRNLHDALS GYHSQTVEES VRYSEQAVTK CIGITIETRP DYCLKPHLEE MLSYGCTRIE IGVQSIYESV ARETNRGHTV AAVSHSFQLA KDCGFKVVTH MMPDLPNMGY ERDLEGFKEY FENPMFRSDG MKLYPTLVIR GTGLYELWKT GRYQNYTPDQ LVELTAQVLS LIPPWTRLYR VQRDIPMPLV SSGVEHGNLR ELALQKMREQ DLPCLDIRSR EVGMKQIHHS VTPDQVELVR RDYVANGGWE TFLSYEDPTQ DILIGLLRLR KTSPAAWLKE VAEYPSSIVR ELHVYGTAVA VSARDPTRFQ HQGFGILLME EAEHIARDEH GSKKLLVIAG VGTRHYYRKM GYHLDGPYMS KMLL
|
| |