Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14032 |
Symbol | |
ID | 7202536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 512673 |
End bp | 514073 |
Gene Length | 1401 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181572 |
Protein GI | 219122480 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCAACCAGC TGTCATTGTC TCGTCCGCTG CTACGTGGTG TTGCCGCTAT GGGATTTGTA AAGCCGACAC CAATCCAAGC CGCCGTTATT CCGCTGGCTT TGGCGGGAAG GGATATCTGT GCTTCCGCCG TCACAGGTTC AGGGAAAACA GCTGCTTTTC TCTTGCCGAT TCTGGAGCGC CTGCTGCATC GTTATTCTGG CCGCACTAAG GCTATCATTT TGACACCTAC ACGCGAATTG GCGGCTCAGT GTTTGGGCAT GCTGACGTCG TTTGCCCAGT TTACCAATCT GAGGGCGTCG CTCATCGTTG GTGGTGCCAA AAACGTGAAC GCACAAGCGG CCGAGCTGCG GTCTCGTCCG GACGTTATTG TTGCGACACC TGGTCGCCTT TTGGACCACA TCACTAACTC GGCTGGTGTA ACGTTGGAAG ATATTGAGAT CTTGGTACTT GACGAAGCAG ATCGTTTGTT AGATTTAGGC TTTCAGGATG AAGTACATGA ATTGGTCAAG GCATGTCCCG TACAGAGACA GACTCTTCTA TTCTCGGCAA CTATGAATAC TAAGGTAGAT GACCTAATTC AGCTCAGTAT GAAACGCCCA GTTCGAGTTC GAATAAGCGA CAAGGCGAAC AGTATGGACA TTGAAGTGGC GCCGCGCTTG GAGCAAGAAT TCGTACGCGT TCGCGCTGGA AACGAAGGAG CTAACCGCGA GGGAATGCTG TTGGCGTTGC TTACACGAAC ATTCAAGAAA CAAACAATTG TTTTCTTTGA TACGAAAGCC GCCGCGCACC GTTTAATGAT CCTTTGTGGT TTGTGCGGGA TTAAATGCGC GGAGTTGCAC GGCAATTTAT CCCAGCAACA GCGACTAACA GCGCTTGAAG AGTTTCGGAA AGGCGACGTC GATGTTTTGT TAGCCACCGA CTTGGCGGCG CGAGGTTTAG ATATTGATCG CGTGAAGACC GTAATCAATT TTGAAATGCC GTCACAGGTT GCTACCTACG TCCACCGCAT TGGACGTACC GCTCGTGCAG GTCGAGGAGG TCGGAGTTGC ACGCTTATTG GCGAGGGGCG TCGACACTTG ATGAAAGAAC TAATCAAGGA CGCGGAAGTC AAGAATAAAC GGCACACCAC AGGTGACACA GCAAAAAGTT CGTTTGAATC AGGTGTGATT CGATCACGAA CAATTCCTCC CGCCGTCATG GGTCACTTTG TTGCTAAAAT ACAGTCTTTG GAAACACATG TAGATGAGGT CTTCCAAGCC GAAGCCATCG CAAAGATGGA CCGACTAGCA GAGATGGAAG TAATCAAAGC GCAAAATATC ATTCAGCATT CTGACGAGAT TAAGGCACGC CCTCAACGTG AATGGTTCGC ATCCGAGAAG CAGAAGAAAC TTACAAAGGA A
|
Protein sequence | FNQLSLSRPL LRGVAAMGFV KPTPIQAAVI PLALAGRDIC ASAVTGSGKT AAFLLPILER LLHRYSGRTK AIILTPTREL AAQCLGMLTS FAQFTNLRAS LIVGGAKNVN AQAAELRSRP DVIVATPGRL LDHITNSAGV TLEDIEILVL DEADRLLDLG FQDEVHELVK ACPVQRQTLL FSATMNTKVD DLIQLSMKRP VRVRISDKAN SMDIEVAPRL EQEFVRVRAG NEGANREGML LALLTRTFKK QTIVFFDTKA AAHRLMILCG LCGIKCAELH GNLSQQQRLT ALEEFRKGDV DVLLATDLAA RGLDIDRVKT VINFEMPSQV ATYVHRIGRT ARAGRGGRSC TLIGEGRRHL MKELIKDAEV KNKRHTTGDT AKSSFESGVI RSRTIPPAVM GHFVAKIQSL ETHVDEVFQA EAIAKMDRLA EMEVIKAQNI IQHSDEIKAR PQREWFASEK QKKLTKE
|
| |