Gene Information |
Locus tag | PHATRDRAFT_48789 |
Symbol | |
ID | 7195102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 242359 |
End bp | 244167 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183326 |
Protein GI | 219126149 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
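The coordinate and length fields in the Gene Information table are linked by simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(242359, 244167)
print(length_bp, protein_length(length_bp))  # 1809 602
```

Under these assumptions the computed values agree with the Gene Length (1809 bp) and Protein Length (602 aa) fields.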
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.450874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTCA CTCCATCGCA TGCTACCAGA GAAGCGAGCA AGGATGCCTA CGAGCTTAAC ATGGCCGTAG CTGTGTTGTT CTTCCTCCGT CGAAGCAAGT TCGAAAGATC CGCAAATCTG TTTCAGACTG AGCTCCGCGA GACTTACGGA AATTTCAACG GGAAAACATT TAAAGGAACT CTCAGATGGG AGCCGCTTGA ACAAAAATCA CCGTTCCCCG AGAAAGGGGA CAACGAGGAT GAAAGCTACA TAAGTGAAGA CTCCGACGAT TTCGAATGGA AACACTTTGA TAGCAACCTA GTCAAAATGC GAGTCGGGGA CCACTTCGTC GGAGACAGTG GTAGCGATAG CAGCTCCAGC TCCGACTCTA CCAGCGAGCT GACTGCAAGC CATGTCAAAA TGAATATAAG GTGTACCACG CGTCAACCAA ACAAGGATGG TGCGCGCGAT GAAAGCGGCA TTATAGACGT CGACAAAAAG CAATTGAGGA CTCTGGACGG TAGCGAAAGC GATAGTACAG GGGACAGCAA ATCGAGCGGT GAAGGCGACT GCATTCCAAA ACAGGGTCAC GCATCAAACA AGTTTGCAAG TGCCTCCTCG TCCAGGCCTG CTCTTATTGC ACAACACTTA AAGCCACTTC CAAAAGGAAG GAATTTTCTT GATAGTGACA GCGATAGCTC TTCCAGCGAA GAACAACGCG CAAAGCCACT TCCAAAAGGA AGAACTTTTC TTGATAGTGA CAGCGACAGC TCTTCCAGCG AAGATTCGGC TCAAAAGAAA AAACCGCCGA TTATGGCCCA AAGACTTGTG CGACGGGAGC CACAGCTATG TAGCATCGAC ATCATGCCAA CGAAGATTGC TGCTACCAAA AATTTACCAA AATACGATTC TCTGAATCGA AAAAAGAAGC TTGCTTCTCA GGTAAAGTAT CGAGCGAAGG TTCTCGACGA CAGCGATAAG AGTTCCGAAG ATGAACGCCG AGAGCGCACT AAGTATCCTG CCCACAAAAT ATCAAACGAT GTTGAAAAAG TAGGATCGAG GGTATCGAAT TCCATTTCTG CCCAAATTGT TACCCAAAAC GACTGCAACG ACAGCAGCAG TAGCTCCAAC GAAAGCGACA GTACCGCCAG CGACGAAGAA ATTCGATCAG GATCCGTTTG GCGCTCCCGA CAGTCTATTA CGAGTATTAA CAATGAGAAA AACAAGGAAA GTGCCCGGGT AGACGTCAGC TCATCACAGC AAGACGTGAA ACTGCAGAAA ATGAATTACA ATGTACAGAA TTTGTCTACT AGGACTGCAC CATTGAAAGC CAACGGCCAA GAAAAAGCTG ATAAGAATCG CGATGCCGAA AGTCATTTCT CCAATACACT GAAGGCTGCC CTGTCCAACC TTAGAACTGT TGCTGAAAAC TCAATTTTGC ACAAAACAAA CGGCCTTGCT TGTACAAGTG ATTCTTCGTC GAATTCAAGT GAATCATCAT CGTCAACGTC GGAAATTTAT AAAAATCCAT ATCCCATCCG ATCCAAGGCT AGAGAGAGAA AACGTTTACC TAGGCGATCT CAGTCGCTTG ATATTGAGTC TCTATCATTT CTGTCTACTT GTCGAGGTAT GAAACGCTCA GTATCATTCT CTGACGATGA CAAAGTAGCC GAGATTCCTC GGTATGAGGC TCAATCAAAG TCTGAGCTCT TCTACAATAA AGCCGACATC AGGCGGTTTA CTGTCGATGA GCAGACACGC CGTCAAGAGG AGCAAACTGA GAAAATGGCC ATGATGCTAA AGCTCTACCT GCTTGCTAAA AAAAGTTAG
|
Protein sequence | MVVTPSHATR EASKDAYELN MAVAVLFFLR RSKFERSANL FQTELRETYG NFNGKTFKGT LRWEPLEQKS PFPEKGDNED ESYISEDSDD FEWKHFDSNL VKMRVGDHFV GDSGSDSSSS SDSTSELTAS HVKMNIRCTT RQPNKDGARD ESGIIDVDKK QLRTLDGSES DSTGDSKSSG EGDCIPKQGH ASNKFASASS SRPALIAQHL KPLPKGRNFL DSDSDSSSSE EQRAKPLPKG RTFLDSDSDS SSSEDSAQKK KPPIMAQRLV RREPQLCSID IMPTKIAATK NLPKYDSLNR KKKLASQVKY RAKVLDDSDK SSEDERRERT KYPAHKISND VEKVGSRVSN SISAQIVTQN DCNDSSSSSN ESDSTASDEE IRSGSVWRSR QSITSINNEK NKESARVDVS SSQQDVKLQK MNYNVQNLST RTAPLKANGQ EKADKNRDAE SHFSNTLKAA LSNLRTVAEN SILHKTNGLA CTSDSSSNSS ESSSSTSEIY KNPYPIRSKA RERKRLPRRS QSLDIESLSF LSTCRGMKRS VSFSDDDKVA EIPRYEAQSK SELFYNKADI RRFTVDEQTR RQEEQTEKMA MMLKLYLLAK KS
|
| |
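The gene and protein sequences in the Sequence table can be checked against each other by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not part of the original record. The Translation table field is blank in this record, so the use of the standard genetic code (NCBI translation table 1) is an assumption. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (assumed), in NCBI order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the Gene sequence field.
print(translate("ATGGTCGTCA CTCCATCGCA TGCTACCAGA"))  # MVVTPSHATR
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_percent` would allow the same comparison against the complete Protein sequence and GC content fields.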