Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37843 |
Symbol | |
ID | 7202648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 197775 |
End bp | 199514 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182024 |
Protein GI | 219123422 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGGT TACCGCAAGG CCAGTCGGCC CCGACAACTC CAAGCTCCAA GTCGATGAGC AAGCAGACCC TGGCTCAGTC CAAAACGGGC TACGGTCGCC AGGGCCTCTC CCGAGGACCG ACTATGATGG TGCCCCGTCC ACAGCGTTCG CGCTTTGCCG ACCCACCTCG GAGGCATTGT CAACCAGGTA GAGCCTCTCG AAACGGTGTG GCGTCTATTA AACAGGTCTC GTTTTGCAAC GAAAGCATTG GAGTATCCCA GCTAACGCAA GATTCTACGA GCACGACGCC TCCTTCCTTT GGAGGATATC CTCCTTTTCA GCAAAGCTGG GCGTCTTCCT GCAACACGGC ACAGCAAGAG CGGTCCGTGT CTTCCTATAC GTACTCCAGC AGTGCTTCGG TAACTGCAGG GCAAGGGAGG CCCAGCTTAT GTCCCGAATC ATCGAAGAGC CTTCATCAGT CGTCATTGGT CTCTGTAGCG TGTTCAGAGA AGTCAAAGGG ACTGTCGAGC GCTTCCCGTG CCGCCTGTAA TCGTAATATG CTGCGGTCTA TGCTCAAGCC CACATTTGCC ATGAGCCGTG CGCTTGTTCA ACGCCCCGCC CTGCTTCCAA TTCCTCACCA GCGTTCTCTC TCAACACAAG CAAGCATTGC CAGTGCTCGA CAAATTTTGA CACCATCTGA CAAATCAAAT CATGCAAAGG TCGGACCGCA TATCGTTGCT CACGGCCAGT CTGTTATCAA GAAGACTAGC TTAGATGACG ACGAAAAGCG CACTCCGAAC ATCGGAGTCT CTCTCGATAC ACATCGACTT ATCAAGTCGA TTGTTTTCGA AGAATTCAAA TCACACTTTT CGGAACGTGC CTTGCAAGTC GATGAGAAAG AGAATAGCAT TGCTAAAAAA TTGCTTGAAA TGCGCACAAT AGCGGAATCG TACGCCGTTG CCACCATACA GCACGGTGAG CGTGTCAAAT ACCTCGACCA GAAGACCCAG GAAGTAGACG AGAAACTTTC AAAGGTCTCA AATATGCTTG GAAAGGCCAA CGAAGCTCTT TCAACCGTCA CTGACATCGC CGAATCAGCA ATTATCAAGG TTGAGCTAGC TAGGGATACG ATTGTGGCTT CGGCCTTACC CTTTGTGAAG AACGCCGTCT TCCAAATGGC CGAAAGCCTT TTTCAGCGCA ACCGAACTTC CTCCCTGACT AATACGAGTC TATCGTCCCT TTCTCAATCC CAAGAATCAG CTTTTGTTGA AGAAGAGAAT CCCGAGAATC GCGTACCTCC CCCAAACAAA ATCACAAGGG GCAATGGCGT GCTCATTTCG AAGCACAAGC GCAAGCAGCC CGATCCCAAG GTACTCGCGA AGGGAGCTAA AAAGCACCGC AAATGGCCAA GTCTACCAGC GCGTAAGACC TCGACTGCGA CAAAGATATG TGCCAATGAA GTTGCTTGCT CTCCGTTCAA GCCACTCGAC TTCGTCACTG TAGAAAATTG CAAAGACGGC CATCCTGTTA CACCCTGCGG CAAAGGAAAG ATTGGCCACA TGAGCTGGTG GGATGTAAAT TCGGACGAAG AAGAGCTTCA TTGGGGCAGC ACTTCCACCA GTCCACTGTG CGTGTCAAAG ACTGCCACCA AAAGCGGATG CAAGCGTCGT TGTGCTGACC GATCTAATAG TAAAAAGCAT CGCAGCGCCT TCGGCAGTCG CAACACCGAA ATTTTGAACG ACATAGACGG CTTTCTCTAA
|
Protein sequence | MDGLPQGQSA PTTPSSKSMS KQTLAQSKTG YGRQGLSRGP TMMVPRPQRS RFADPPRRHC QPGRASRNGV ASIKQVSFCN ESIGVSQLTQ DSTSTTPPSF GGYPPFQQSW ASSCNTAQQE RSVSSYTYSS SASVTAGQGR PSLCPESSKS LHQSSLVSVA CSEKSKGLSS ASRAACNRNM LRSMLKPTFA MSRALVQRPA LLPIPHQRSL STQASIASAR QILTPSDKSN HAKVGPHIVA HGQSVIKKTS LDDDEKRTPN IGVSLDTHRL IKSIVFEEFK SHFSERALQV DEKENSIAKK LLEMRTIAES YAVATIQHGE RVKYLDQKTQ EVDEKLSKVS NMLGKANEAL STVTDIAESA IIKVELARDT IVASALPFVK NAVFQMAESL FQRNRTSSLT NTSLSSLSQS QESAFVEEEN PENRVPPPNK ITRGNGVLIS KHKRKQPDPK VLAKGAKKHR KWPSLPARKT STATKICANE VACSPFKPLD FVTVENCKDG HPVTPCGKGK IGHMSWWDVN SDEEELHWGS TSTSPLCVSK TATKSGCKRR CADRSNSKKH RSAFGSRNTE ILNDIDGFL
|
| |