Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41268 |
Symbol | |
ID | 7198991 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 344924 |
End bp | 346427 |
Gene Length | 1504 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185178 |
Protein GI | 219130031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.577371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCTC AGCTCTTTCA GGAACTAGCC CTGTCACAAC TGGAGCTTTT GGCAAATTCT ATTCCGTGTG CTGATCGCCC TGGCGTAAGC AAAATAAAAA CGATGGCTTT GTATCTTCCT CAAGAGAATG TCAACACGGG TCAACTAGAA TTTCTTCCCG CAATTCACTA CCCTCACCCG AGTCGAGAAA GAGTTTTTAT CGCCAACGAG GCTGCGTCGG GCGTCGCTCC CGCACTACCT CGGACTCTGA CGACATTGCC TGGATTTGCT CATGCTACTT CTCTGCTTCC GGGCTATCCT ATGGTATCTG GTGGTTCCGA AGCCAGTGTC GGCGTCGTTG AGGAGGTTAT CTGTGACCTT ACCTCGGGGG CAGCAGCTTT GTCTGTTCCG TTATTCTCTG GATCCCGGAC GGTCGGGGTA CTCCTTGTGT CTCCTTCAAT TTCAAAACGC AGAAAGGGAA GCGCCTGGAC CAAAGAGGAT CGAGAGCAAG TCGGTAGAGC CGGGAAGAGC CTTTCGTTGG CGCTTTCGAT GGACACGGAA AGAGCAGCAT TGCAAATGCA AAATGATCGA GCGGCTCGCG CCTTATCCGA TAGTCTACAT CAAATCAAGA ATCCGCTCCA GGCTATGCGA ACATACGGAA AGCTGCTACA ACGGAGGATA GCCGACCTTG GACTACTATC AAACGAAGGT ATGACACCTC AGCTTTTGGA AATGGCGGAA CATTTGCTTG TGCAGAGCGA TCGGCTAGCT GACAGATTAA AACCGGTGGA TACGATCGTA GATTCGCTCA GCGAGCGAAC GCCGTTCGTG CTGAATCCTG CAGCTCCCAC GAAAACCCAG GATTCGCTCG TGAGCTTGGC TACGCCGCTG GTACCATGGG AAAGTGAGAC CTTGGAGTTT GCTCGCGAAT CGAAGACATC GGGAGAACTA GTAATATTCA TGCCTACGAA GAAAGTTGCT GCATCTGGAT CTAATGGCCC AAGCAATTCC ACGACATCCT TTTCTGGGAA TGGCAGCGAT GTATCTACAT ACACGGAGGA CGACGGCGCT GGTGAGATGC CATCTTCTCT GTTCAGTGAG ATGGATTTGG AAATGTCTTT TTTGAGTGAC GTTCTTGACC CTGTGCTTGC CGCCTTTCGT GCTATCGCAG AGGACCGGGG GATTATGTTT TCTCATGATG ACACTGAGGA TCTTCCTGGT GTAACGATAT GTCCCCAGGC ATTGCAAGAA GCACTCATCA ACGTGATCGA CAACGCGTTT ACATACGTCT TTTTGCCTAA ACCTGGATCT CGTTTTGGAC CAAATCCGTC TGCAGAAGTT CGAATACGCC TCGTACAGAA CTCAAAAGGC AGTGACGCAG GCGTGACAAT CCTAGTGGAA GACAATGGTC CTGGGATTCC AGCTGAAACC AGAGATCAAA TTTTCGATAG AGGAGTAAGG TCGGACTCTA CTAGATCCAT AGAAGGATCA GGCATTGGAC TGGACATTGC GAAAACACTG GTGA
|
Protein sequence | MPSQLFQELA LSQLELLANS IPCADRPGVS KIKTMALYLP QENVNTGQLE FLPAIHYPHP SRERVFIANE AASGVAPALP RTLTTLPGFA HATSLLPGYP MVSGGSEASV GVVEEVICDL TSGAAALSVP LFSGSRTVGV LLVSPSISKR RKGSAWTKED REQVGRAGKS LSLALSMDTE RAALQMQNDR AARALSDSLH QIKNPLQAMR TYGKLLQRRI ADLGLLSNEG MTPQLLEMAE HLLVQSDRLA DRLKPVDTIV DSLSERTPFV LNPAAPTKTQ DSLVSLATPL VPWESETLEF ARESKTSGEL VIFMPTKKVA ASGSNGPSNS TTSFSGNGSD VSTYTEDDGA GEMPSSLFSE MDLEMSFLSD VLDPVLAAFR AIAEDRGIMF SHDDTEDLPG VTICPQALQE ALINVIDNAF TYVFLPKPGS RFGPNPSAEV RIRLVQNSKG SDAGVTILVE DNGPGIPAET RDQIFDRGKD QALDWTLRKH W
|
| |