Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45024 |
Symbol | |
ID | 7199532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 1037921 |
End bp | 1040060 |
Gene Length | 2140 bp |
Protein Length | 560 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178901 |
Protein GI | 219116212 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAAGGGCT TCACAACGAC GACGTTTTCC CCCACAATCC CGACGACACC TAGCGAAACA CATTTCTTTG TACTTTACGC CAGTCCCTTG TTTCCGACCT GATTCACAGG TCAGCGACTG TGCACGAGGT GATTTGGTGA TTTGTCTGTG CCGTGAGAAA AAAAGTCAAG ATAAAAAAGC AAGCTTTTTT CCACAGCATG AGAAGAAGCG TGTGGTAACA TTTATCATAT TGACTGATAG ATTGATTGAC TGACTGATTT GTCAACCAAT CGATTGCTTG ATTCGTTCAC GTACTGAACC AACCGACTGG AGAACACGGG CTGACTGTGT ACTTCCTACA AACCTGCCTA CCTGCCTATC TATCAACTTA CCTGCATGAG TGTTTTCTAC TGCGGATTAC CTGTTGATTG CAAGACTCGT GGTTGATTCG AACCTTTCTT CCTGTGACTA CACCGACATG AGTGACCACC GCCATTCCCT TTTGCCGTCA CAGGCGTCCA ACCGAAGACT ACGCACGCCT CCACAAAGCA CAGCCTTTGT CGCGCTGGAA ATAGAATCCT TGGTCTTGAA CCCATCTTCC CAAGGCAGTA TCCACAATCA AAACGTCGCC ACCGCCACGA ATACGACCAT TGCGGATAGA TCCATGTTCT CCTACTCGCC AGCCACACGC ATGTTGTTGG AAGAGTCCAT GCGTTCCATC ACCAACAGTC GCTGGCACGT GCGATCACTC GTCAAGCTTA TTTGTTTATT AGTGGCCGTA GCTGTTGCCT TGAACGTGAT TGCTAGTACC AGAATGGTAG TCCGTGGCGA ACGTACCGAA GACAAGAACA AAACCGGAGT ACTGCCCGAC CACAACAAAA AAAGCAACAA ACTGGCACCA ACCGAAGCGC CAACCGTACT ACCTTCATTT TCTCCAACCA CTTCACCATC CACCCTTTTA CCCACGATGG TACCGCTTTC ACCAACGTCG GCAGCCACAA TACCACTCAC ACCAGAAGCA CAACAGCAGG AGCAACAATC TTCATCGTCC AAAGCGTGGA TTCAACTGCA ACCACGCGAT TCCGACAAGA AGCCCTACCC GCCTCCTGGA AAGAATGCAT CCCTCACTTT TCAGTCCCGA TGGTGCGATT TGCAAGGCCT AGCCTTTGGT ACCGAAGCTG ACTGGAACCC GTCCATAGCC ACGAAGCGCG GAAAATTCGC TCCCAATATC ACAAACGCAA CGAATGCCTG GCAACTCCGT GCACCGTCTT TGCTGCTTCC CGGCACCAAA CACGCCGGAA GTGAAGCACT GGGTGCGTTG CTGGCAACGC ATCCTCTCGT GTTACGCCAA ACCACACAAG GATTCTTCTT CGACCACGTC TTTCGAAAAT TTGTTTCCGC TAACGAAAAA ACCCGAGTCG CGGCTGCGCG GCATAAGATG TACGCTTCAC ATTACGACTT AAAAGCTATC CAAGCTAATC CTGCCCTGCG GAGTGTGGAT GTTACACCAG GGTATCTGCT TTACAGTACA TTGTTGCCGC GACGTGTGTT CTGTGTCACA CCATGGATAA AACTCTTAGT GGTGCTACGG AATCCTACGG ACCGTGTATT CGAACACTAT CAAGCGGCCC GTGCCAAGGG TTTGCCTTTG TCGCTCAAGG ACTGGTTGGA CAAGGACTTG GCGATTTTGC GCAAACATGG CTTGGTGGGG GATGCCACTG GGATGGGTAA CGCCACGAGC ATAGTCAAGC ACGGGTCGTC CGAAGAAGAC GTGGCCTGGC TCAAGTACCA GGACGAATCA CTGGAAGGCG TCATCGGACG ATCAATTTAC GAAATACAGT TGCGGCAGTG GTTTCAGGCA ATGCGAGCGG CCGGCAAAAA TCCGTCCAAG GATGTGATGG TAGTATTGAG TGACGATTGG TCACAAAATC CGTCTCGAAA ATATCAGCGT GTGCTAGAGT TTGCCAATCT ACCCCACGAT GACAATGTGG TGGTTCCTAC CACATTGTTG GCTTCTACCT GGCGTCGTAC AACTGGCATA AACCAGACTC TAGCAAGCTC AGGATCTACG GCCACACGCG ACGAACTGCA AAAGTTTTTC CGGCCTTACA ATACCAGACT TGCCTTGCTC CTGTCCTCAT ATGGAGTCAC GGTGTTATAG
|
Protein sequence | MSDHRHSLLP SQASNRRLRT PPQSTAFVAL EIESLVLNPS SQGSIHNQNV ATATNTTIAD RSMFSYSPAT RMLLEESMRS ITNSRWHVRS LVKLICLLVA VAVALNVIAS TRMVVRGERT EDKNKTGVLP DHNKKSNKLA PTEAPTVLPS FSPTTSPSTL LPTMVPLSPT SAATIPLTPE AQQQEQQSSS SKAWIQLQPR DSDKKPYPPP GKNASLTFQS RWCDLQGLAF GTEADWNPSI ATKRGKFAPN ITNATNAWQL RAPSLLLPGT KHAGSEALGA LLATHPLVLR QTTQGFFFDH VFRKFVSANE KTRVAAARHK MYASHYDLKA IQANPALRSV DVTPGYLLYS TLLPRRVFCV TPWIKLLVVL RNPTDRVFEH YQAARAKGLP LSLKDWLDKD LAILRKHGLV GDATGMGNAT SIVKHGSSEE DVAWLKYQDE SLEGVIGRSI YEIQLRQWFQ AMRAAGKNPS KDVMVVLSDD WSQNPSRKYQ RVLEFANLPH DDNVVVPTTL LASTWRRTTG INQTLASSGS TATRDELQKF FRPYNTRLAL LLSSYGVTVL
|
| |