Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49872 |
Symbol | |
ID | 7198504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 144544 |
End bp | 146661 |
Gene Length | 2118 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184742 |
Protein GI | 219129115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGGAAACAG GCATTCCACC ATCGCGCGCC TCTGTCCAGT GCACAGCCCA CTCGACTAGG GGAAAAAACC CGACATCCCA AACCCCCCAT TGCAAACCCA ACAGACGACA CCTACGCTTC CCAAATTGTA TATATTTGTT GCTTGGGTGG CGTGTACGAA TCTTGAATCT TCGAGGCGTG CCACAACGGC ACCCGAACGT GTGGACTTTT GGTCACAGAT ACTGGAGACG TTGTTGTTGG TGAACGACAG TGTGCGACAA AAAGGCGTAC GATTTTCCCC ACTCTGACCG ATCGTTTCGT GGCCAAGTGG AAATCACTGT CTGCTGTGTT TGGCCCCAGT CCATTGGGTG TGCGTACTGC AACGACGACG ACGTATTTAC GGATTCGATT CATTTCGAAT TGGTGATTCG AGGCAAAGCC AAAGTTCGGA ACGAATACGA AGATCAATGT CCGGAATGCG CTCGACTCGA CGGTACAATT CCCGGCAGTC GTCGTCCTCA CCGCGTGGAG TCCTTTGGTT CGGCAGTAGC ACCATTCCCG TGGCGGTTGT ACTGACTCTG GTGGGGATCG TGATTCTCCC GGCGTTCCAG CACCCGCACG GGTGCTGCAC GGTCGACGCC TTTGCCTCGT ATCTCTTGTC CACGACTGGA TGTAGGACGG ATTTGGATAC TACCGAAGTC ATTATGAATC AGCTGGTAGT CGCGGCGGGT GAGGAAGAAA CCGCCAATAA TGAGGTCGTC GATGGGGAGG AACCGATTCC TTCCTTCCAC GTCGTCGTTG CGGACCACAC GGTAGCCGCC GACGGGAGCC TCACCATAGC CACCCCAGTG TCCTCGTCGC ACCATCCAAT CCTGCTGTCC CTACAAATTG TACCGACACC ACCGTTGTCA CCACAATCCA CACACGTCAA GGACTACCAG TTCGTCGTCC AAACAACCGA AGGCTGTCGT TTCGTCCACG GCGGTTGTGA CGACGAACGT CGCATCGCGG GACGGGGATC CGAGACGGTA CAGCTCGCGA TTCCTCCGCC GACGACGGAA TCGGTGCGGG GAGCAGACGT GTGTACCGTG TGGGGTGGCT GGGCGGCCGG ACACCACGCG GTACGATTGA CTCCAGCCGT CGTGATACGA CTACATGCGA ACGAACACAA CCATCCGGGG GATGCGCACG TGGCGGAACC GACGTGGTTC GAGGTTGGGA CGGAACAAGG TTGTACGGAT GGTGGTACAC TCGTCGACGC CGTGGGGATG GGATCGCGAC TCGTCGTGGA GGACACGGAC GATCGGTACG GGAAACTCGG GGTGAAAACG GAGGTGGACG CTGTACCGGC GGAACTGTCT CTGTACTGGT CACCGCGGCC CGGTGGGGAC GCCTCGGTCA ACGAGTCGAC CATCGACACG TTGGTGCTGG AGACCAGTCC GGGAGCCACG TTTACGGACG GAGCGTGTGG AGGGAAACGG ACCGTCATCA CCAAGGAAAT GCTGTCCAAC ACCGCAGCCT GGCCGCAACT GACGATACAC ACCGAGCGAA CCGTTTCGGT GTACGGGGTG TATGCACTCA CCGGACCCGA CCACGTGGAC AAACTCTACC GTATGGACAC ACTGACTCTG GAATGGAGTC CGCCGTCGAC TGACTACCGA GCGCGTAAAG AAGCCGGGGA GAAATCACGC AATCGCCGGA ATTCGTCACG GAACACACAC CGATCGCCCA AGTTGCCCCG TGGTACACCG GTGGATCCGC AAAGAGCCAT TGACGCGGCC GCCCGCCGGG ACGCCTCCGA CATTCAAGCA CAGGTGGCAC GACACAACGC GGGGCATCGG AAAGAAGAAG GCGTTCACCC GGGAGAAGGG AAATCCCGCG AGGGCAGACG TCGTTTTTCG CGACGGATGA CTGGAGAAGC CTTGCGTCGG GAACCGCAAC TCTCTAGGCC CTACCCTCCC AGGAATCTTC GCCACCGACC GTTCCCGGTC GTGGAGGGAA CGGAGTATTG CCTGGCCATG GCCTTCTTCG TTGCCGCGCA CGTATTTGTC ATACAATTCT GTCTCATTTG TAGTCAAAGA CCTAAAGGGC GGAGAGTATT GTAGTTTAAC TGTAAGTTGT GACTTTTAGT ATAGCGTT
|
Protein sequence | MSGMRSTRRY NSRQSSSSPR GVLWFGSSTI PVAVVLTLVG IVILPAFQHP HGCCTVDAFA SYLLSTTGCR TDLDTTEVIM NQLVVAAGEE ETANNEVVDG EEPIPSFHVV VADHTVAADG SLTIATPVSS SHHPILLSLQ IVPTPPLSPQ STHVKDYQFV VQTTEGCRFV HGGCDDERRI AGRGSETVQL AIPPPTTESV RGADVCTVWG GWAAGHHAVR LTPAVVIRLH ANEHNHPGDA HVAEPTWFEV GTEQGCTDGG TLVDAVGMGS RLVVEDTDDR YGKLGVKTEV DAVPAELSLY WSPRPGGDAS VNESTIDTLV LETSPGATFT DGACGGKRTV ITKEMLSNTA AWPQLTIHTE RTVSVYGVYA LTGPDHVDKL YRMDTLTLEW SPPSTDYRAR KEAGEKSRNR RNSSRNTHRS PKLPRGTPVD PQRAIDAAAR RDASDIQAQV ARHNAGHRKE EGVHPGEGKS REGRRRFSRR MTGEALRREP QLSRPYPPRN LRHRPFPVVE GTEYCLAMAF FVAAHVFVIQ FCLICSQRPK GRRVL
|
| |