Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49881 |
Symbol | |
ID | 7198512 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 177828 |
End bp | 179644 |
Gene Length | 1817 bp |
Protein Length | 423 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184666 |
Protein GI | 219128956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGTGCTTT GGAGAAAAGG TAACAGTTAC CAACAAGCTG GCGCATCTTC TTTCGACGAC AAAGCACATC GCCGGTGTCC AACTTGGCTC ACAGTCAGCT CACAAAGTCT TGCAAAGCGT CCTTAAGTTA CCAAGAAGAA TTCATATAAA ACCTACCATC GAACAGGCTC CTCACAATCA TGAGACTTGC TGTTGTTAGT GCCACCGGTA CGCTCGGTTT CTGCCGCTTG TTCCTCGGGA TCCTGCAAGC CGCAATCTTG GCTTCGTTGA CGTTCGCACA CGCCGACGAC ATTGGCGACA ATTCAATCTT TCAGTCCACA ATGCTGGAGG CGCAATTGCC GAAACCCCTC TCGGACATGA CGGCCACGTA CTTGGGTAGC AGTGACGATG GGGTGTACAT TGCCGGCGGC TGCGACGATC CGAACGGCAA CGTCTTTGTG GATGATCCAG ACTTTCCCGG ATTCTCGTGC GGTTCCATTT CCGATACGCT CTACCGCTTC GACGTGTCGG AGAGCACCAT TCGTGAAGTT GCAAAAATGC CACGACCGCG ATACCGTCAC GCCGCCTTTG GAATCCAAGG CCGACTGTGG TTGGTTGGTG GACGGGATTT GAATGATGAC ATCGTGGTGG AAGTCGATGT GAGTACAGCT ACTGTGCGTG GTAGAACCTT GCTATTCATA CCGTCATGGC TTCCCTCCGT GCTCGTGGCG CAGATTTGTG ATTGCTGTCT TGCGTTGATT CTTTTTTTTA CCTACGTCTC GCTTTCTCAC CCGTATACTG GTTGCCGTTG TTACTTGCTG ACTGTCCTTG GTGTTGCTCG ATCCAGTCGT ACGATACCGC TACTGATACT TGGACGACAT TTACGGATCT TCCCGTCTCG CTCGCTACGA GTGATGTGGG AGGTTTCAGC CACGGCGACT ATGGCTACAT CGTCGGTGGA TATAGTCAGG ATTACAAGGC ACAGGCCACG ACCTGGCGGC TCTTGACCAC CGCCGCACTC GTCAATAACT CCTTGGCGAA CGACGATGCG GTCGAAATCG TCAATGGCCT GGCGGAAGCT CGTGGAGACG TGGCCGCGGC CTTTGACAAC AACTTTGGCT ACGTTTCGGG TGGCTTCACG GACGTGAACA AGTTTTGCGA GCCGCTCGCT TCCGTGTCGG CCTACAGCTT TGACTCCGGA GAATGGAGTA GCCTGCCCGA TTTGGCCTTT GCCCGGGGCG ACAAGGGACT GGCAGCGTTT GACGGACACA TCTACGCCCT AGGTGGAGAA CGACAAATTG AAAACATTTG TGCGGTGACC GCCGATACAC CCGATCCGGG GGATTTGACC GTGGCGGTGG ACGTAGCGGA ACGTTTCGAT CTGGGTGAAG GGAACACCGA ATGGACCTTA TTGGACGATT TGCCCATTCA TCGATTCCGA TTTGCTGCCG TAGCAGTTCC TGCTCGACAC GAAGTTCTCA CGTTCGGTGG TCAACAGGCG TACAATGCGG ACTGTCAGTG TTTGGCTGCA ACGACGGATA TCGTTCAGTA CAAGGAAGTT CCCCAGGAAA GCGGAACTTC GGATGCTGGA CATGGTTACT GGCTGCTGCC GATGAAAATG ATTGTGTTTG CGATTACGTA TATGACATTG ATTGCGTAGT AGAATTCTCG TTTGTTTGGA ATGTTACTGG CCATACCGAG AAAGTTTCTT ATATGTGAAA CGACACAAAC AGCGCTATCG GTATGTACTC GCAAACAAGA CAATTTTTGT TAGTTGATTG GGGGCTAGCT CACTGTCAGT TTGAAGCCAT TTTAAGAGTA CAAAGCATTT GACTGTA
|
Protein sequence | MRLAVVSATG TLGFCRLFLG ILQAAILASL TFAHADDIGD NSIFQSTMLE AQLPKPLSDM TATYLGSSDD GVYIAGGCDD PNGNVFVDDP DFPGFSCGSI SDTLYRFDVS ESTIREVAKM PRPRYRHAAF GIQGRLWLVG GRDLNDDIVV EVDSYDTATD TWTTFTDLPV SLATSDVGGF SHGDYGYIVG GYSQDYKAQA TTWRLLTTAA LVNNSLANDD AVEIVNGLAE ARGDVAAAFD NNFGYVSGGF TDVNKFCEPL ASVSAYSFDS GEWSSLPDLA FARGDKGLAA FDGHIYALGG ERQIENICAV TADTPDPGDL TVAVDVAERF DLGEGNTEWT LLDDLPIHRF RFAAVAVPAR HEVLTFGGQQ AYNADCQCLA ATTDIVQYKE VPQESGTSDA GHGYWLLPMK MIVFAITYMT LIA
|
| |