Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49980 |
Symbol | |
ID | 7198658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 495023 |
End bp | 496769 |
Gene Length | 1747 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184812 |
Protein GI | 219129261 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAGATCACC TAGTCTCGCG GGTCGCTTCA ATTGACGTGA ATGGAAAGAC CTATCCAAAC GCCGGGAGGC GAGCCCGCAC CGCCGTGCTC CTCCTGGTTT GGATGCTCTG CTCTATCAAC AGCCGGTGAC AATGCAGAAA ATCGAAGTTT TTAAGGCCTA CCTTCATTTC CTTGTATTCG CTACTACACA GTAAATTTCA GCGTCAATCA GCGTAAAAGG ATAAAAAATA CATCAAGTGG TCTAGCCTGA TAAGCCTAGT TTCTTCGGCG GTGGTCCAAA TTATGGCGCT ATTGCGGATC GACCCAAACG TTTTAGGTCA AAGAAAGCGG AAAAAGATGT TGGCAATATT GGCTGTAGGG ATGCTAATAT CAGGACTCCA GTTCGGATCA GTGCTCCATC TTTGGCGATT TCAAAATGTC ATTTTCCCTG CCTCTCCAAA TTCTGATACG ATGCTTTACA CTGGAAGTAT TGTCACCAGT CTTCCTCAGC CCAGTGAGTC GCCGCGACCA ATTGCGAACA ACGCCAATGA CTTAAATGGA TCCTCTCTTG TATCGCAGCC ACACAATGCA ACAAACGTAG TCCCAAATGG GACTTCACTT TGCGAGGAAT GTTGGAAGAT TGTACAAAAG GCCCCTCGCG TCAACCTATC TTTTGCCCCG ATTCCGACGG ATAGTCATCC CTACATGGGA GCTCGCGATG CCAGAGGACA ATGGGGCTAT GTCCATAACG TCTACAACTT GCAACAGAAT CCGCCACATC CGCAAGGGTG GTCGTTTACA AAAGTCCTTG AAAATCGTAG ATACTGCGAC GTGCGCGACG ACCATTGGAC TGCCTTGCAG CGCATCCAAT TACCCAAGTT GCCAAGGACT GTTTCCGAGC TAACGCCGGC ATCCGCAACG AAGATCCTGT GTGCTGTGTA TAGTTCCGAG CCCTTTCACC ATAAACTCCA CGCAATTCGA GAAACTTGGG CTCCCAAGTG TGACGGCTTT TTCGTGGCAT CGAACTTGAC AGATGCATCG CTAGATGCCG TCGACATTCC GCACGAGGGG ATAGAATCGT ACAGAAATAT GTGGCAAAAG GTACGGTCGC TGCTATCCTA CGCGTACGCA AACTACTACA ACGAGTTTGA TTGGTTTCAT ATTGGTGGGG ACGATTTGTG GGTCATTGTC GACAATCTCA GGGAGTATTT GCACAGTGAC GAGATTCGAA TCGCCGCCAA CGGTGGGATT GAATTCGGAA GCCATTCTGC CTTTCTAGAC AATGAGACAC AAGTTCCGCT TCTTTTAGGA TGCCACTTTG CTCAAGGGGG CAAACTTTCA CAGCTGTATA TAACCGGAGG ACCAGGCTAT ACACTGAACA AAGCAGCACT GAAACTACTT GTCACAGAGG GAATGGACTA CTTTCAGCAC AAAATTACTT CGACGGAAGA CGTACTTGTA TCTCGAATCT TCCGCACACT AAGTGTGGAT CCCTATCCGA CCCTAGATCC CGTCGGAGCA GAACGTTATC ATCACTTTAC TCCTGGTCAG CACTTCAACG CAACCAGGAG GATGTATCAT TGGTACCACG TCTGGAAACG GCCATTCCCG AAGAACCCCA TTGGTCCCAA CCATTCCTCA ACACGGAGCG TGGCGTTTCA CTCGGTTAAT AGTGAAGACA TGCGGCATTT TCATGTTCTC ACGGAAGGTC TCTGCAATTC ATAATAGACC ATGTAAAGTC AAATAGTGTA CACAGATTTT CTAGAAA
|
Protein sequence | MALLRIDPNV LGQRKRKKML AILAVGMLIS GLQFGSVLHL WRFQNVIFPA SPNSDTMLYT GSIVTSLPQP SESPRPIANN ANDLNGSSLV SQPHNATNVV PNGTSLCEEC WKIVQKAPRV NLSFAPIPTD SHPYMGARDA RGQWGYVHNV YNLQQNPPHP QGWSFTKVLE NRRYCDVRDD HWTALQRIQL PKLPRTVSEL TPASATKILC AVYSSEPFHH KLHAIRETWA PKCDGFFVAS NLTDASLDAV DIPHEGIESY RNMWQKVRSL LSYAYANYYN EFDWFHIGGD DLWVIVDNLR EYLHSDEIRI AANGGIEFGS HSAFLDNETQ VPLLLGCHFA QGGKLSQLYI TGGPGYTLNK AALKLLVTEG MDYFQHKITS TEDVLVSRIF RTLSVDPYPT LDPVGAERYH HFTPGQHFNA TRRMYHWYHV WKRPFPKNPI GPNHSSTRSV AFHSVNSEDM RHFHVLTEGL CNS
|
| |