Gene Information |
Locus tag | PHATRDRAFT_45009 |
Symbol | |
ID | 7199678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 985711 |
End bp | 987359 |
Gene Length | 1649 bp |
Protein Length | 498 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179103 |
Protein GI | 219116616 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATACGGTT GCAATTGATA ACGGTTGCAA TTGATACGAT TCCGGTGTGT CTTTATTGTG TATTTCAGAC CCACATATCC TCTCCTTTCC CCACACTGCA CAAAGCACTC GAACAGAAAG AATTTCCAAA AAACAACCAC CACAAAATCC CAATGCAGAC AACGGCGCGC CATACGTTGG TGACACTCGT GGAGGATCAC CTGACCGTGT TGATTGCTGA TGCCGTCACG GTGGGACGAC ACGCCAAGCG TGCCCGAACC GTCACCAACA GTAACTCCAA CAATAACACC ACGAACGGGA CACCCTTGGA CGAAGTCGCT GCCGTGTCCA AAGGAACATC TTCCAGTACG GGGGGAGGAG TCACGGTACG CCGACGTTTA CACGCGGCCG ACATTAATCT CGCCTTGCAA ATGCGACAGT CGGAAAAACT TTACGCGACG GCTCTCGTGC CACCGGACAC GGTACATCCG TCGTCTATGT CGACAACAAC CACCACGGAC AATCCCGACC ATTCCTTACC CGCGTCGCAT CGTCCCGTCA ATCTGGCCGA CTTTTTGCGA CACGCCCAAC TCCCATTCCA ACAACCCGCC GAAGTCGCCT TGCACGTGTC GTGGCTTGCC GTGGACGGCA TTGCTCCGGA ACCGCATCCG CACGGGGTTG TCGGGGCGTG GACGGACCGA CACACGCCCT CACCCCATCC CCTCGCCATG CCACACGAAA ACAACAATCA CCACAACCCA TACCAATCCA CCGCTCCCCA AGCCTGGCTC GTACAACAAC TCCAAGCCGC CATGTTGTCC GAAGAACTAC AACTATACTT TACCCGCGTC ACCTACGCTC TGGATAACAC CACCAATACC CACTCCCCGA CCTCCGCACG CGCGCAAGAT CGACTCCTGG ATCGACTCGC CGTCGACGCG CATTTGCAGG AACTCGTGCC CTTTTTCGCA CGCTACGTCA CGCAGACTCT CTACGCATCG CACGTGACCC ACCAACGTGC CGCCGTACGA CTCGTGCAAG CCATGCTCCA CAATCCAACC CTGCATCTCG AACTCTACTT GCACGAACTC GTACCAGCAC TCTTGACCGC CATTGTGGCC GATCACCGCG ACCGGACGAA CCAACGGACC TCCGTCGCCG TGACCGCCAC GCCGCACTGG CGTTTGCGCG TCGAAGCTTC CGTCGCGCTC CGCACCGTCT GTCGACAATT CGGACCCGAA TACCCCACCC TCCAAGCACG GGTACTCCGG ACCCTCTGTC AAGCACTCGG ACCGGACCGG TCCCGACCGG CCGTCTTTGG CGGTCTCACC GCCGTCACAC TCTTTGGACC ACTCGCCATC CAAGCCTTTG TCCTACCCAT GCTGCCGCAC GCCTGGAATG CCTGGGAGGA GGAAGCACAG TCGTCCGCTA CGGAAGAAGT GCAGTGGGAG ACCCGACAAT GCCAACAAGC CGCACTCGGT GCCCTCGGGA CGTGGTTGCG GTCCTACGCA CCAACGGCTC CGCAGACGTT AACTGCTGGA CCGGCAGAAC AAGTTGCCGC GACCGACGTC GCGCATCCAC TCCTCGCCGA CACGTGGGGA GATGCCCTGG TACCCTTGCA AGGCTACGGA CCCGACGTGC CCACCGATTA TACCCTGTGT GTACTCTAA
Protein sequence | MQTTARHTLV TLVEDHLTVL IADAVTVGRH AKRARTVTNS NSNNNTTNGT PLDEVAAVSK GTSSSTGGGV TVRRRLHAAD INLALQMRQS EKLYATALVP PDTVHPSSMS TTTTTDNPDH SLPASHRPVN LADFLRHAQL PFQQPAEVAL HVSWLAVDGI APEPHPHGVV GAWTDRHTPS PHPLAMPHEN NNHHNPYQST APQAWLVQQL QAAMLSEELQ LYFTRVTYAL DNTTNTHSPT SARAQDRLLD RLAVDAHLQE LVPFFARYVT QTLYASHVTH QRAAVRLVQA MLHNPTLHLE LYLHELVPAL LTAIVADHRD RTNQRTSVAV TATPHWRLRV EASVALRTVC RQFGPEYPTL QARVLRTLCQ ALGPDRSRPA VFGGLTAVTL FGPLAIQAFV LPMLPHAWNA WEEEAQSSAT EEVQWETRQC QQAALGALGT WLRSYAPTAP QTLTAGPAEQ VAATDVAHPL LADTWGDALV PLQGYGPDVP TDYTLCVL
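The derived fields in this record (Gene Length, GC content, Protein Length) can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported 1649 bp, and that the coding region ends in a single stop codon. The function names `gene_length`, `gc_percent`, and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


def coding_length(protein_aa: int) -> int:
    """Coding bases implied by a protein length, assuming one stop codon."""
    return 3 * (protein_aa + 1)


# Coordinates taken from the record above.
print(gene_length(985711, 987359))  # 1649
# 498 aa plus a stop codon.
print(coding_length(498))  # 1497
```

Under these assumptions the 498 aa protein accounts for 1497 bp of the 1649 bp gene, and the remaining 152 bp lie outside the coding region. Passing the Gene sequence field to `gc_percent` gives a value to compare with the reported 59%.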