Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22319 |
Symbol | |
ID | 7203397 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 701999 |
End bp | 704132 |
Gene Length | 2134 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182745 |
Protein GI | 219124928 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGCTCTG TTCCGTATCA CCATCCTTTT ATTCTTGTTC CGCCTTTTAC CCGTCGACCT TGATCAGAAA TTTCCCTTGT TGTTCTTTTA GAGATACAAT CATGACCCTA GTCGCCGCAG GTACGAAATA ACGGCCCTTA GTATCTCCGC TATTCGAGGC ATTGTATTCA GTCTCTAATG CCACAATTGT ATACCTGGGC GGCTTTAGTT CCTCGTCGTG ATGCGTCATT TAAGAAGACC GTGTCAGCTG AAGACGGTCG TCGATCTCGG GAAGAGACAA CTTTGCATAT TCGAAAAAAC AGAATGGCCG AGCGCATGGC CAAACGTCGC ACAGCACCGA TGTCAAACCA AAGCACAGCC ATTGACAGCA ATATCGATAG TGCAGCCCAT CAACCCGTGC CAACCACTGC AAAAGAAATT GACATTGCTG TAGCTGGCTC TGTGCAAACT TTCCGCGCAT GGGAGGCAAA TCCTGCGTCG GTGAACGACG ATTCTCTAGC AGACGCCACC AAAGCAATTC GACGACTATT GTCAAAGGAA CGAAACCCAC CTGTTGTCCA GGTAATTCGA TCGGGAATTA TCTCTTTTTT GGTACGATTC TTGCAGGAGG ACAAAACTGA CGCTCCACGT TTGCAGTTTG AATCGGCATG GGCACTCACG AACATTGCTT CAACAAGTCG GACATCGGAT GTGGCCCATA GTGGTGCCAT CCCACATTTG GTCCGTATTA TCAATCATGC AGACTCAGCG GAGCTTCGTG AGCAAGCTGC TTGGTGCATT GGAAATGTGG CTGGAGACTC CATTGAGAAC CGAGATGGTC TCCTTGCCAC GCCGATGTTG ACCAACGGTA TGTTGAAAAA TATATCTCGT CCAGCGAACC TCTCGCTACT GCAGAACTTT ACCTGGGCAA TTTCCAATTT ATGTCGATTT AAACCATCTC CCGATTTTGG TGCTGTTGCT CCTTTTATAA GTGCATTGGT AAACATTCTC AATGAAGTTG AGAATGGTGT CGGTGCCGAA AGCGCCGTAG ACCTACAGAC GGACACACTC TGGGCGCTAT CGTACTTATC GGACGGCGAC GACGCTCGAA TCAACGCGGT TCTTGAAGCA GGTGTGCTCA AGCTTTTGAT GAATATTTTT GACAATGATA GTGTTCCTTC CAATACCCTC TTGCCGGCCG TTAGAGTGAT TGGGAATTGT GTGTCAGGAA GTCATGCTCA AACTGACGCG GTGATCCGTG CAGGTTTCAC CGAGCGGGTA GGAAAACTAC TCCATCATCC ATCGGTGCGT TTTTTTTCTA GCTTTATCTG TACCATTTAT GAACACCCTA GAAACTCACT GGTCCTGTAT TGTTTAACAG AAAACTGTTC AAAAAGATGC TTGCTGGGCG GCGTCAAACA TCGCTGCCGG AACCGAAGTG CAGCTTAATG CACTTCTTGC TGCAGAGCCT TTGAATGGAA TTCCAAAGCG AATTATTGAG ATGGCAATTT ATGGGAACTT TGACATTCGG CGCGAAGCTG TGTACACGAT TTCAAACATT TTGACAACAG GGCAGTTTCA TGAGACAAAA ATCATGGTGC AGGAGGAAGC TTTGAAAGCT TTGTCGAGCA GTCTTGAGCT GAAGCAAGAT TCTCGTATGT TGATTGCAGC TATGGAAGCC TTGGAAACAA TTTTCGAGTA CGACGCAAAG TTCAATCAAA ATTACTGTGT CCTTTTCGAT GAAGAAAAAG GTATTGACAA GCTGGAAGAG TTGCAGCAGC ATTCTATAGA CGAGGTCTAT GAAAAGGCCA GTGCCATCAT AACACTCTAC TTTGGAGTAG AAGACGAAGA TGATTTGGAT GAGAATATGG TGCCGAATGA GAATGGTGAC GGAGAGCTCG CCTTCGGCTT TCCAAAGCAG CTATTTCCGG AAAACCAACA ATCCGGTGCC CCGACGTTCG ACTTTTCAGA TGTCGATACA CAAGGGTCGC CCGCAATCCT GGGACAAATT CAGCGTTGAA TATCTAGTTC ATACGAAAAT CGAATTCTGT CGGCGTCGTC TGTCTACGGA AACTTGATCA CAACCGTGTG TAAAAAAAGA CGATTAGAAT CCCTCTACTT TCACTAACAT AAAACTAAAT CTCCTTAAAG AGCT
|
Protein sequence | MTLVAAVPRR DASFKKTVSA EDGRRSREET TLHIRKNRMA ERMAKRRTAP MSNQSTAIDS NIDSAAHQPV PTTAKEIDIA VAGSVQTFRA WEANPASVND DSLADATKAI RRLLSKERNP PVVQVIRSGI ISFLVRFLQE DKTDAPRLQF ESAWALTNIA STSRTSDVAH SGAIPHLVRI INHADSAELR EQAAWCIGNV AGDSIENRDG LLATPMLTNG MLKNISRPAN LSLLQNFTWA ISNLCRFKPS PDFGAVAPFI SALVNILNEV ENGVGAESAV DLQTDTLWAL SYLSDGDDAR INAVLEAGVL KLLMNIFDND SVPSNTLLPA VRVIGNCVSG SHAQTDAVIR AGFTERVGKL LHHPSKTVQK DACWAASNIA AGTEVQLNAL LAAEPLNGIP KRIIEMAIYG NFDIRREAVY TISNILTTGQ FHETKIMVQE EALKALSSSL ELKQDSRMLI AAMEALETIF EYDAKFNQNY CVLFDEEKGI DKLEELQQHS IDEVYEKASA IITLYFGVED EDDLDENMVP NENGDGELAF GFPKQLFPEN QQSGAPTFDF SDVDTQGSPA ILGQIQR
|
| |