Gene Information |
Locus tag | PHATRDRAFT_37399 |
Symbol | |
ID | 7202297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 20461 |
End bp | 21516 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181471 |
Protein GI | 219122270 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000156319 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGTG AAGACGAAGA CGTACTACAC AAATATAGAA AATATGATCC GCCGCCTCAA GAGTTGCCAG TCTACTCGTC CGATGAAGAC AGTAGCGGAT CATCGATATT GACTGCCAAC GCAGAGCCAC TCACCGCCAA GGCCGATTTG CTCAAAGTGT TGGACTACAT CATCATTGTT GGTAACCGTA ACCGATATCA AAGCTTACAA GCAACTAGGG AACAGATTTC GGAGACAACT ACGGGGTCAA ATGAGCTGTC AGAGATTGAA CAAACGGATG ATTTTGAGCT TAGTGGAATG CACGCTGATG ATTTCCAATT AGACCCGCAG AGAAGAAAGA AAAGTCGTTT GAGTGCAGCT TTGCTGTGCC CATCGTATGC CGCTTGGGAA GAAACTAACA GAACTGATGC GCTGGATTGC ACATGGTGTC ATCAATACAG TGACTGTATC GCGTTGCCAT ACAATAAGGT TCGATCCAAA GAACCACCAG CTTTGAAAGG TTTTTTTTAT ACGCGCACAG ACAGAGATTT GACATTGCGA TGCTGGAGGA AAGCTGTTCA TGCAGCATCA ACAGTCATGG TGCTGGAAGG GGTCAATGGG AATGTGAGGC TTGACCTGAC CTCAGCCAAT TCAGCAGAGA AAGACGCCAA GGAGACATGC CTGAACCTAG GAATAGCTCT TTCGAATCTA AAACCATATA CTTGTCCATC TTGTTTCAAG GTCTTCCAAA GTTGCAGAGC CAGGGAATGT CACTTCTGGG GAGAAAATAA CCACAGAGGC TGTTGCTGGA GTCTTGTGCA CAAGAAAAGA CATTCTATTC TGAAAGAAGC CACCAATAAG GAAAGCACCA TTTGCCAACA CGTCTTGATG AGGTTAATCT TATGCTCAAT TCATCAAAAA TGTCATCAGT TGCCTAAAAT CCTGGATAGG AAATTCATTG TTTGTCTTCT TGGAGCAGCG AGATTGCATC ATTTACGTCT CCGACAGACC AGTTCAAAAC TTTCCCAATA TCCAGGCTCC GCTTCATTGG ATGAGCACAG TCCTGATCTT CCCTAA
|
Protein sequence | MSSEDEDVLH KYRKYDPPPQ ELPVYSSDED SSGSSILTAN AEPLTAKADL LKVLDYIIIV GNRNRYQSLQ ATREQISETT TGSNELSEIE QTDDFELSGM HADDFQLDPQ RRKKSRLSAA LLCPSYAAWE ETNRTDALDC TWCHQYSDCI ALPYNKVRSK EPPALKGFFY TRTDRDLTLR CWRKAVHAAS TVMVLEGVNG NVRLDLTSAN SAEKDAKETC LNLGIALSNL KPYTCPSCFK VFQSCRAREC HFWGENNHRG CCWSLVHKKR HSILKEATNK ESTICQHVLM RLILCSIHQK CHQLPKILDR KFIVCLLGAA RLHHLRLRQT SSKLSQYPGS ASLDEHSPDL P
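The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 20461
END_BP = 21516
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces and any other characters (such as the 10-base grouping used
    in the Sequence field) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports as a rounded percentage.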