Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38958 |
Symbol | |
ID | 7203708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 640132 |
End bp | 641640 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183000 |
Protein GI | 219125461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.235143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCGT CACGCTTCGT GGTGGCGGTG CTTGTGCTAG GCGTCGTACG CCTGAGCTCC GGGTGGATTC TAACCCAAAA TCATCCACAT CTACGGTTCT TCCGTACAAC AACGACACCA ACGCGGTCCG ATTTCCGTCA TGGACTGACA TTGCAACAAG CAGTCCGGGG TGTGGCTGCC GCTGGAGTCG AGCCGGACGA ATGGGAGGCG TTGCGTCCTC GTTTGGCGGC CGTATGCCGG GACTGTATCG ACCAAGACTT GGAAAGTGAC GGGATCCTGT TCCTGCCCTC GTTGACCGGA GACCCCGAGT CGCAGTTTGC CGGAATTGCT GGTCGTGTTT TGCTGCTTTC CATCGACAGT TGGGTAGCGG AAGACGGTGT CGTGCTTTTG TTTGATGCGG CGACGGCCGC TATTGATGAA CTTTTGCAGG ACGAAACCGT AAGGACAACC GAAGGACATG CGCAACCGAT TTTATTGTAT ATCCAACCGG AGAACGTCAC GTCCGTTGTG CAATCCGACT CGATGCAAAT GGAAAATCTT CTGCAAACGC TTATTAGCAA CGACATTGAT CAGTACGGTC TGGCGGACAT CGTGGGTGGG GACGTAGGCT TCGGTTTGCC ACTTACTACT GTCGTACCAA CCGACCAATT TGAGGTCGAC GGGGCCACAG TTCGCGATCC CCTCCAACGA ACGGAATTTT GGGATACCAG TTCCGTACTG GTTTTCGACG GCTTGGTGAC CAGGGATTTA CGTCAACGAT TGCTGGAACT TATTTTGGAT GAAACCAACT CCGACGAAAA GTGGGACGAC GTTGCTAACG GACCGAATCC GAGTCGATGG GTGCGGGGTG GATTGATGGA TATACCCGCA GGACAGGGCA GCACTGACGA GGCGGAAGAA TCGGACAGTC TAGGAGGCTA TGGCCTGACC GACGAGGCGA TTGATGAGCT TTGTTTCGAG GACCACGAGG CGATCGAAGA GTTCGAAAGT ATCGTGTCGG CACTCTTTCC GCACATGAGG ATAGCGCGAT TGCCCGAGGC GGTCTTGGGT TCGTCGGTCT CGCCATTGAC AGCCAACGCT CCCATTCACG GTCACTCGTT TAAATATCAC ATTGACGCCG ATCCTAATTT GGTACCACAA TCGCCATGGG CCGATGTCTA TGGCCGTTAC CCCAATCGAA CACCAGGCAA GCCCCGATTT ATCAGTTGCC TTGTCTATCT AAACGACGAG TGGGACTACG ATGCCTGGGG CGCTCCGACA AGATTTTTGG ACTACGCGAC GGAAAGTGCT TGTGATATCC AGCCCAAACC GGGTCGCATA GTGTTCATGG ATCAGGACGT CACCCACACG GTAGTCGCCC CAATGAGCGC CGCTGGCCTA CGACCCCGGT ATTCCCTAGT TTGGAAACTC ATACTGCACC CCACGCGATA TAATCAAGAC ATGACTGACC TTGCTGGGCC CGGACGCAAT TGGCCCGCAC CAACTTTAAT TGGTAGTGCA AAGCGATAA
|
Protein sequence | MSASRFVVAV LVLGVVRLSS GWILTQNHPH LRFFRTTTTP TRSDFRHGLT LQQAVRGVAA AGVEPDEWEA LRPRLAAVCR DCIDQDLESD GILFLPSLTG DPESQFAGIA GRVLLLSIDS WVAEDGVVLL FDAATAAIDE LLQDETVRTT EGHAQPILLY IQPENVTSVV QSDSMQMENL LQTLISNDID QYGLADIVGG DVGFGLPLTT VVPTDQFEVD GATVRDPLQR TEFWDTSSVL VFDGLVTRDL RQRLLELILD ETNSDEKWDD VANGPNPSRW VRGGLMDIPA GQGSTDEAEE SDSLGGYGLT DEAIDELCFE DHEAIEEFES IVSALFPHMR IARLPEAVLG SSVSPLTANA PIHGHSFKYH IDADPNLVPQ SPWADVYGRY PNRTPGKPRF ISCLVYLNDE WDYDAWGAPT RFLDYATESA CDIQPKPGRI VFMDQDVTHT VVAPMSAAGL RPRYSLVWKL ILHPTRYNQD MTDLAGPGRN WPAPTLIGSA KR
|
| |