Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35561 |
Symbol | |
ID | 7200791 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 229739 |
End bp | 231208 |
Gene Length | 1470 bp |
Protein Length | 449 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180196 |
Protein GI | 219118859 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.742828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTAC CGAGCTCTGT TTGCGGCATC GCTCGGGTCT GTGCAGCAAC TACCTTTCTC GCCATGTTGT CTTCCCATAC TTCTAGATTC GCCGTCCGAG CCTTTACCGT TAAGGCTTTT GCGGCACACA ACTACAGACA ACGATATCCT CTTCCCCATT TGAATGTTCA TATGCACCGT GAAGCGGATA TGGTGGAGCA AATGATTGGC GGAGAGCGGT ACGAAGTGAC TCAACTCCCC GATTCCATGG TGGAAACTAC ATTGTTTGTG GGTAATATTG ACGAATTTGT GCACGACGAT GACTTGTCGG GTCTCTTTCA GTCTGTCAGC AAGTTGCAGT CCGTACCTGC CTGTGTAGTT CGCAAACCAG ACATGTCCTC ACTACAGTAC GGCTTTGTAT CTTTTCCATC CGTGGAGGAA AAGGAGGTAG GTCTTGACTG TGAGTGGGTG GCATTGTGGT GCATGAGAAT CGTGTGACGT TGGCGTTTCT TCCTTTTTCC TGGTACGGAA ACTAACACAT TATTTGTCGA TTTGCAAACT TGGCAGGCTG CGATCATTAG ATTCAATGGC TACGAATGGA AAGGGAAGAA GCTAAGAGTG CAGGAAATTC GGGACCATCC CGGTCGGGCT CGCGTAAGCG TACCGGAACG CATGGTGGCC TATGTAAGTG GAGCTGCCAA GAAAGTCCGC GGAGGGAAAA CGAACCAACT CCGTCGGATC TCGCGCGATG ATGTAGAACG ACTGAGTCGT GGTCAGCCTT CCAAGAAGAA AGGATACGGT AGTCGAAACG TTCCGCATAG GTTGAATGAC GAAGAACGAG CCGAGATGGA TCGTGCCGCC AAGAAGGGTT TCGTTTCCTT TGCGGGCACT GGCAACCGAC GGACTCGAAA GGGATCACCG CTCGCCAATA TTCACCGACA GTGGTGTGAT GCGCGGGACA AACCACAGAT TCTACACTTT AAGGCTAGCG GTGGTAGACA ACCCTTGGAC CAAGTGATTG TTGATTTGTC GCCATTGAGA TTGAACGGCC TCTTTGATGA CCCGAGCCGG GTGGATGATT TTCTGGCTAA GTGGAAAGCA GACATTGCTA CCGCAGCGAG CAACTCAGGG ATGCAAATTT ATACACGGAC TAGTGATGTT GATGCAGATG AAAATGATCT TGACGAAGAT ATTGCTACTG ATTATTTGGT CACCCTTGAC CATCATGCAG TAATGGAGGC ATGGGCCACG AAACCGATAT GGAAGTTGCC AGTGGTTAGC TTTGCAGTCT TTGAAGGAGA TAGACCACGC GCTAAGGCTA TGGCCAAAGA ACTGGCAGCA TTGTGGGAAA TTCCAGAAGA GCAGCAAGAG ACTGGAGGAG GTCCCAAGAC TCGTCGAGAC GCTGGTGCTC GGAAAGGCGG GAAAACCAAC ATGAAGGGAC TTAGCCAACA TCGTAAGCGG GGTGGGGGTC ACCGCCAATC TTTCTACTAG
|
Protein sequence | MSLPSSVCGI ARVCAATTFL AMLSSHTSRF AVRAFTVKAF AAHNYRQRYP LPHLNVHMHR EADMVEQMIG GERYEVTQLP DSMVETTLFV GNIDEFVHDD DLSGLFQSVS KLQSVPACVV RKPDMSSLQY GFVSFPSVEE KEAAIIRFNG YEWKGKKLRV QEIRDHPGRA RVSVPERMVA YVSGAAKKVR GGKTNQLRRI SRDDVERLSR GQPSKKKGYG SRNVPHRLND EERAEMDRAA KKGFVSFAGT GNRRTRKGSP LANIHRQWCD ARDKPQILHF KASGGRQPLD QVIVDLSPLR LNGLFDDPSR VDDFLAKWKA DIATAASNSG MQIYTRTSDV DADENDLDED IATDYLVTLD HHAVMEAWAT KPIWKLPVVS FAVFEGDRPR AKAMAKELAA LWEIPEEQQE TGGGPKTRRD AGARKGGKTN MKGLSQHRKR GGGHRQSFY
|
| |