Gene Information |
Locus tag | PHATRDRAFT_49743 |
Symbol | |
ID | 7198333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 133297 |
End bp | 134583 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184574 |
Protein GI | 219128761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
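The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions match the values reported here but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(133297, 134583)
print(length_bp)                  # 1287, matches "Gene Length"
print(protein_length(length_bp))  # 428, matches "Protein Length"
```

Because the record lists the gene as unspliced, no intron lengths need to be subtracted before dividing by three.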
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.733154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCGA CCAATGCAAC AGTCAGTCCC AAGTTTGGAA CATTCGTTTT CCCCTCGGCC CGTGTTGTCG TCACTTGCCT CTCTCTCTCT CTCACTGCTT CCTTTGGTGG AAATAGTGTC ATGATTGGTC GGCATCGTCA TCGTCATCGT CATCATCAAC GGCGGTGGCA CGGCTACGTT GGATGGTTGC TGTATGGCAC GTTGATGCAG TACCCGCACG GGTTTGCCGG GAGCGGCGGT ACAACCGTTC GTGTCCAGCG TGACGGACTT TCCGGTTCAC GACGTCTTCA TCACACGGTT TCAACGCTAC TCGCCAACCA GTCCAGCAAT AATCAAGATA CTCATACTCC CAATACCGCA ATCGATGACG TCGGTCCGGT ATTGCCCCAA TCAACAAGAA TGTCTACAAC AAACACTAAC GTGTCCACCA CAACAACACC CACGTCCCAA CAGTTCATGA CCGCCATGGG CACGTCCCCC CGACGCATTC TGTTGTCCGG ATTATCCTCT ACCGCCATTG CCTTGGCGGC AAACTTTTTG GGCTCCACCA GCAAATTGCT TGAGTTCGTG CCCGAAGAAG TTGTGGAGGC CTCCGGTCTC GACACGTACT ACCCACGCGG AGACTACAAA CGCGTCCGCA GTGGAGCCAA CGGATACACC ATGCTCATTC CCAAGGAATG GGTCGCCGAT ACCAGTCTCG CACTCGCCAA AGCCACCCGA AACGCGGGAT CGCTCGATTT CGCCATGGCC ACGAAACCGT CTTCCACACC ATTGTCGTCG TTGTCGTCGT CTTTCCGGTC CGTAAGCAAC GCCCTTTTGC CCGACGCCGC CTTTGGTCCG CCCGGGAGGA ACCGTGGTCC AAAAGACGGT ACCGGTGGGC TTGACAATAC CAACGTCTCC GTGCTGGTGG CCAATCTACC CGATCCCACA TTTTCCCTCG CAGCGACTCT GGGAACGCCC GCCGTCGCTG CCACCACGCT CCTCCAACGG TCCCTGGCGC CACCCGGATC GGGACGCGTC GCCACCTTGG TCGATGCTTA CGCTGACGAA CAGCGGCACG GTCTCTACCA ACTGGAATAC GTCGTGGATC GTGGCGAGTC AACGCGGTCG CCCGACAGCC GGGCCCGGCG ACGGGCTATT TCTGTGCTGG CTGTCTCGGA GGATCGGACG AAACTCTTCA CCATGACCGT CGTCGCTCCC GAAATAGCCT GGAACGACCC AATCTTGGAG CCGAAACTCA GAAAAGTTGC CGCGAGCTTT CGTCCCACGG AAGCGATCAT CCGGTAG
|
Protein sequence | MDPTNATVSP KFGTFVFPSA RVVVTCLSLS LTASFGGNSV MIGRHRHRHR HHQRRWHGYV GWLLYGTLMQ YPHGFAGSGG TTVRVQRDGL SGSRRLHHTV STLLANQSSN NQDTHTPNTA IDDVGPVLPQ STRMSTTNTN VSTTTTPTSQ QFMTAMGTSP RRILLSGLSS TAIALAANFL GSTSKLLEFV PEEVVEASGL DTYYPRGDYK RVRSGANGYT MLIPKEWVAD TSLALAKATR NAGSLDFAMA TKPSSTPLSS LSSSFRSVSN ALLPDAAFGP PGRNRGPKDG TGGLDNTNVS VLVANLPDPT FSLAATLGTP AVAATTLLQR SLAPPGSGRV ATLVDAYADE QRHGLYQLEY VVDRGESTRS PDSRARRRAI SVLAVSEDRT KLFTMTVVAP EIAWNDPILE PKLRKVAASF RPTEAIIR
|
| |
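The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code, since the Translation table field is blank in this record, and it assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TAG even though the gene is on the minus strand). The helper names `clean`, `translate`, and `gc_content` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
fragment = "ATGGATCCGA CCAATGCAAC AGTCAGTCCC"
print(translate(fragment))  # MDPTNATVSP, the first block of the protein sequence
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field if the assumptions hold, and `gc_content` on the same input can be compared with the reported GC content.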