Gene Information |
Locus tag | PHATRDRAFT_40650 |
Symbol | |
ID | 7198573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 67607 |
End bp | 69355 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184638 |
Protein GI | 219128897 |
COG category | |
COG ID | |
TIGRFAM ID | |
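The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon (the record lists the gene as not spliced, and the gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(67607, 69355)   # 1749
length_aa = protein_length(length_bp)   # 582
print(length_bp, length_aa)
```

Under these assumptions the result matches the listed Gene Length (1749 bp) and Protein Length (582 aa).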
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGACA ATCTTACATC CCTCCCGACC AACGGCGAGT CGTTGCTACC GACGGACGTT CCCACCGATC GTCCCTTGTC CATTGGCGTT TTTGGCGGTT CCTTCAATCC CATCCATCTG GGACACGTGC TTTTGGCCAT TACTACACAG CAGACCAAAC CGGTGGATCA AGTAGTATTG GTACCCGTCT ACAAACACGC CGTCAAGCGT GACTTATTGC CCTTCGACGA TCGGGTCCGT ATGTGCCGAG CCGCCGTCGG ATCCTTCGGT CAGCACAATC GCGCCATTGT GGTATCTACC GTGGAACGCC GCGTAGGTGC CTCCAACGGA GCCATGCTGC GAGCTCTCCA ACAAGAATAC CCCGAGGGGA CCCGCTTTTG GTGGATCTGT GGCGACGACT TCTTCCGATG GATGGAGCGA CCCAAGGGGC TCGAAACACT CGCGCACGTT TCGGGATTGA TCGTCCAGCG ACGTCTCCAC AAACGCGCCA ACGGACAACT CTTTCAGGAA GATCTAGACG AAGCCCGCGT CCGCGCCAAA ACCCTACAGC TCGATATCCA TTTAGACTTT ATTTACGGAG AGTTGCCGCA CTTTTCGTCG ACCCTCGTCC GGCAGGCACC GGGATCCTGG CGCTCCTTTT TGCCGCAAGC AGTCGCGGCC TACCTGGACG CGCGACCGCA TTTACAGGAA CAGTTGTTGG CCAATCTACA AGCCGACGCC ACGGCAGAAG CCGCGCAAAC CGTATTGTCC GGCGAACAGC CGGTGACCGC GGCTTCGAAC ACCGCGTGCT CCACAACCAC CACGTCCGCG GCAGCCTTCA AACAAGCCGG GTTGTGGGTC ATGCGGGGGC TTGATATGGT ACACATGCTC CAGTACGAAC GTGGCATGAC CGGACTGCGC CTTTCCACCG GAACCACGCA AAAATACCAA CAAGAAACCC TTGAAGAAGT CCAACGCAAT ACGGATCGCG TATTGCGAGA AATTCTCGAC GCCCACGCGG AAAGCGACCA GCTCGTACTC CTCCCCGCCG ACGATGACTG GCCCGAAGTC CAAGCCCTCG CGGCGGAACT CCAGCAAGTC CCCACTTGGT TGACGCGCGA CCGCGCCACA CTCGCTCGGC GCCAACTAGT TTTGACGGCC ACACCCGGCG TCGAAGGCTG GGCGGCACGC TATTCCCTCG TGGAAAAGTT CCACGCCCGC CTCGACCTAC TCACCCAAAG CACCGTCCGC GCGCTGGTAG AAATCCGTGC CAACCTCGCG GTGGCGCAAG CCCAACCCAC ACCGTCTCGG AGTGTACCGG AACTTTTACG TTCCTGGTGT CAGGGCAAGG AAGCCCTGGG ACGACTGCGT GCGTTTGTCT GTGCCGGCGG CCCGGACGCC TCCACCCTGG TGCGCGAATC ACTCGCCACC CGACAACAGC TGGTACGCGT GATGGAAGCC AAGGATCGGT GCATTGCGCG CGTATTGATT CTGGAAGCTG GTGTCTCGAC CCGATTGGCC GCCCCCGATG CTTTGCACCG GATGCTGAGC GAAGTGACCA AGGCGGAATG GTCCCTCATG GGCTGCTGCA GTTACGTGGC CAACCAACCC GGATCCATCC AATTGGTCCA TCAGCAGCTC GCATCCGGTA GCGCCCCCTC CGACGAACCA TTCTCGGTCC AACACTTTTT TGAAGCCTCT AGTACGGCCA TTGATTTTTT GTTAACCTTC GCCAAGGCCT TGGCTGCGTC CGCCTGCTCG ACGTTGTAA
|
Protein sequence | MEDNLTSLPT NGESLLPTDV PTDRPLSIGV FGGSFNPIHL GHVLLAITTQ QTKPVDQVVL VPVYKHAVKR DLLPFDDRVR MCRAAVGSFG QHNRAIVVST VERRVGASNG AMLRALQQEY PEGTRFWWIC GDDFFRWMER PKGLETLAHV SGLIVQRRLH KRANGQLFQE DLDEARVRAK TLQLDIHLDF IYGELPHFSS TLVRQAPGSW RSFLPQAVAA YLDARPHLQE QLLANLQADA TAEAAQTVLS GEQPVTAASN TACSTTTTSA AAFKQAGLWV MRGLDMVHML QYERGMTGLR LSTGTTQKYQ QETLEEVQRN TDRVLREILD AHAESDQLVL LPADDDWPEV QALAAELQQV PTWLTRDRAT LARRQLVLTA TPGVEGWAAR YSLVEKFHAR LDLLTQSTVR ALVEIRANLA VAQAQPTPSR SVPELLRSWC QGKEALGRLR AFVCAGGPDA STLVRESLAT RQQLVRVMEA KDRCIARVLI LEAGVSTRLA APDALHRMLS EVTKAEWSLM GCCSYVANQP GSIQLVHQQL ASGSAPSDEP FSVQHFFEAS STAIDFLLTF AKALAASACS TL
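The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation of the kind summarized in the GC content field. The helper names `ungroup` and `gc_percent` are hypothetical, and the example runs on only the first two blocks of the gene sequence rather than the full record.

```python
def ungroup(grouped: str) -> str:
    """Remove the whitespace used for 10-character display blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two display blocks of the gene sequence, used as a small illustration.
fragment = ungroup("ATGGAGGACA ATCTTACATC")
print(len(fragment))          # 20
print(fragment[:3])           # ATG, the start codon
print(gc_percent(fragment))   # 40.0
```

Applying `gc_percent(ungroup(...))` to the full gene sequence gives the value to compare against the GC content field; the fragment result applies only to those first 20 bases.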