Gene Information |
Locus tag | PHATRDRAFT_47956 |
Symbol | |
ID | 7203203 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 549581 |
End bp | 551553 |
Gene Length | 1973 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182416 |
Protein GI | 219124239 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
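The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state this, but it is the convention under which the reported Gene Length is reproduced. The function names `gene_length` and `min_coding_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein: one codon per residue plus a stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
length = gene_length(549581, 551553)
coding = min_coding_length(541)

print(length)           # 1973, matching the Gene Length field
print(coding)           # 1626
print(length - coding)  # 347 bp not accounted for by codons
```

The 347 bp difference is consistent with the record marking the gene as spliced: the gene span covers more than the coding codons alone. The record does not say how that remainder is divided between introns and flanking untranslated sequence.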
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGTCTCCATC GCAACGTTCG GTTGTTTCGT GCCGGGCTCG CCAATCCTTT TTTTTCGCGG ACGAGCTCTA TCTCACAATA GTTGCTATTG CAAACCTTAA GATGAAGAGT TCTCTCGTCC TTGTTGCTGT TGTGCTCACC TTTTCGGCCG AGGCCTTTAT ACCACATGTG CGGAGACCAG CGTTGCTTCG TCCCGTTGCC GTGTCGTCGT CCATCAAGAT TTTTACGGCC AAAAACGCCA GTGAAATCGC CTTTGAAGAA GTAGAAAGCT ATCGGGATGG CATGAGCATC ACTCGATCCG GTACCAACGA AAACAAGGTA CGTAGCAAAA GTCTATAATC CGGCAAATAT AAGGCCCCAC GGCTCCGACT CCCTTGCTCA CCTCGAACCG TACGACTTTC ATGGATTCGT AGGTGATGGA CGTTGTCATG AAATTTGGTG GTAGTTCCTT GGCAGACAAG GATCGAATCG ACCATGTAGC AAACTTGATT AAGAATCAGA TTGAAGCGGG GTACCGACCT CGGGCCGTGG TTTGCTCGGC GATGGGCAAG ACGACCAATT CTCTGCTGAG TGCGGGGGAA TTTGCCTTGG GTACGTCGGC TTTGCTCTTT TTCACGTTGC GAGCGCATGT GCGGTGATGT TTGGCACTGC ACAATCGCAA AGTGTCTCAC TTTCAATGGT CGTACTTGTC TCTGTAGAGG GCCGTGTCAA CGTTGATGCG ATTCGTACCT TGCATCAGTC CACTATGAAT CATTTTGAAT ACTCTCAACA CATCATAGAC GACGTCAATG CACTCTTGGA CGAATGCCAG GACATGCTGA ATGGTGTGCG GATGATACAG GAGCTTAGCC CGAAGTCTCT AGATCAGCTT GTCTCCTACG GGGAACGATG CTCAGTTCGT ATTATGGCGG CCCGTTTGAA CCAGCTTGGT GTACCCGCCC AAGCGTTCGA TGCTTGGGAT GTCGGTATGA TTACGGACAG CGAATTCGGG GATGCCAAAA TTCTTGCCGA GTCCGAAGAT GCCATTCGAA ATGCCTTTGA CCGGATCGAC CCGAACATTG TCAGTGTAGT GACTGGCTTT ATCGGCCACG ACCCCAATAA GCGTATCACG ACACTGGGTC GAGGAGGGTC GGATTTGACG GCAACGCAAA TCGGCGCTGC TTTGAAACTG GACGAGATTC AGGTCTGGAA AGATGTGGAC GGTATTTTGA CTAGCGATCC TCGGTTGGTG CCTAATGCTG TCCCGGTGGG CGACGTGAGT TACGAGGAAG CTAGCGAATT GGCTTACTTT GGCGCGCAAG TGCTGCATCC GATCGCAATG CAGCCAGCCA TGAAACACAA TGTTCCCGTA CGGGTCAAGA ATTCGTACAA TCCATCAGCC GTGGGAACAA TTATTCGTAA CAGAAAGGAA ACCGAACGGT TAGTGACCGC CATTACCTAC AAGCGTGATA TAAAATTGAT GGATATTGAA TCGACACAGA TGTTGGGAGC GTACGGTTTC TTGGCACGCG TATTTGGAGA ATTCGAGAAG CACAAACTCT CGGTTGACGT GCTCGCTTCG TCCGAAGTCT CTGTGTCTCT GACCTTGGAC AAGAAACAAA AGGATGCCGA AATTGACGGT CTCATGCGGG ATTTGGGCAG CTGCGCGAAG GTCACGTGCC ACAAGGACCG ATCCATCCTG ACACTCATTA CTGACGTTGG TCGCAGTTCG GAAGTACTCG CTACTGTTTT CCGTGTTTTT TCGACTTGCG GCATTAAAGT TGAAATGATG AGTCAGGGAG CCTCGAAGGT AAACATTTCC 
TTCATCGTCA AGGACGAAAG CCTGGAACGA GCTATCCTGG AGCTTCACAA ATGCTTCTTT GAAGAGACCT GCTCAGTGGA GCCTTTCAAA CCAGAGGCTG GCAGGAATAA GACATTGCTC GTTGTGTAGA GAAGCGGGTA GATATTATTT TCTGGTATTG CTTTGATTTC GCC
|
Protein sequence | MKSSLVLVAV VLTFSAEAFI PHVRRPALLR PVAVSSSIKI FTAKNASEIA FEEVESYRDG MSITRSGTNE NKVMDVVMKF GGSSLADKDR IDHVANLIKN QIEAGYRPRA VVCSAMGKTT NSLLSAGEFA LEGRVNVDAI RTLHQSTMNH FEYSQHIIDD VNALLDECQD MLNGVRMIQE LSPKSLDQLV SYGERCSVRI MAARLNQLGV PAQAFDAWDV GMITDSEFGD AKILAESEDA IRNAFDRIDP NIVSVVTGFI GHDPNKRITT LGRGGSDLTA TQIGAALKLD EIQVWKDVDG ILTSDPRLVP NAVPVGDVSY EEASELAYFG AQVLHPIAMQ PAMKHNVPVR VKNSYNPSAV GTIIRNRKET ERLVTAITYK RDIKLMDIES TQMLGAYGFL ARVFGEFEKH KLSVDVLASS EVSVSLTLDK KQKDAEIDGL MRDLGSCAKV TCHKDRSILT LITDVGRSSE VLATVFRVFS TCGIKVEMMS QGASKVNISF IVKDESLERA ILELHKCFFE ETCSVEPFKP EAGRNKTLLV V
|
| |
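The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip that layout and compute a GC fraction. It assumes GC content means (G + C) divided by the total number of bases; the record does not say how its 50% figure was derived. `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence above.
sample = "TGTCTCCATC GCAACGTTCG"
print(len(clean_sequence(sample)))  # 20
print(gc_content(sample))           # 0.55
```

Passing the full Gene sequence field through `gc_content` gives a figure to compare against the GC content field in the Gene Information table.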