Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46251 |
Symbol | |
ID | 7201205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 725294 |
End bp | 727228 |
Gene Length | 1935 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180494 |
Protein GI | 219119468 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.164515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGAGGTACC TGGCTAAAAT ATCAATGTAC GTAGGGCCAT GTGAAAGAAA GACCATTCAA CTTACGGGTA GCTACTTCAT ACCTCAAGCC TCCAAAACGG ATTTCGTTGA CGACACTGCC GTGCGCCGGC GTTGCCGGGC CCATGAGCCG TTCCCTGGTT GAAGGCGATG GACACTCTCT GGGAGAAGAA AGCGTATCTT TGTCGTCCAG TCGACCGCCA CGATCGCTGG ACGATTTCCT GGAACTAGCG TCCCGTCCGG CCCCACACAC GCCCCCGCCA TACCGCTATT GGCTTATCTT TTTAAGCCTA GGATTGGCAA ACAGTTCGGA CGCTTCCGAA ATTTTGTGCT TGTCGTACAT TCTGTCGGAG AAAAGATTCG AGGATGTCAT TTTACACGAC GAGGCGTGGC GGAAAAGTTT GTTGGCTGCT GCCGTCTTTT TGGGAATGCT GATGGGGGGT CTCTTTGTGG GAGCCTTGGG CGATTGGCAT GGCCGTCGAC CAATGTTGTT GGTCGGGTTA ACGACCAACT CCGTGGCGGG GCTCTTGTCG GCTTTGGCAA CGGACGCAAT TTCTCTTAGC GCGTTGCGGT TGGTGGCTGG TATAGGTATT GGTGCTACAG TACCACCGTT GTTCACCTTG TGTAGTGAAC TGGCGCCCCC GGCCGACCGC GGCTTTTGGG TTACGGTCGC AGCTAGCTTT TGGATGGTTG GTAGCATTTA TGTGGCTATT ATAGGCTGGA CTCTGTTGGG CAATGGAGCT TCTTGGAGAG TTTTTGCGGC TGCCTGTGCG CTACCGTCAG CATTTGGATG CGTCATGGTC TACCGCTTCG TTCCTGAGAG TCCACGTTTC TTGGCCATGC GAGGAGACCA CGAGCGGGCA GTAGCCGTGG CGCAATACTT GGCCGACTCT ATGAACTACG TTGGGTCTCG CTTGCGACAG GAAGAACTGC TTGAATATTA TCCGCGACAC GCTGGAGATA CAGAACGTCG GTTCCCTGGC GATGTCTCGT GTTGGCATCA GATTACCAAA GCATTTACGA CCTTTGCGGT TTCCGTTACC CAACTCTACA AACCACAACT CCAGAAAACA ACCTGGCCGT TACAGATGGT TTGGTTTAGT CTCTCATTCG GTAGCTACGG ACTTTTGACC TGGATCAATA CGCTTTTTGT CGAAGTGCAC CTGGAAAATC TGTACCTGAA CTCGTTGCTC TTTGCGCTGT CAAATTTACC CGGAAATCTT CTTTCGGCCT GGTTGCTCGA TCGGACGGGG CGAGCGACGC TACTAGTAGG TAGCGTGATT GCGGCGGCCC TTTCCCTCTT GGCCTTCGCC TACGTGGCAG CACAGGAAGC CAGTGACTCT CCATCAGTGT CCAAATCCTG GATTATTGTA GCTGCCTGCT CCTTTCAATG CTTTACGATT ACGGCGTGGA ATTCGATTGA CGTCATGACG TCAGAACTTT TTCCGACGAC AGTCCGGTCA ACTGGAATGG GACTTTGTGC AGCGTCCGGT CGAGTTGGGG CAATGCTTGC ACAACTGGTC AATGGAGCCT TGGTGCAAAA TCCGACGCGG CTTTTGATCG TGGCCGCTAT TACTTTATTG ATGGGAGCAC TCACCCCAGC TTTACTGCCG GGGGGTGATC AGACAGGGAT GGCAATGCTT GATCGGGTTG CGGTTGTGCA TGAGGACAAT GACGAAGACA GTGTTCTGGA TGTTTCATTG CGAGATCGTG TGGTACTGCA ATCTCGGCAG AATCAAAAAC ACAACGAGCA CGGCTATAAT CTGGTAGATT CGGTTGTATC GCGACAAGCA CATCGATCGA CTTCAATGGT AGAGTGATTG AATATAAGGA TTTTCATAGC AATCCCGAAC GCGTGGACTT TAACCACTTT GCCCTCCGGG GTATACTGAA TCCTAACCAT GTGACTCAGA TAGTC
|
Protein sequence | MSRSLVEGDG HSLGEESVSL SSSRPPRSLD DFLELASRPA PHTPPPYRYW LIFLSLGLAN SSDASEILCL SYILSEKRFE DVILHDEAWR KSLLAAAVFL GMLMGGLFVG ALGDWHGRRP MLLVGLTTNS VAGLLSALAT DAISLSALRL VAGIGIGATV PPLFTLCSEL APPADRGFWV TVAASFWMVG SIYVAIIGWT LLGNGASWRV FAAACALPSA FGCVMVYRFV PESPRFLAMR GDHERAVAVA QYLADSMNYV GSRLRQEELL EYYPRHAGDT ERRFPGDVSC WHQITKAFTT FAVSVTQLYK PQLQKTTWPL QMVWFSLSFG SYGLLTWINT LFVEVHLENL YLNSLLFALS NLPGNLLSAW LLDRTGRATL LVGSVIAAAL SLLAFAYVAA QEASDSPSVS KSWIIVAACS FQCFTITAWN SIDVMTSELF PTTVRSTGMG LCAASGRVGA MLAQLVNGAL VQNPTRLLIV AAITLLMGAL TPALLPGGDQ TGMAMLDRVA VVHEDNDEDS VLDVSLRDRV VLQSRQNQKH NEHGYNLVDS VVSRQAHRST SMVE
|
| |