Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33014 |
Symbol | |
ID | 7197021 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1314955 |
End bp | 1316315 |
Gene Length | 1361 bp |
Protein Length | 436 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177805 |
Protein GI | 219112107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.111303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTCG AAAGCGATCG TTTATATTGG AGCGTCTCGA AAGAAGATCA GAGCTACGCT GGCACCTCAG ATCCTGCTCC TCCCGTTCGA TCATTAATTC TTCCTCGTAC GTTGTCACAA GTTGTCTGTC AATTTCTGCT AAGTATATCT GACCAGTATT TTTTGATATT TGGCTATACT CAGGAACGTT GGCTTGGTTT TCGGTGGACG ATGACCACAT TTTTCTCTTG GAAGGTTACA CCGCAGCATC GTACACACCA CCAACCATTA TCTTTCCGGC GTCAACTTTG CCAGGCAAAA TGTGGCAGAC GCTCTTGCAA ACTCGTACAT GTACACTATC AAGCGCGACT ACACGGGAGA GTACGGCCGT CTTAAAAAGG GCGTTGTCGG GGACTGAGCC AAAAAGCTTT AAATTTGACG AGCTAAGACT GAAGGCGTCT AAAAACAAAA AGGATTATCC TTTTGTGGTG GCTGAGTCCC CCATCCATAT GTTTTGCAAG GTACATGAGC TGATTGCTTT AACAAACGAC GAAGCCTTGG TCATTTTGAC GGTCGAGACT TTCGTAATTG ATGGATCGGT CCTAGGTCCG CCCACAGATG AAATGACAAA GCGGCCCAAC GTGACGGCCA AAATTGACGC CGATCTAATT GAACCAGTAG TAAGTCTCGG CGACGGCAAG GTATTTCCCT TGATGTGTTT GCGATCGATG CCTCGCCCAA CGCGTTGTGC GCAACGCGGC TCGTGGACTT CCACCGATTT CAATAACAAT GTCGGAAGAG GGGACCATAG CCTCGCTTAC GAAACGACCG AATGGTCGTA CAGACAACAC GGTGGTACCT GCCCCCTCGG GTATAACGCA ACAACGGCTT TAATCATGCC TAGACCAATC GGTTGGATTT CGACATATTC GCAAGAAGGC AATGTTGCTC ATCTGGCTCC CTACAGTTTT TTTACTGATG TCTCCCGCGG ATGTGAACCA ATAGTCGCAT TTTCCGCCTT TCGCAAGGAA GGTACAATAA AAAAAGATGC ACACAAAGAT GCAGAGGAGA TGAAGTGTTT TGTATACAAC ATGGTGGATG AGGACTTGGC CGTAAAAATG AATTATTCTG CTGCAGAGCT TGGACGCAAC GAAAGTGAGT TCGAATTAGC CAAGTTGACA CCTGGGAAGG CACGTCTAGT AGATGCTCCG GTTGTGTCAG AAGCCTGGAT ACGATTGGAG TGTGAATATT TTAAGACGGT GGAAGTTGAC AGCTTTTCAG TCGTTCTCGG TTTCGTTCGT GGCATAGACA TTGACCGCAA GCTTTGGAAA GACGGCAGGC TCGACGTATC TTTGCTAAAG CCTATCACAC GGCTCGGGTA A
|
Protein sequence | MAVESDRLYW SVSKEDQSYA GTSDPAPPVR SLILPLFFDI WLYSGTLAWF SVDDDHIFLL EGYTAASYTP PTIIFPASTL PGKMWQTLLQ TRTCTLSSAT TRESTAVLKR ALSGTEPKSF KFDELRLKAS KNKKDYPFVV AESPIHMFCK VHELIALTND EALVILTVET FVIDGSVLGP PTDEMTKRPN VTAKIDADLI EPVVSLGDGK VFPLMCLRSM PRPTRCAQRG SWTSTDFNNN VGRGDHSLAY ETTEWSYRQH GGTCPLGYNA TTALIMPRPI GWISTYSQEG NVAHLAPYSF FTDVSRGCEP IVAFSAFRKE GTIKKDAHKD AEEMKCFVYN MVDEDLAVKM NYSAAELGRN ESEFELAKLT PGKARLVDAP VVSEAWIRLE CEYFKTVEVD SFSVVLGFVR GIDIDRKLWK DGRLDVSLLK PITRLG
|
| |