Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47323 |
Symbol | |
ID | 7202490 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 296948 |
End bp | 299520 |
Gene Length | 2573 bp |
Protein Length | 722 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181695 |
Protein GI | 219122734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACCCGGGA GCGGAAGGCA GTTGAGACTC CACCCTATCG TTCTTCCGTG GAAACTCGAG TGCTGCTGTC TCTGTATTCC CCCTTCCCTT GTCCGTTCTT GCGAGTGACC GACGGTGTCT ATTGATTGTG AATCGTTCTA CCCAAGCAAC GGCGATAACG TTTGCGCTGT GGGTACGGGT CGGTGTTGGC TCGGTAGATA CAACACGTAA CCACCAGAGC GCACAACTCC TGTCCTTCGT GTGCAGTAGT TGATTGCCAT GGATCTCTCC GTCTTGGCTG CTGTGGGATC TTTTGTTGTA TCCGAGCGAG TACTGAAGCC GAAAGCAACC GTTTTGGTTA TTTTGGGTGC CATTTTGGCC TACGACTTGA CGACCAAAAC ACACGATGTC TCCAGCTTGC GCGTGTACCG CGGTACGTAC AATCGTTTCA ATAGATTTCC CCTCGTAGCA TAGTACGACA CTCTGGTACT TTTGTTGCGC AGACACTCGG ATTGCAGCAG AATCTACTTG ACTCACACAT AATGTTGGTC TGATTCAATA TGTTTGCGTG CTGTCCAGGA CCCGCCTTGT TTGCGTTTAC GCTCATGATG TGTGCGTATT CCTTACGCAC TTGGCGGCGC AACGGCATTG CCTGTGACGA ACTTCTTTTT CTGCCGGGAA CTGCACACGG ACAACGGCAC GGCTTGGACG ATGCCAACAA CGTTTCGATG GGCCACGACT CGGCTCCTCT ACGGACCGTG GAGGCTATCT CGCACTCGCC CGACGAGGGA GACGTGGCGG CGGGGTGGAC TATACCGGCG CTAGAACTGA CGCGTACGAA CGCTGCGACC GCAAATAGCG CTGGAGTCGT ACAGAGGAAT AGCCAAAGCC CCGTACGGTC AAGGACCTTG AGCCACGAAT CGTCCATATC ATCTATACAA GAATTCGTCA ACAGCTGGGA TGAAGACGAC ACGGAGCATT TGGACGAAGA CAATCGAATA AGCACATCTA CCGGAGCTGA ATCTGAATTC TTTCTATCCG AAGAGGCTAA TTCAGGCAGC ACAACACCTA GCGGGAACGC GCCACAGACC GGCATACACC AACGCGGTAG TCGCTTGACC CGCGGAGTGG AACGCTTCCG AGAAAACCAT CCGCAATTTA CTCGTTTGGG CTCCTTTTTC TTTTTTCGAT CATCGGCTAC GTCCACCCAG TCGGCCGAGT ACGCACCGTC CGGTCCATCG GTAGTGGGTG CAGCGTTGGA TTTGAGCATG CCTATTTTGT TCAACTTTCA TCTCTACATT GAGGCCTACA ATCACATGGA CCAGTATGGA TCAGACTTTC CTGCCAAAAT CCTACCCCTC ATTTTCTTGT CGGTGTTAGT GGTCCGTTCG ATGTTTCCAC CGGGACGACG GATGCGATTT TGGTCCACCA TGAAATTTAC CGCGACGGCA CCCTTTCACC GATCACGCTT TCGCGACTGC TTTATTGGGG ACGTTGTTAC TTCGTTGGTG CGACCGTGTC AGGATGTTTT GTTTGCCTTG TCATACTACG TGACAGTTAT TTGGGGTACG CTCTCGCAAA CGTACGGGTT GTCTGAAAGT GGAAGTTACT TGGAGCGCAG TTGGATTTTG CATAACGTTG TGTTGCCGTC GGCGGCATTG CTACCGCTGT GGTGGAAGTT TCTGCAAACC CTTCGGCAGT CGTACGATAC GGGGAAACGG TGGCCCTATC TCGGCAATGC CTTCAAATAC TTGTCTGCTT CGGTAGTTAT TTTGTACGGT ATGACGCATC GGGAAGACCG ACGATCAATA TGGTGGCTCG TGTGTTTTGC TGCATCCATG TTATACCAAA TTTGGTGGGA TACCATCATG GACTGGGATC TATTTGTGAT CGAAACGCGG TCGGATCAAG CCACGGATAC TGACCAGGTT TGGTTCGCCA GTTTATCTTC CTACCGACCG AATTCGTATG TCTTGCCTTT CCTGGAGAGT TGCACTCGCC CGATTCGGAA AACGTTCGTC GCGATCGTGA CCTTTATCCC GAGCTACAAA CAAATCAAAC TACGACCACA ACGGTTGTAC AAAAGCGAAG CGTTTTACTA CAAGGTTTTT GTATACAATA CACTCTTTCG ATTTACGTGG ATGCTGTGCT ATATTCCTGC TTACCATTTG TCGGCATCGG GGGAGGAGCA AGTGACGACT TTTTCGTCGG ATACCAAGAC CTACGTAGGG GTGTTACTAC CTCTGGCTGA AATTTTGCGT CGCGCACTTT GGGGATTCTT GTTTTTGGAA AATGAGACGA TCAAATTGCA GAATGGCAAC GCGAGCTACT CACGGATTGA AAGTGTCGAT GAGCCGGATG AAGAAAATGC TGACCAGTCG GAAATGTCGA GCATGTCGGA TGGCAGTAGT AAGGTGCGGC TGCCGTCGTG GTTGGGCTCT CCACAGCTGC AAGACGAATC ATCTTTTCGT TTGCGGGATC GTTTTCGGAG ATTTTTGGAA TGTAACGAAA GAATGCGCCA ACGTCTCTTC ATACTGGAGC TTTTCTTGTG GGCCGTCGCT TTTGTGGGCT TGGGACTGTG GGCCACAAAC TAG
|
Protein sequence | MDLSVLAAVG SFVVSERVLK PKATVLVILG AILAYDLTTK THDVSSLRVY RGPALFAFTL MMCAYSLRTW RRNGIACDEL LFLPGTAHGQ RHGLDDANNV SMGHDSAPLR TVEAISHSPD EGDVAAGWTI PALELTRTNA ATANSAGVVQ RNSQSPVRSR TLSHESSISS IQEFVNSWDE DDTEHLDEDN RISTSTGAES EFFLSEEANS GSTTPSGNAP QTGIHQRGSR LTRGVERFRE NHPQFTRLGS FFFFRSSATS TQSAEYAPSG PSVVGAALDL SMPILFNFHL YIEAYNHMDQ YGSDFPAKIL PLIFLSVLVV RSMFPPGRRM RFWSTMKFTA TAPFHRSRFR DCFIGDVVTS LVRPCQDVLF ALSYYVTVIW GTLSQTYGLS ESGSYLERSW ILHNVVLPSA ALLPLWWKFL QTLRQSYDTG KRWPYLGNAF KYLSASVVIL YGMTHREDRR SIWWLVCFAA SMLYQIWWDT IMDWDLFVIE TRSDQATDTD QVWFASLSSY RPNSYVLPFL ESCTRPIRKT FVAIVTFIPS YKQIKLRPQR LYKSEAFYYK VFVYNTLFRF TWMLCYIPAY HLSASGEEQV TTFSSDTKTY VGVLLPLAEI LRRALWGFLF LENETIKLQN GNASYSRIES VDEPDEENAD QSEMSSMSDG SSKVRLPSWL GSPQLQDESS FRLRDRFRRF LECNERMRQR LFILELFLWA VAFVGLGLWA TN
|
| |