Gene PHATRDRAFT_43491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43491 
Symbol 
ID7197543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp603675 
End bp605681 
Gene Length2007 bp 
Protein Length563 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177650 
Protein GI219111797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00664754 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATAGATAGGC TTGGAGCCAC TGGCCCAGCT TAGCTTTACA TTAACTATCT CGCAATCACC 
GTCCAAAATA GTTCCGAACT TTGAGTAATT TGACGACTCT CACCATCGAT TCTTTCATCC
CGGGGCTCTC GGACTTTACG TTCGTAAAAT ACATCTCCTA TCAGGAGGCT ACTGCTTCAA
CACCTCCGTT CTACTCAAAT TTACAGTTAA TCGCAAATCT CTCCTTGGAA GTACTTCCTT
CAATTTTTCT GAAGTGAAGC TTGCGACCTC AGCCATGTCG ACAGCGTCTG ACGATGCGGC
GGAAAAGGCA GCAGGATCCA ATGTGGAAGA CGGGTCCCTC TCTCTTGCTT CTCACCATTT
TGGCAACGAA ACGGAAATGC CCAAGATTGA GGAAGGCCAA CTCCGCCGCG CTCAGTCCAT
TTCTGAAGCA GTCGAAAACA ATGATGACAA GCTCTGTAAC GCCCCTTGGT TTCTTAAGTT
TATCGAGTAC CCTTTCCATG TAAAGGATCC AACTCGGGAC GTCTACTTAC CTGAAGCGGC
CGGCTGGGCG ATGGATTCTG CGGGTCGGGG CCCGCTCAAT CAAGTCGGAT CTTACGTTGG
ACAGGCGATT CTTCGTTTGG CTACTCAAGA TGCCGGCTGC TTGAACCCGC GAACTTGTAC
GAATACTGTA TATGGTTTGA AGCCAAGTTC TCTCCTTACC GCAACTACTT CTATCGTTGG
AGTAGTGGCT GCTTTTCTGA TGCCCATCGT CGGGGCAGTG GTCGATCACA CCACGCATCG
ACGCTTACTC GGGCTCGTTT CTGGAATGGC GGCCGTCGTT CTCGCTGGCA TTCAGATAAG
TGTGAACGCG AACAACTGGT TCTTCATCCT CTGTGTAGAC GGAGCGTTGT CTTTCTCGCT
ACTTGTACAC ACAACGGCAG TATTTGCCTA TTTGCCCGAC TTGAGTTTGG ACGAAAATGT
TCTCTCCCAC TACACTTCAC ATTTCAACAT TCGGCAATAC TCGGTTCAGG TTGTCTATCT
AGGCTTAGTC ATTATTACAG GAGAAGTCCG CAATTTACCC TCTCAAGCAA TTGCCACTTC
GGTGCAAACT GCGAAAGACG CAGCGGGCAT CAGCTTTGGC GTTGCCGCTC TTTTTATTGG
ATACGCTTGG ATCTTTCTAT TCCGCCCTCG GCCAGCCTTG TCCAAAGTTC CCGAAGGACA
AACATTGCTG ACCACCGGTT TCGTACAAGT TCATCGAACT GGCAAGAAAA TTTGGAAGGA
TTATTGGGCG TTGAAGTGGT TCATGTTCAG TTTACTGTGG TCTCCCGAAG CTGGTGCAGG
TGTCATCCAA TCGATTGCCG TCACATTCTT GACTGTGGTG ATGAAGTTTA CCGGTCTGGA
TTTGGCCAAG GCCATGCTGG TTCTGATGGT TGGAAACATT TGCGGCTCTT TATTTTCAAA
ATGGGTGTGT CAAAAGATTA ATCCGCTCAA TTCGTATCGT TGCGGTCTCA TGTCATTGGC
TGTTTCGATT TTCGTGTCGG CGTGGACATT AAATGGACCA GAAAGGCGCG CAGCCGTTTT
TGGTTTTACG TTTTTTTGGG GGGTCTCTAT GGGATGGGTT AACCCGTCGC AACGTGTGCT
GTTATGCACA CTTATACCGA AAGGTCAAGA GACCGAAATG ATGGGGCTAT TCGTTTTCAC
TGGCCAGATT CTAGGCTGGC TACCACCTCT CATTTTCACT CTAATGAACG AGAATGGCGC
CGACATACGC TGGGGATTCG GGCTGGTCAC TTTCTTTTGT GGTTTTGCGG CAATCTGCAC
ACTCCCAATG GGCAATTATT ACGAGGCTGT TGCATGGGCC GCTCGCCAGT CGGAAGAGAA
GCTGGGAGAA GTTCTCGTAA ATGCTCAATC TCGGAGTGAA AAGCTGCATG AAAGTGCACA
ATCGATTGGG AGCCAAGATT GTGCAACAGA GAAAATTGAA AAATAGACTT GCTAGGCAAC
TAATCTTAGC AGCAGTATAT TTTGCTC
 
Protein sequence
MSTASDDAAE KAAGSNVEDG SLSLASHHFG NETEMPKIEE GQLRRAQSIS EAVENNDDKL 
CNAPWFLKFI EYPFHVKDPT RDVYLPEAAG WAMDSAGRGP LNQVGSYVGQ AILRLATQDA
GCLNPRTCTN TVYGLKPSSL LTATTSIVGV VAAFLMPIVG AVVDHTTHRR LLGLVSGMAA
VVLAGIQISV NANNWFFILC VDGALSFSLL VHTTAVFAYL PDLSLDENVL SHYTSHFNIR
QYSVQVVYLG LVIITGEVRN LPSQAIATSV QTAKDAAGIS FGVAALFIGY AWIFLFRPRP
ALSKVPEGQT LLTTGFVQVH RTGKKIWKDY WALKWFMFSL LWSPEAGAGV IQSIAVTFLT
VVMKFTGLDL AKAMLVLMVG NICGSLFSKW VCQKINPLNS YRCGLMSLAV SIFVSAWTLN
GPERRAAVFG FTFFWGVSMG WVNPSQRVLL CTLIPKGQET EMMGLFVFTG QILGWLPPLI
FTLMNENGAD IRWGFGLVTF FCGFAAICTL PMGNYYEAVA WAARQSEEKL GEVLVNAQSR
SEKLHESAQS IGSQDCATEK IEK