Gene PHATRDRAFT_47323 details


Gene Information

Locus tag: PHATRDRAFT_47323
Symbol:
ID: 7202490
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 296948
End bp: 299520
Gene Length: 2573 bp
Protein Length: 722 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181695
Protein GI: 219122734
COG category:
COG ID:
TIGRFAM ID:
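
The length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive (which is consistent with the reported 2573 bp), and it assumes one stop codon is counted in the coding length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 296948
END_BP = 299520
PROTEIN_LENGTH_AA = 722


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_length(protein_length: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


gene_bp = span_length(START_BP, END_BP)
coding_bp = min_coding_length(PROTEIN_LENGTH_AA)
print(gene_bp)              # 2573
print(coding_bp)            # 2169
print(gene_bp - coding_bp)  # 404
```

The 404 bp difference is sequence within the gene span that is not translated, which fits a record marked as spliced.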


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCACCCGGGA GCGGAAGGCA GTTGAGACTC CACCCTATCG TTCTTCCGTG GAAACTCGAG 
TGCTGCTGTC TCTGTATTCC CCCTTCCCTT GTCCGTTCTT GCGAGTGACC GACGGTGTCT
ATTGATTGTG AATCGTTCTA CCCAAGCAAC GGCGATAACG TTTGCGCTGT GGGTACGGGT
CGGTGTTGGC TCGGTAGATA CAACACGTAA CCACCAGAGC GCACAACTCC TGTCCTTCGT
GTGCAGTAGT TGATTGCCAT GGATCTCTCC GTCTTGGCTG CTGTGGGATC TTTTGTTGTA
TCCGAGCGAG TACTGAAGCC GAAAGCAACC GTTTTGGTTA TTTTGGGTGC CATTTTGGCC
TACGACTTGA CGACCAAAAC ACACGATGTC TCCAGCTTGC GCGTGTACCG CGGTACGTAC
AATCGTTTCA ATAGATTTCC CCTCGTAGCA TAGTACGACA CTCTGGTACT TTTGTTGCGC
AGACACTCGG ATTGCAGCAG AATCTACTTG ACTCACACAT AATGTTGGTC TGATTCAATA
TGTTTGCGTG CTGTCCAGGA CCCGCCTTGT TTGCGTTTAC GCTCATGATG TGTGCGTATT
CCTTACGCAC TTGGCGGCGC AACGGCATTG CCTGTGACGA ACTTCTTTTT CTGCCGGGAA
CTGCACACGG ACAACGGCAC GGCTTGGACG ATGCCAACAA CGTTTCGATG GGCCACGACT
CGGCTCCTCT ACGGACCGTG GAGGCTATCT CGCACTCGCC CGACGAGGGA GACGTGGCGG
CGGGGTGGAC TATACCGGCG CTAGAACTGA CGCGTACGAA CGCTGCGACC GCAAATAGCG
CTGGAGTCGT ACAGAGGAAT AGCCAAAGCC CCGTACGGTC AAGGACCTTG AGCCACGAAT
CGTCCATATC ATCTATACAA GAATTCGTCA ACAGCTGGGA TGAAGACGAC ACGGAGCATT
TGGACGAAGA CAATCGAATA AGCACATCTA CCGGAGCTGA ATCTGAATTC TTTCTATCCG
AAGAGGCTAA TTCAGGCAGC ACAACACCTA GCGGGAACGC GCCACAGACC GGCATACACC
AACGCGGTAG TCGCTTGACC CGCGGAGTGG AACGCTTCCG AGAAAACCAT CCGCAATTTA
CTCGTTTGGG CTCCTTTTTC TTTTTTCGAT CATCGGCTAC GTCCACCCAG TCGGCCGAGT
ACGCACCGTC CGGTCCATCG GTAGTGGGTG CAGCGTTGGA TTTGAGCATG CCTATTTTGT
TCAACTTTCA TCTCTACATT GAGGCCTACA ATCACATGGA CCAGTATGGA TCAGACTTTC
CTGCCAAAAT CCTACCCCTC ATTTTCTTGT CGGTGTTAGT GGTCCGTTCG ATGTTTCCAC
CGGGACGACG GATGCGATTT TGGTCCACCA TGAAATTTAC CGCGACGGCA CCCTTTCACC
GATCACGCTT TCGCGACTGC TTTATTGGGG ACGTTGTTAC TTCGTTGGTG CGACCGTGTC
AGGATGTTTT GTTTGCCTTG TCATACTACG TGACAGTTAT TTGGGGTACG CTCTCGCAAA
CGTACGGGTT GTCTGAAAGT GGAAGTTACT TGGAGCGCAG TTGGATTTTG CATAACGTTG
TGTTGCCGTC GGCGGCATTG CTACCGCTGT GGTGGAAGTT TCTGCAAACC CTTCGGCAGT
CGTACGATAC GGGGAAACGG TGGCCCTATC TCGGCAATGC CTTCAAATAC TTGTCTGCTT
CGGTAGTTAT TTTGTACGGT ATGACGCATC GGGAAGACCG ACGATCAATA TGGTGGCTCG
TGTGTTTTGC TGCATCCATG TTATACCAAA TTTGGTGGGA TACCATCATG GACTGGGATC
TATTTGTGAT CGAAACGCGG TCGGATCAAG CCACGGATAC TGACCAGGTT TGGTTCGCCA
GTTTATCTTC CTACCGACCG AATTCGTATG TCTTGCCTTT CCTGGAGAGT TGCACTCGCC
CGATTCGGAA AACGTTCGTC GCGATCGTGA CCTTTATCCC GAGCTACAAA CAAATCAAAC
TACGACCACA ACGGTTGTAC AAAAGCGAAG CGTTTTACTA CAAGGTTTTT GTATACAATA
CACTCTTTCG ATTTACGTGG ATGCTGTGCT ATATTCCTGC TTACCATTTG TCGGCATCGG
GGGAGGAGCA AGTGACGACT TTTTCGTCGG ATACCAAGAC CTACGTAGGG GTGTTACTAC
CTCTGGCTGA AATTTTGCGT CGCGCACTTT GGGGATTCTT GTTTTTGGAA AATGAGACGA
TCAAATTGCA GAATGGCAAC GCGAGCTACT CACGGATTGA AAGTGTCGAT GAGCCGGATG
AAGAAAATGC TGACCAGTCG GAAATGTCGA GCATGTCGGA TGGCAGTAGT AAGGTGCGGC
TGCCGTCGTG GTTGGGCTCT CCACAGCTGC AAGACGAATC ATCTTTTCGT TTGCGGGATC
GTTTTCGGAG ATTTTTGGAA TGTAACGAAA GAATGCGCCA ACGTCTCTTC ATACTGGAGC
TTTTCTTGTG GGCCGTCGCT TTTGTGGGCT TGGGACTGTG GGCCACAAAC TAG
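
The record reports a GC content of 51%. A minimal sketch of how such a figure could be recomputed from the gene sequence as displayed here (10-base groups separated by spaces and line breaks) follows. The helper name `gc_percent` is hypothetical, and how the database itself computes or rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in a sequence.

    Whitespace (the 10-base grouping and line breaks) and any other
    non-ACGT characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short demonstration;
# paste the full sequence to compare against the reported value.
first_line = "CCACCCGGGA GCGGAAGGCA GTTGAGACTC CACCCTATCG TTCTTCCGTG GAAACTCGAG"
print(round(gc_percent(first_line), 1))
```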
 
Protein sequence
MDLSVLAAVG SFVVSERVLK PKATVLVILG AILAYDLTTK THDVSSLRVY RGPALFAFTL 
MMCAYSLRTW RRNGIACDEL LFLPGTAHGQ RHGLDDANNV SMGHDSAPLR TVEAISHSPD
EGDVAAGWTI PALELTRTNA ATANSAGVVQ RNSQSPVRSR TLSHESSISS IQEFVNSWDE
DDTEHLDEDN RISTSTGAES EFFLSEEANS GSTTPSGNAP QTGIHQRGSR LTRGVERFRE
NHPQFTRLGS FFFFRSSATS TQSAEYAPSG PSVVGAALDL SMPILFNFHL YIEAYNHMDQ
YGSDFPAKIL PLIFLSVLVV RSMFPPGRRM RFWSTMKFTA TAPFHRSRFR DCFIGDVVTS
LVRPCQDVLF ALSYYVTVIW GTLSQTYGLS ESGSYLERSW ILHNVVLPSA ALLPLWWKFL
QTLRQSYDTG KRWPYLGNAF KYLSASVVIL YGMTHREDRR SIWWLVCFAA SMLYQIWWDT
IMDWDLFVIE TRSDQATDTD QVWFASLSSY RPNSYVLPFL ESCTRPIRKT FVAIVTFIPS
YKQIKLRPQR LYKSEAFYYK VFVYNTLFRF TWMLCYIPAY HLSASGEEQV TTFSSDTKTY
VGVLLPLAEI LRRALWGFLF LENETIKLQN GNASYSRIES VDEPDEENAD QSEMSSMSDG
SSKVRLPSWL GSPQLQDESS FRLRDRFRRF LECNERMRQR LFILELFLWA VAFVGLGLWA
TN