Gene PHATRDRAFT_47117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47117 
Symbol 
ID7202030 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp506761 
End bp509063 
Gene Length2303 bp 
Protein Length743 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181218 
Protein GI219121739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCTG ATGCGACCTT TACGATACTC AACAGTTGCA GGAGTTGGAT ATCGAAATTT 
TTGTTAGTTA AACGTCTGCC CTGGCGTCTC TTGTGTTTAT CTGCATCGCT CTTTTCGCTG
TATATTCTCT TGCTCCGAAA ATCTTTATTC GGCGATGGTC CCACCTTGCA GGGAGGTTCG
ATAGGTCGGA AATTCGACAA TTCCCTCAAG CGCCTTCTAT TTCGAAGACG AGCTCGAATG
TATCGGCAGC GCTTCAAAGC CGAATTTGGT TTCGATTTAC CCCGAGTATC GGAAAAGTCT
GCAAATTCTA TCGGAGTTCC TGTTTTGCTC AATACATGGC TTGCTGAATC ACTCTTTCTG
GCCAAACATC GCAAAAACTC TCGGATGACT GCAGAATTGT CTTTGAGTGC GGTGGATTCG
GAGACGCACG TTTTTCGACT GATGCAATTT GATGTGAGTA GCTGGTACAG GCCTCTCTCT
TCCTGGCAAG GAAACAGCGC AGTGATTTGG AATGAGATTG AAATCGAGCA ATGGATTGAC
ATCAAGTTTG CAAGCTTTCA AGGTATCGGC GCGATGTGGT CAAAGCACGA GCGAATTCAA
TTCTGGGGTA TGTGCGCAGT ATATTTTTAT GGTGGAGTTT TCGTGGATCA CAATATACGC
TCGAGGACAG CTCTCCCTAT TTTTAAGGCG GCTTTCGGCC AGGATCGCTT TTGGCACCAG
CTTAACGAGG ACGGTTCCCT TCATTATCTT GCCTCAACAC CCAAGCACCC AAAGCTCGAG
TGTATTTTGG ACGAAATCTT GACTCGTCGG AACGAGCGTG GGGCAAACTC TATTGCTTGG
TCACATGTAA CACAATTACT ACAGCTACAT ATCTGGACAG GTTTTGATAA ATACAAGCCC
GCGTGCTGTC CTATAGTTTA CGAACAGAGG AACTGGTCCC TATCTTCGAC ATCGAAGCAA
GATTTCCTCC CAGTTAATGA AGAGATGGTT GTCGCACCGT CAGCCCTAGT CAAACGCTTC
GATGTATCAG TTCAGGAGCA GCCATCCACG AGGTCGGTAA TTGGGATTCC GAAAGAACCG
TGGAGCAAGG TGCTCAACGA CAATCAATGC TCGCCGGGAT GGCTGTGCAA CCGTTGTCTA
CGCTTCCCAT GGTTTGGGAG CTTCTCCAAT TGTAAATCTG TATGCCGCTC TTGTTACACA
GATCAAATAT GTGCGGCCAA CGACATTGAC CTAACAGATG AGATTGTAGT TGAAGTCATT
GTCCGCGAGC GTCCTGGCAA CCATACCAAT CGTATTCCTC GGATCATACA TCAAACGTGG
TTTGAAGAAC TACATACAGC GCGGTACCCT CATCTGCAGC GATTGCAAAA TTCATGGAAA
GCATCGGGAT GGGATTATCG TTTCTACACC GACGAAGACG CTCGGATGTT CATACAGAAG
AATTTTGCTA AAAGGTTCAC TTCTGCGTAT GATGCCATTA TTCCGGGGGC GTTCAAGGCC
GACTTTTTTA GACTACTTGT ATTGCTAAAA TACGGTGGTA TTTACAGCGA TTTCGACGTG
CAACTCGATA CTAACTTGGA CTACTTTGTC ACTAAAGACC TTTCATTTTT TGTTCCGAGA
GACGTTGCAA TCGATCATTG GGCTGGGGGG AATTACTGTG TTTGGAACGG TCTTATTGGT
GAGTTTTTAG GCCTTCCTCG ACGACTGTAC TGGGCTTATC CTGCTAACCG TTTGCCTTGG
GATTTGTAGG GGCAGCTCCC GGTCATCCAA TCGTTGCACA GGCGGTCGAG GATATTTTGA
ATCGCATTTC GAGGAGGGAG GACTATCTTG ACATAGAAAG CAGTCTTTGT CGTGGAAACC
TTGACGCTGA AATATGGAAA CTCCGAAGCT TCCCCATCCT CCTTGTGACA GGTCCTTGCG
CCTTGGGAAT ATCGCTGAAC AAAGTCTTGG GCCATCACAA CCTGGTCAAT GAGATCCTTC
CTGGATGGAT GATTTTCTCG CAACATATGA CGGAGGACAA AGCTGAAATG AGTGATAATT
GGGGAGATAT TCTGATCTTG CATACCGATC GACACGATTT AGGGGAGCTA CGCTTCTCCG
ATCTTGGGAG AAACTTGCTT GTTGCTTCGT CAAATCAAGA CTATTTCGCT AGATCTGCAG
TCCTTTTTGA GGCCGATCCA CAAAAGATGC CTCAGCATTA CAGCAAAAGT GAGAGTGATA
TAGTGGGTTC AACAGCGACT TACAAAGATG ATAAGGTTTC CAAGGAACGA GTTGTCGTAA
AGGTCACATT CACAGTGAGG TGA
 
Protein sequence
MLADATFTIL NSCRSWISKF LLVKRLPWRL LCLSASLFSL YILLLRKSLF GDGPTLQGGS 
IGRKFDNSLK RLLFRRRARM YRQRFKAEFG FDLPRVSEKS ANSIGVPVLL NTWLAESLFL
AKHRKNSRMT AELSLSAVDS ETHVFRLMQF DVSSWYRPLS SWQGNSAVIW NEIEIEQWID
IKFASFQGIG AMWSKHERIQ FWGMCAVYFY GGVFVDHNIR SRTALPIFKA AFGQDRFWHQ
LNEDGSLHYL ASTPKHPKLE CILDEILTRR NERGANSIAW SHVTQLLQLH IWTGFDKYKP
ACCPIVYEQR NWSLSSTSKQ DFLPVNEEMV VAPSALVKRF DVSVQEQPST RSVIGIPKEP
WSKVLNDNQC SPGWLCNRCL RFPWFGSFSN CKSVCRSCYT DQICAANDID LTDEIVVEVI
VRERPGNHTN RIPRIIHQTW FEELHTARYP HLQRLQNSWK ASGWDYRFYT DEDARMFIQK
NFAKRFTSAY DAIIPGAFKA DFFRLLVLLK YGGIYSDFDV QLDTNLDYFV TKDLSFFVPR
DVAIDHWAGG NYCVWNGLIG AAPGHPIVAQ AVEDILNRIS RREDYLDIES SLCRGNLDAE
IWKLRSFPIL LVTGPCALGI SLNKVLGHHN LVNEILPGWM IFSQHMTEDK AEMSDNWGDI
LILHTDRHDL GELRFSDLGR NLLVASSNQD YFARSAVLFE ADPQKMPQHY SKSESDIVGS
TATYKDDKVS KERVVVKVTF TVR