Gene PHATRDRAFT_41002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41002 
Symbol 
ID7198923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp107494 
End bp108921 
Gene Length1428 bp 
Protein Length475 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185051 
Protein GI219129764 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGG ACCGTTCCGA CCTTTGGATC GGCCTTGTAG CAGGATCTCT GAGCACTGTG 
ATAGCCACAT GGTGTCTCCA GCGATATCAA TCGACGAAAA ATCCGCAGAC GTTGTACTCA
CCGACACTTA ACCAGACGCC AACGGCATCC ACCCTCTTAC CCGACGACAT TCGCGACGAG
CAACTCTCCC GGCATTTGCT TTACTTTGGC GAGGATGGCA TGGACCGACT GAAACGTTGC
AAGATTTGTG TCGTTGGACT GGGTGGAGTG GGAAGTCACA CTGCTCATAT GTTGGCCCGC
GCCGGGGTGG GGTACCTCCG TCTCATTGAT TTTGACCAAG TCACCTTATC CAGTCTCAAT
CGACACGCCT GCGCCGTCCT CGCTGACGTT GGCACTCCCA AAGCAACCTG TCTAGCGAAG
TTTTGTCGCC GCATTTGTCC CGATCCGACG AAACTGGTTC TCGACACACG TGTGGAAATG
TACACCGCCG ACACCGGCGC CGCGTTGCTG TCTCTGCCAG ACGGCGAGCA CTGGGATTTG
GTCGTGGACG CCATTGATGA CGTACCGACC AAGGCGGTGC TTCTGGCTCG TTGCTGCCAA
ACCCAAACAC GCGTAGTCTC TTGTATGGGG GCCGGAGGCA AAGCCGACGT TACGCGCTTG
CACGTGTCCG ATTTGCGCAC GGCATCCCGC GATCCTCTGG CCACCAAGCT ACGGCAACAT
CTCAAAAAAT ACATGGCGGA CCACAGCGAC GACCAAAAAA GTGACTACCT CGATAATATG
GACAAAATAT CCATCGTGTA CAGTACCGAA AAGCCGGTGG TCAAGTTGGC GGATTTTACC
GCCGAACAAA AAGAAGCCGG CGTGCACCAA TTTGGAGCCG TCGACGGGAT GCGAATCCGA
GTGATTCCCG TGCTCGGTAC CATGCCTGCC ATTATGGGGC AGGCATTGGC GGCCATGGTC
CTAACGCAAG TTGGTAACAA ACCCTTTCAA CCCGTGACGG GAGAACGAGT GGGAAAAAAT
GTACGCAACA AATTGTTTCA GCATTTGCAA ACACGGGAAG ACCGCATCCA AAAGCGAGTA
CTGCAAAATA CCACACGCGA CGACGTAGCA ACCATCGCTA CAACCGGTGG TACCGTTGTC
GACAGTGTCT GGATCGGCCC GTTGCAGATC GACCGGGACG ACGTGGAATA CTTGAACGAA
ATATGGCGGA ATCGGTGCGG CGTCACCAAC GCTCGCTTGG GCACCACGCT GGAGCTCGTC
CGCTGGAATA ATGCAAAACC TTCACGATGT GACAATCTAG TGCTCATGTG CACCGCCGCG
ATCCAAGCTT TTGATAAACC AGGGGGAAAG GAGAAAATTC CCGCCTACGT CGTTCAACGC
ATCGAAGAGC GGTTGGCAAC CTGCCAAAAT GATAGATTAG CCTACTAA
 
Protein sequence
MKKDRSDLWI GLVAGSLSTV IATWCLQRYQ STKNPQTLYS PTLNQTPTAS TLLPDDIRDE 
QLSRHLLYFG EDGMDRLKRC KICVVGLGGV GSHTAHMLAR AGVGYLRLID FDQVTLSSLN
RHACAVLADV GTPKATCLAK FCRRICPDPT KLVLDTRVEM YTADTGAALL SLPDGEHWDL
VVDAIDDVPT KAVLLARCCQ TQTRVVSCMG AGGKADVTRL HVSDLRTASR DPLATKLRQH
LKKYMADHSD DQKSDYLDNM DKISIVYSTE KPVVKLADFT AEQKEAGVHQ FGAVDGMRIR
VIPVLGTMPA IMGQALAAMV LTQVGNKPFQ PVTGERVGKN VRNKLFQHLQ TREDRIQKRV
LQNTTRDDVA TIATTGGTVV DSVWIGPLQI DRDDVEYLNE IWRNRCGVTN ARLGTTLELV
RWNNAKPSRC DNLVLMCTAA IQAFDKPGGK EKIPAYVVQR IEERLATCQN DRLAY