Gene PHATRDRAFT_50216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50216 
Symbol 
ID7199002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp14041 
End bp15348 
Gene Length1308 bp 
Protein Length411 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185188 
Protein GI219130052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00633421 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTCG TTTTCCAGTA CGGAATCGCC CCCGCTATAG TTGACGCCAA CGATTTGACT 
AGCTTTGTTA GGGACAAGTG GACGGATGGA TGTTCCCAGT ATGTCGACAT CGACGTGGTC
AAGTCTTGTG CTGGCAACAA TGGTGTTTAT CGCGTATCGG CTGCTTCGAC GCTATTCTTT
GCATTCGCGG CGCTGGGAGC TCTACTTAAA CCCACGGCCA ATCGAGAAGC ATGGCCGGCC
AAATACACTC TCTACTTCTT TCTCTGTATC GTCACTATTT TCATTCCCAA TGATCCGCTT
TTTTCTGACG CCTACTTGAA CATTGCACGT ATCGGCGCAG TCCTTTTCAT TGTTGTTCAG
CAGCTTGTCA TTGTTGACAT GGCTCACGAA TGGAATGACA GCTGGGTCGC TAAGGCGGAT
GCCGCAGAAG CACAGGAGGC TGGGTCCGGG AAAAGGTGGC TCGGTGCTAT TGTGACTGCT
TGCATAATGC TCTTTGGAAT ATCCATCATT GCAATAGGCG TCATTTTTTC TCGCTTCACG
GGATGTGGCA CAAACAATGG ATTTATTACT GTCACGCTTG TGCTCGGCGT CTCAATTGTC
GGTGCGCAGA TGTCTGGCGA AGAAGGTTCG TTGCTAGCCA GTGCCTGCGT CTTTGCGTGG
TCTGTGTTTT TGTGCTACAC AGCCGTTTCC AAGAACCCTG ATGCATCCTG TAATCCTATG
CTGGGCGAAA TGGATACTGT GAGTATTGTG CTGGGCTTGA CCGTGACGGC AATTAGCCTT
GGATGGACGG GATGGTCGTA CACGGCCGAA GACAAGCTGC GGTCGTCTTC TGAAGAGGAA
AGCGCTGCTG CAGCCACGGC CAGGGCCAGT GACGACTCCG AGAAAGATGT CAGGCGGGAC
GTCACAGGTG TGGTCACAGG CAACGACTAT GGAACGCAAG ACGACGAAGA GCAAGCTAAC
AGTGCGGGTC ATGCCGAAGT GGATGAATCA GTCTTGAACA ATCCTAGCCG TCTATCCAAT
TCATGGAAGC TGAACGCCAT TCTAATGAGT GTATCATGTT GGAAGGCTAT GGCTCTAACC
AATTGGGGCG CGATTGTGGC CAATGGCAAT GCTGCTAATC CTCAAGTCGG CCGTGTTGGG
ATGTGGATGG TTATTGCCTC GCAATGGCTT GTTCTGACGC TGTACTTGTG GACATTGCTG
GCACCGAGAC TCTTTCCCAA TCGCGAATTT GGCTGACTTT GTCTGATTTG CAGGATTGTT
GGAGATAACA GCAATGTTTC ACAATATTGT ATAAGTTGAC GGTCTAAT
 
Protein sequence
MALVFQYGIA PAIVDANDLT SFVRDKWTDG CSQYVDIDVV KSCAGNNGVY RVSAASTLFF 
AFAALGALLK PTANREAWPA KYTLYFFLCI VTIFIPNDPL FSDAYLNIAR IGAVLFIVVQ
QLVIVDMAHE WNDSWVAKAD AAEAQEAGSG KRWLGAIVTA CIMLFGISII AIGVIFSRFT
GCGTNNGFIT VTLVLGVSIV GAQMSGEEGS LLASACVFAW SVFLCYTAVS KNPDASCNPM
LGEMDTVSIV LGLTVTAISL GWTGWSYTAE DKLRSSSEEE SAAAATARAS DDSEKDVRRD
VTGVVTGNDY GTQDDEEQAN SAGHAEVDES VLNNPSRLSN SWKLNAILMS VSCWKAMALT
NWGAIVANGN AANPQVGRVG MWMVIASQWL VLTLYLWTLL APRLFPNREF G