Gene PHATRDRAFT_32747 details


Gene Information

Locus tag: PHATRDRAFT_32747
Symbol: GapC4
ID: 7197206
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 715026
End bp: 716234
Gene Length: 1209 bp
Protein Length: 336 aa
Translation table:
GC content: 57%
IMG OID:
Product: predicted protein
Protein accession: XP_002177670
Protein GI: 219111837
COG category:
COG ID:
TIGRFAM ID:
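
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 715026
END_BP = 716234
PROTEIN_LENGTH_AA = 336


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
noncoding_len = gene_len - cds_len

print(gene_len, cds_len, noncoding_len)  # prints: 1209 1011 198
```

Under these assumptions the span matches the reported gene length of 1209 bp, and the 198 bp not accounted for by coding sequence is consistent with the record describing the gene as spliced.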


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.274876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCATCA ACGTGGGTAT CAACGGATTC GGCCGCATCG GTCGGTAAGT CTATTGCACT 
CTCCGATACC GCGGACCGTC GTCGACGTGG AGGTCGCTCT GGCCGTTGGT GCTCCCGAAG
ATTCCAGTCC ATGCCGACCG ACCGACCGAC TCTACGCGTA CCCCATTTTG TTCGGTTTTT
TTGGGGTGCT TGGCTTGTTT TGTTCTGCTC ACTAATGTTC CGTCTACTAT TCTATCTTAC
AGTCTCGTCA TGCGCGCGGC GCAAAAGAAT CCCAACATCA AGATTGTCGC CGTCAACGAT
CCCTTCATTC CCGTCGAATA CATGGAGTAC ATGTACAAGT ACGACACGGT CCACGGACGT
GCCGACAGCG TCGTCAAGGC CAACAAGGAA GCCGGTACCA TCACGGTGGG TGAGAACGAA
ATCAAGGTCT TTGGTGAAAT GGACCCCTCC AAGATCCAGT GGGGCAGTGC GGGGGCCGAC
TACGTCGTGG AATCTACCGG AGTCTTCACC ACCACCGAAA AGGCTTCGGC ACACATGGTG
GGTGGAGCCA AAAAGGTCGT CATCTCGGCA CCCTCCGGCG ACGCACCCAT GTTCGTCATG
GGCGTCAACC AAGAGAAGTA TGAGTCCTCC ATGGACGTGG TTTCCAACGC ATCCTGCACC
ACTAACTGTC TCGCGCCCTT GGCCAAGGTC GTCAACGACG AGTTTGGACT CAAGGAGGGT
CTCATGACCA CGGTCCACGC CGTCACGGCC ACGCAGCAGA CCGTCGATGG TCCGTCGCAG
AAGGACTGGC GTGGAGGCCG TGCGGCCTGC TACAACATTA TTCCGTCGAG TACGGGAGCC
GCCAAGGCCG TGGGCAAGGT CATTCCCGCC CTCAACGGCA AACTCACCGG AATGAGCTTC
CGCGTTCCCA CCGCCAACGT GTCCGTCGTG GACTTGACTT GCCGTCTGGA CAAGGGCGCG
CCATACGCCA CCATCTGTGC CGCCATCAAG GCCGCGTCCG AAGGCCCCAT GAAGGGCATC
TTGGGATACA CTGACGAAGA CGTAGTGAGC TCCGACTTTA TCAGCGACAC GCATTCCTCC
ATCTTTGATC AAAAAGCGGG TATCGCCTTG ACGGACGATT TTGTCAAGCT CGTATCCTGG
TACGACAACG AAGCCGGTTA CAGTACGCGT GTGTTGGACC TGATTGCTCA CATGGAGTCC
CAAAAATAA
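
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so it needs to be joined before its length or base composition can be checked. The sketch below shows one way this might be done; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input. Applied to the complete sequence, the cleaned length would be expected to equal the Gene Length field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGTCCATCA ACGTGGGTAT CAACGGATTC GGCCGCATCG GTCGGTAAGT CTATTGCACT"

seq = clean_sequence(FIRST_LINE)
print(len(seq), round(gc_fraction(seq), 2))
```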
 
Protein sequence
MSINVGINGF GRIGRLVMRA AQKNPNIKIV AVNDPFIPVE YMEYMYKYDT VHGRADSVVK 
ANKEAGTITV GENEIKVFGE MDPSKIQWGS AGADYVVEST GVFTTTEKAS AHMVGGAKKV
VISAPSGDAP MFVMGVNQEK YESSMDVVSN ASCTTNCLAP LAKVVNDEFG LKEGLMTTVH
AVTATQQTVD GPSQKDWRGG RAACYNIIPS STGAAKAVGK VIPALNGKLT GMSFRVPTAN
VSVVDLTCRL DKGAPYATIC AAIKAASEGP MKGILGYTDE DVVSSDFISD THSSIFDQKA
GIALTDDFVK LVSWYDNEAG YSTRVLDLIA HMESQK