Gene PHATRDRAFT_48702 details


Gene Information

Locus tag: PHATRDRAFT_48702
Symbol:
ID: 7194686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 673333
End bp: 674582
Gene Length: 1250 bp
Protein Length: 398 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183264
Protein GI: 219126017
COG category:
COG ID:
TIGRFAM ID:
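
The gene length listed above agrees with the start and end coordinates if those are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal Python sketch of the check, with an illustrative function name that is not part of any database API:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = span_length(673333, 674582)
print(gene_length)  # 1250
```

Under a 0-based, half-open convention the same coordinates would give 1249, which would not match the listed length.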


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGATTCTGTT TGACGCTGAA CAATTCCCTA CTTGTTGAGC CAAACAGTAC ACTATGAGCA 
ATAATAGGAC CGGCCCACTC AGTTTAAACG ATTTAGTTGC ATTGCTGAAC AGCGGATCTC
CAATCGATGG CTTGGCAAAC ATATTGAGAG GATCTGCGCC GACTTCTCCG TACGCGGCTA
CGGCGCCACC GATCGCCAGA GCTATGGCCG ATGCCACCAT TGGCACCTCT CCCGTCAGCC
GGCTGTGGCA TCAGCTTACC AATCTTAATA ATATTTCCAC TACTGCTAGC CATGGAGGGG
GTCTGAATGC TGAAGAGGCT CTGCGGATAC TGCTGGGGGT TCAAGGGCAA ACGCCGATGC
AGTGTCCAAA TCTGGAAACG GGGGCGCCTG CATTGCAACA GCAATACGCG CATCTCCCAG
CTGTACACAC TGTTCCTCCT TCCGACTCTA ATCTCAGCGC ACAAAAGCTG ATAGACCTTT
TGATTCGACA ACAGCTTCTC ACTCAGACAC ATCAAACAAC CGCGCCTATC GGTGCTGGCA
ACGTGGGGGT CCAGGCTCCG TTCGCCGTTG CGCCATACGC CCCTCCACCG CAGCAGTTAA
ACCCAGACCA AGCCGCGGTG ATCGCCCAGC TATTACGCAA CAACCATCAG TCTGCGGCAC
CGCAACTGCC AGAAGTACCG CAGGCTTTCT TGGAAGCGCG GAAACCCCCG GCAGTGGGTA
ATCTTCCACA TTGGCGAGTA GCGACACACG AGGCCGTACT CAACCAGAAT ACAGCCCCCC
TTACCACTGA TCTACGGCTT GCAACCAAAG AAGTCAAACG TCGTAGTGGT CGTAGTGGAA
GCTTTCCGCA AAAGCTACAC CAGATGCTGA CAGACCTGGA GCAGCAAGGC AGTGACGTAG
CCTCTTTTTC TTCTCATGGG CGCTCCTTTT CGATACACAA GCCAAAGGAA TTCGTTCGCG
ATGTTATGCC CAAGTACTGT CGGATGAGTC GATACACCAG CTTTCAGCGC CAGCTTGCCC
TGTACAACAT TCGTCGCATC ACAGAAGGAC CCAACAAGGG GTCCTATTGT CACGAACTTT
TCGTTCGAGG TCGGCCGATT CTTGCGACAA TGATCAATCG AAACAAAAGC AAAGCCAGTA
AGAAGGTCCA AGTCAGCTCT GAGTCTGGCA ATACTGAAAC CCAAGAAGAA AATGACGAGG
AAGCCAGTCT GACTAGTATG AACGAAAATG GCCAGGACGG TGAAATGTAG
 
Protein sequence
MSNNRTGPLS LNDLVALLNS GSPIDGLANI LRGSAPTSPY AATAPPIARA MADATIGTSP 
VSRLWHQLTN LNNISTTASH GGGLNAEEAL RILLGVQGQT PMQCPNLETG APALQQQYAH
LPAVHTVPPS DSNLSAQKLI DLLIRQQLLT QTHQTTAPIG AGNVGVQAPF AVAPYAPPPQ
QLNPDQAAVI AQLLRNNHQS AAPQLPEVPQ AFLEARKPPA VGNLPHWRVA THEAVLNQNT
APLTTDLRLA TKEVKRRSGR SGSFPQKLHQ MLTDLEQQGS DVASFSSHGR SFSIHKPKEF
VRDVMPKYCR MSRYTSFQRQ LALYNIRRIT EGPNKGSYCH ELFVRGRPIL ATMINRNKSK
ASKKVQVSSE SGNTETQEEN DEEASLTSMN ENGQDGEM
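
Both sequences are printed in space-separated blocks of ten residues, so they need to be rejoined before any length or composition check. The following Python sketch shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the calculation assumes that only unambiguous A, C, G, T bases are present.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CGATTCTGTT TGACGCTGAA CAATTCCCTA CTTGTTGAGC CAAACAGTAC ACTATGAGCA"
seq = join_blocks(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` gives a value that can be compared with the GC content field of the record.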