Gene PHATRDRAFT_37855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37855 
Symbol 
ID7202655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp235460 
End bp237043 
Gene Length1584 bp 
Protein Length527 aa 
Translation table 
GC content60% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182031 
Protein GI219123437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTCGG TCAAACGGCG GTCGTCGGTG GTTCCGGTGA GCCACGGAGT GCACGAAAGT 
AACGGCAACG CCAACGGCAG CCCGCAACAC CACGGAATAT CCCCGCTGTC GGGCATCGCC
CTGGCTTCGG GTGGACCGAC GTCCTTCGAC CCTCCGACGG CATCTTTGTT TCCGTCGGCG
GACACGACTC ACGGTGGTGA AAAAGGTCGG CTCTGGCGTA GACGGAGGAA ACGTTGGCAA
CGAAGACTGC GGATTGGGTA CGGACGTTGG CAACGACTAC TTCTACAAGT TTTTCTATTG
CTATTATTGG CAGTTTTCTG TGGTTTCGTG GTACGAGTAT TCGTTTTTCG CAACTCATCT
TCAGTGTCGT CAACCGAAAC GGACGCCCTT CCGAGCATTC CCTTTCAAAC CAACTTCCCC
GATGCTCCCG TTTGTCACAG TCTGTCCCCG GACGACGTAT CCTATACACT GGTCACGCAG
TTGAGTCAGG ATCGCTTGTG GATGATGGAG CACCACTGTC AGCGATGGGG TCCATCCCAT
CCCATGTCCA TCGCTGTATT CACCAACCAA ACCGTCGCAG AAGTCCGCTC CCAACTCGTC
GCGTTGGGTT GTGCACCGGA GCAGCTCGCC TCCGTCCAAA CGTTGCCGTC CACGGCGGCG
GCGGTGTCCG ACTACCCGGT CAACGTCTTG CGCAATCTCG CCTTTCGCGC CGTCACCACT
ACCCACATTG TGTACGTGGA CGTGGACTTT TGGCCGTCCG CGGATTTGCA CGCCACGTTG
TCCGGGGCCC GCATTCGGCA CGCGCTAGCG CAGAACGAAC GCACCGCCCT GGTCATCCCC
GCCTTTCAAC TGCAACGCCA GTGTCGTGCG TGGAAGGAAT GTCCGGATCA AAACGTGCCG
GTCATGCCCA CGCACAAGGC CGCCCTCGAA CGACTTTCCC GAAACCGACA GGCCTTCCCG
TTCGATCCCA CCAATGTGGG AGGCCACGGG TCCACAAAGT ACCGGGCGTG GATTAAAAGC
CAACCCGACG GCGTGCTGTT GGAAATTCCG TGCGTACTGT CGAACCGGTA CGAACCGTAC
CTGGTGGTGC GCTACTGCGA CGTCCTCCCG CCCTTTCAGG AAGCGTTTTC CGGCTACGGC
AAGAACAAAA TGACGTGGGT CCTGCAACTG TTGCACACGG GATACCGTCT GTTCCAAATT
CCGCAATCCT TCGTGACGCA CTATCCGCAT CTGGATTCCC CGTCGCGCAT GGCGTGGAAC
GGGGGTCGGG GTGGGGCGCC GTTGCCGAAA CCGCGGGCGG CGGACGGGGC GCCGAACAGA
ATGCGTGGCA ATGGTGATAG TGCTGCTGGT ACGGTCGACT GGTTGCGGTA CCGACGGGGC
CGTGTCGACC ACGTGTTTGT ACAATTCCGG GAGTGGTTGC GGACGATGGT GACGGACGCC
CGGGTGGTCC CGTACTGCGA GTCGGCCGAA GATGATGACG GTCGGTTGTG GATTGATCAC
GACACGGATA CACCGCCGGT GCGGAAACGA CTCAACCCTA ACGAGCAAGT GGGCGATGCC
GGACTTCCCC GAACCTCTCG ATAG
 
Protein sequence
MPSVKRRSSV VPVSHGVHES NGNANGSPQH HGISPLSGIA LASGGPTSFD PPTASLFPSA 
DTTHGGEKGR LWRRRRKRWQ RRLRIGYGRW QRLLLQVFLL LLLAVFCGFV VRVFVFRNSS
SVSSTETDAL PSIPFQTNFP DAPVCHSLSP DDVSYTLVTQ LSQDRLWMME HHCQRWGPSH
PMSIAVFTNQ TVAEVRSQLV ALGCAPEQLA SVQTLPSTAA AVSDYPVNVL RNLAFRAVTT
THIVYVDVDF WPSADLHATL SGARIRHALA QNERTALVIP AFQLQRQCRA WKECPDQNVP
VMPTHKAALE RLSRNRQAFP FDPTNVGGHG STKYRAWIKS QPDGVLLEIP CVLSNRYEPY
LVVRYCDVLP PFQEAFSGYG KNKMTWVLQL LHTGYRLFQI PQSFVTHYPH LDSPSRMAWN
GGRGGAPLPK PRAADGAPNR MRGNGDSAAG TVDWLRYRRG RVDHVFVQFR EWLRTMVTDA
RVVPYCESAE DDDGRLWIDH DTDTPPVRKR LNPNEQVGDA GLPRTSR