Gene PHATRDRAFT_38966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38966 
Symbol 
ID7203772 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp660695 
End bp662494 
Gene Length1800 bp 
Protein Length555 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183005 
Protein GI219125472 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00578111 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTATC TTAACGCCAT CACCCTTTTG ACTTTTGGGT GGATGCCTTC GCTCTCGGCC 
CAAGTCCTCA CGCACCGCAA GCTCCAACGT CTGCCGCAAA GCCAAGTGAT CCCAGGGCAG
TACGTGATTG AGCTGGATCC GAGCATTCCC GATTCACAAG GATTTGCCGA AAAAGTCCTC
AAACGAACGT TTCGCAAGAA CATAATTGAG ACTTACGACT ACGCCATGAA AGGATTTGCT
GTCAAGGATG TCCCCGATAT GGTGTTGAAT TTCATACTCA ACATGGACGA CGTGCTATCT
GTGTCGGAAG ATGGTATCGT TGAAATAGAG GCTATCCAAA ACAATCCGAC TTGGGGTCTC
GATCTCGTCG ACGGATCCGA CGACAATCGC TATACATACA CATACACGGG GCGAGGTGTC
GATGTGTACA TCGTTGACAC TGGCATTCAA GCAAATCATC CCGATTTTGA AGGTCGAGTG
AGGAGCTGCG TCTCCTACAC TGGAGAATGT AAGTTTGCAT TCCGCATCTA GAGAAATCCT
TTCTCCCCCA CAAAAATCCG AAGTCTTACG TGCGCGTTTT TCCTCTTGTA GCATGTGGGT
CGGATCTGAA CGGACACGGT ACACATGTGG CCGGAACCGT TGGTTCGAAA ACCTACGGTG
TGGCCAAGCA GGTGTCGCTG CACGACGTTA AAGTGCTGAA CCAAAGGGGT AGCGGGTCCT
ACAGCGGCGT GATTGCAGGC GTTGACTACG TTACCAAAAT CAAAGAAAAT AACCGAAGTC
GCAGAATCGT CATTAATATG AGTCTCGCGG GTGGCGTATT CGCGAGGCTA AACAGCGCTA
TTGACTCCGC CGCAGCCCAA GGCGTCGTCG TCGTAGTGGC CGCTGGAAAC AGTGGCGCCG
ATGCCTGCAA CGCCTCTCCT GCATCAGCTA GCGGTGCATT GGTAGTCGGT GCCATTAATG
ATCGCAATCA GCGTACAAGC TGGTCTAATT GGGGCAGCTG CGTCGATATT TTTGCCCCAG
GGACCGGGAT CCTGTCGACA GCCAAAACTG GTGGCACGAG CACGAAGTCG GGTACGTCCA
TGGCATCACC GCACGTAGCC GGCGTTGCCG CCTTGTATTT AGAGTCAGGC AGAAACACCA
ATTCTATCAC CTCCGATGCG CGGACTGGCC AACTAGGCAA CTTGGAAGGA TCCCCCAACC
GACTTGTGCG CACTTCCCGA TTACCCGCTA GGAACACTCC CCAAGACGAT ACCGATGCAC
CGGTCAGAGC ACCTACTCGC CCACCCACTC GTGCCCCAGT TCCTGCTCCA ACCCGACGGC
CGACGCGTGC CCCGACTCGC GCCCCAGTCC CTGCTCCCAC CCGTCGGCCT ACACGTGCCC
CGACTCGCGC CCCGGTCCCT GCTCCCACCC GTCGGCCTAC ACGGGCCCCG ACTCGAGCTC
CCACCCGACG ACCCACTCGG GCCCCGACTC GAGCCCCCGT CCCTGCTCCT ACCCGACGAC
CTACTCGCGT CCTAACGCGA GCCCCTGTCA CTCCTCGGAC CCGAGAGCCC ACTCGTGCCC
CTGTTCCAGC TCCCACGCAG CCCCCGGTTG CTCCGCAGTG TTTGCCTGCA GGTGAACTCT
GCGAAAGGTC CAGCATCTGT TGTGACACGA TGAGCTGTAG CCGATCCTGG ACACCGTCTC
GTGGACTCCA CAGCTCTTGT CGATCTGAAT CTGGCTGGTG GGGCTACTAG TAGTCTCGCA
CTTGGGGACA ATTGCTTTTT GGCAATGCTG TCTTTGGCCA AAGTGTCGGA AGACATATAG
 
Protein sequence
MKYLNAITLL TFGWMPSLSA QVLTHRKLQR LPQSQVIPGQ YVIELDPSIP DSQGFAEKVL 
KRTFRKNIIE TYDYAMKGFA VKDVPDMVLN FILNMDDVLS VSEDGIVEIE AIQNNPTWGL
DLVDGSDDNR YTYTYTGRGV DVYIVDTGIQ ANHPDFEGRV RSCVSYTGES CGSDLNGHGT
HVAGTVGSKT YGVAKQVSLH DVKVLNQRGS GSYSGVIAGV DYVTKIKENN RSRRIVINMS
LAGGVFARLN SAIDSAAAQG VVVVVAAGNS GADACNASPA SASGALVVGA INDRNQRTSW
SNWGSCVDIF APGTGILSTA KTGGTSTKSG TSMASPHVAG VAALYLESGR NTNSITSDAR
TGQLGNLEGS PNRLVRTSRL PARNTPQDDT DAPVRAPTRP PTRAPVPAPT RRPTRAPTRA
PVPAPTRRPT RAPTRAPVPA PTRRPTRAPT RAPTRRPTRA PTRAPVPAPT RRPTRVLTRA
PVTPRTREPT RAPVPAPTQP PVAPQCLPAA DPGHRLVDST ALVDLNLAGG ATSSLALGDN
CFLAMLSLAK VSEDI