Gene PHATRDRAFT_47365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47365 
Symbol 
ID7202516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp402786 
End bp404852 
Gene Length2067 bp 
Protein Length603 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181551 
Protein GI219122436 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.776919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTACCATGTA GTTCGGCTCA CTGTTTACAA TTCGACCTCT AGCGAGAGAG ACGTTTCAAC 
CGTAAAGGGG GGGAATTTGC AACCATGAGA CGAGTCCGTG CGTGTCTCGG TGCGTGTGTG
TACGTGACCC ACTTGGTCGC GTCGGGGGCA TTGACGGCGA ACGACCGACG AAGTCCTACC
CGCCCCACAC ACTGGGTGAC CTTGTCGGAT CCCCGGAAGC GGTATCCTTC CGGAAACAGT
CTGTACGCCA AGGATGTCCC ATTAGCAAAC AACAATCGTC GATCCGCAGC GCCGTACTCG
TACTCAAGTA CCCCCGTGAA CGACTTGGAT CAAGATTTTG GCTACTATGG TGAAAAAGAT
GATGACGATG ATTGGATGAG CAATCCCGTG GATTTCGACA ACAGCATCGA CGAAAGCGCC
GTTACTAGCA CGAGAAGACC AAAACTTGTC AAAGGGGCCG ACAATTCACG TCCCAATTCC
CATTTCTTCA GTCGCAAATC CTTACAAGAT CCAATCTTTG CGTACCAAAC GAAGGGGGCC
TCGGAAACCT TTGCCCAATT GTGCCAGGGG GCTGGTATTG CGCGTCCCTC CAAAATTCAG
AGTTTGGCTT GGCCCATCTT GTGCAAAGGC TCCCACACGA TTGTGGCGGA CCAAACGGGA
TCCGGAAAAA CCTTGGCCTA TCTCATTCCT TTGTTGACAC GCGCCTTGGA GGACCGCAAC
GCTCAGCCGG CCGGAACCGC CGTACCCAAC GGATCGCCTC GTATCATCGT CCTGGCTCCG
ACCGCCGAAC TGGCCGACCA AATTCGAGCC GTTTGCGAAC AAATGACCGC ATCCGTTTCA
TTCTCGACCC TTGTAATCAC GGCGACCGGG AAATATTCCA CTTCGATTCG TGATCAAATT
CGTATGCTCC AACGACAACC CGTGGACGTT CTGATTTCGA CACCTGGACG GATCGCCACC
ATTTTGCGAA CGCGCAATTC TGGCTTGGAT TTGAGTGCGT TGCAATCCAT CGTTCTCGAC
GAAGTCGACG TCTTGTTGGT GGACGACACG TTCGGCCCGC AATTGCGTAC GGTCGGGGCG
GCGGCACCCC TGGATCGAAC GCAATTTGTC TTTGTCACGG CAACGCTACC CGACACGGTT
GTCGAAACTG TGGAGAAAGA GTTCCGCGGC GTACAGCTAA TCAAAGGCCC CGGTTTACAC
CGTGTGGCAC CGACCGTGCA AGAAAGACTC GTCGACGTCT CCGTCCCTTC TCAAAACAAC
CGAGACGCCA AACTCTGTTT TGACGTCAAG GCCAAACAAC TACTGAAAGC CTTGCGACAG
ACTCGGTGTC GCCGAACGCT CGTATTTTGC AATACCGTGG AAAGTTGCCG CTCGGTGGAA
AACTTGCTAA AACGCAAGGA TCGCAAGGGC AACGTCTTTG AAGTCCGCGC CTATCACAAC
GCCATGACAC CAGAAAATCG CAACGAAAAT TTGGCCGTCT TTAGTCACGG CATTCGGACT
ACACAACCAG AAAAGGTGGA TTACGTACTG GTGTGCACAG ATCGGGCTGC TCGAGGCGTC
GACTTTGAAA GGGCCCCCGT GGATCACGTC GTCTTGTTCG ATTTTCCCAA AGATCCGGCC
GAATACGTCC GTCGAGTTGG ACGAACGGCG CGAGCGGGAC GGACCGGAAC GAGCACCGTC
TTCGCCTACG GATGGCAACT GCCGATCGCT CGTAGCGTCA TGGGAAGCAA GTTGGATAGC
TTCACCATTG CTCGCGAAGA GCGGGATGAA ATGGATACGG AGGAAATTCG AGGTGGAGTG
CAGGCGCGGC TCCACCGAGG TGACGGCGCA AATAAGAAGC ATGGTTCGAA GCATATAATA
AAGGGTAACA TTGAGAGCGG AAAGCAGTGG AAGTGAAAAG AAGACCCGCC TCTCCTTAAC
AAGGGTCTAT CTAGAGAGAG TTTTGTTGTT CTGCGAGAGT AACTGAGCGA GTAAAGGTAG
TAACGGTTTA CTCGAAATGG CAATTTTCTT TTTTTGACAT TTCAGCTTAA CGAAGCAAAT
TTAGTGAGCA AAGCAATAGC ATAATTT
 
Protein sequence
MRRVRACLGA CVYVTHLVAS GALTANDRRS PTRPTHWVTL SDPRKRYPSG NSLYAKDVPL 
ANNNRRSAAP YSYSSTPVND LDQDFGYYGE KDDDDDWMSN PVDFDNSIDE SAVTSTRRPK
LVKGADNSRP NSHFFSRKSL QDPIFAYQTK GASETFAQLC QGAGIARPSK IQSLAWPILC
KGSHTIVADQ TGSGKTLAYL IPLLTRALED RNAQPAGTAV PNGSPRIIVL APTAELADQI
RAVCEQMTAS VSFSTLVITA TGKYSTSIRD QIRMLQRQPV DVLISTPGRI ATILRTRNSG
LDLSALQSIV LDEVDVLLVD DTFGPQLRTV GAAAPLDRTQ FVFVTATLPD TVVETVEKEF
RGVQLIKGPG LHRVAPTVQE RLVDVSVPSQ NNRDAKLCFD VKAKQLLKAL RQTRCRRTLV
FCNTVESCRS VENLLKRKDR KGNVFEVRAY HNAMTPENRN ENLAVFSHGI RTTQPEKVDY
VLVCTDRAAR GVDFERAPVD HVVLFDFPKD PAEYVRRVGR TARAGRTGTS TVFAYGWQLP
IARSVMGSKL DSFTIAREER DEMDTEEIRG GVQARLHRGD GANKKHGSKH IIKGNIESGK
QWK