Gene PHATRDRAFT_41383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41383 
Symbol 
ID7199174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp324127 
End bp325911 
Gene Length1785 bp 
Protein Length594 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185310 
Protein GI219130309 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGCTA CCAGGAATGA CAAGACGGCG GGAAGGCATG CCAATGCCTA CCGACGAGCG 
ACGCCCTATC GGACCCTTGC TTTGTGTGCC ACGTTTGGAG TTGTTTGCTT TTATCTGGGT
GTCTTGCTGG GTAGTGTAAC TTTCGCATCA CACTCTTCTT GTTCGTCGGC GGAAGAGCTC
AACACACGAG TAGAAGAGCG AGTGAACGAA ATATCTTCCG CCTGGAAGTA CAAACACAAG
CTGGAGCAGG AAGATACCGC CGCACGGATA CCTGCCAATC TACACGATAT AGTTCAAGGC
ATGACCCGCG TGGATCGCAA CGAGTTTGCG GCTCTGTTTC CCATGGGGGT ACCGTTGGAT
CCGTCGTCGC CACAGAACGA TCAAGTCGTC ATTCTGCACA ACTCGCCGCG ATCGTTGCCC
ACGGATCCGT TTGCGGCCGC CGAAGCCTCC TCACAAACCA CCATACCTTT ACTCAGCGCT
GCCGATGCTA CCGAAAATTG TGATAATCTA CACGTGGTGC TCACGGACCA CAATTTGAAG
CGTCGTCAAT GCGTGGCCCT TATGGGTCAA TACGAAGCGT TCCATTTGCA AAAGTTTATG
CGGTTGCCAC AGTCGGGCAA GCTCGACCCA CACCTGCCAC TGCGGCTCGT CAATCGAGGA
GCCCATCAAT CGGGACGGAA GTCAACCAAA ACGCCAACCA TGGAACAGAC AATGCAAGCG
TGGTCGACCT TGACGCCGTA CTTGCAGAAT ATAACCCAGA CGTTGGACAA ATTGAGACCG
ATCGCGGCGT CCGTGGCCGT CGATAACACG ATTGTCGTCA TGGTCTGTAA TCACGGACAG
TCGGAGTTGC TGCTAAACTT CGCCTGTGCC GCCCGAGCGC GGGGGCTCGA CACGGCTTTA
GAGGCCGTCT TGGTGTTCGC CACGGACGAG GAAACTCGGG ATTTGGCGAT CGGATTGGGC
TTGTCGGTTT TCTATGATCC AGTTGTGTTT GGCGAAATGC CCAAGGAAGC TGCGAGGGCA
TATGCAGACG TCAAGTTTCG GGCCATGATG ATGGCCAAGG TATACTGTGT ACAGCTAGTC
AGCATGTTGG GGTATGATTT ATTGTTTCAA GACGTAGACA TAGTATGGTT GCGCAATCCG
CTCGAATACT TTCACAACGA CACATCCAGT GCGAACGACG AGGTCAGCCC AGACTATTAC
GACGTTTATT TTCAGGATGA TGGGAACCAC GCGATATACT ACGCGCCGTA TTCAGCCAAT
ACGGGCTTTT ACTTTGTCCG CCACAACGAC AAGACTCGCT ATTTTTTCAA TTCGCTACTC
CTCGCGGGCG ATTTGATTTT GACGACTAAA TCCCACCAAA TCCCGCTCGT CGCTTTGCTG
CAGGAGCATG CCTCCATGTA CGGACTCAAG GTAAAAATAT TCTCGCGCCT TGAAAACGAC
TTTCCTGGTG GTCACGCCTA TCACCGACGC AAGGACTTTA TGAAGGATTA TTTTGCCGGA
CACGTTAACC CGTATTTGTT CCATATGAGT TGGACCAAGA GCAAAATCAA CAAAGGCAAG
TTCTTTGAGC AAATGGGGGA ATGGTATTTG AGGGACACTT GTGCGCAAAA AACGGCGCAA
CATATTCTTG ACCTGCCGGA TGGTACCAGC GTGAACCAAC AATCTTTAGT AGAACCGTGC
TGCATGGCCA CATCGGTCGT CAAATGTCAT TTTCGAGACA AGGCAAGCAA AATCCCGTGC
AACGACAGTC CAGCTATCGA CAGGAATGGC CGATCTTTTT GGTAA
 
Protein sequence
MVATRNDKTA GRHANAYRRA TPYRTLALCA TFGVVCFYLG VLLGSVTFAS HSSCSSAEEL 
NTRVEERVNE ISSAWKYKHK LEQEDTAARI PANLHDIVQG MTRVDRNEFA ALFPMGVPLD
PSSPQNDQVV ILHNSPRSLP TDPFAAAEAS SQTTIPLLSA ADATENCDNL HVVLTDHNLK
RRQCVALMGQ YEAFHLQKFM RLPQSGKLDP HLPLRLVNRG AHQSGRKSTK TPTMEQTMQA
WSTLTPYLQN ITQTLDKLRP IAASVAVDNT IVVMVCNHGQ SELLLNFACA ARARGLDTAL
EAVLVFATDE ETRDLAIGLG LSVFYDPVVF GEMPKEAARA YADVKFRAMM MAKVYCVQLV
SMLGYDLLFQ DVDIVWLRNP LEYFHNDTSS ANDEVSPDYY DVYFQDDGNH AIYYAPYSAN
TGFYFVRHND KTRYFFNSLL LAGDLILTTK SHQIPLVALL QEHASMYGLK VKIFSRLEND
FPGGHAYHRR KDFMKDYFAG HVNPYLFHMS WTKSKINKGK FFEQMGEWYL RDTCAQKTAQ
HILDLPDGTS VNQQSLVEPC CMATSVVKCH FRDKASKIPC NDSPAIDRNG RSFW