Gene PHATRDRAFT_42741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42741 
Symbol 
ID7196127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp943323 
End bp945648 
Gene Length2326 bp 
Protein Length742 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176691 
Protein GI219109876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGACAGTGG GCCCGAACTT ATCACATTTG ATAAACCCAA GTGGTAAAAC CGCCAAGGTA 
CAATCTCTAA CCAAGACGCT GGCGCTCACT TGAAAGCATG ATGTCGGAGG ACAAAAGGAG
CCGTGATGCA CCGTCAGATC TTTCCGAGGG CGGTGTTACT GCTTCTCACA GTATTGAGTC
GCTGGAGCCG AGAAACTCCC TGACCGGCGC CATGTTCCTT TATTGGGTTG TCCCAGTTCT
ATTGTTTGCA GTTTTCAGTC GACTGACGGT AGACACAAAC GTTGGCACTG TCAAAACGAA
ACCTTTAAAG TCGATTCCTA TCCAGCTCGA TCAGGACTAT ATCGATTCGA CACCATCGGT
CAGGCCGACC CAAGCACCCA TCCCAGTTCG GCAAGCCGAT AATCCTTCCT TACCATCCCG
GTGGCCCACA TCGTACCGCG CAACAATAGA GAAAATTGAA CGTCGTCGTC CACACTGGAA
AAACAAGCCC ACTTTGTCTC CGTCTGCTCA GCCTTCGGCA GCCGACAGCA AAAAATTGGA
GGACACCGTT ACTTCCTCAC CGCTGCCTAA TGGTCGCAAC GCTGGTGGCC GTCCTCGGGG
CCGGGCATCG GACCCGAACC GTCTCATGAT CATCGAAAAG ATTGATGCAA TGAGACAGGA
CGTTGTAGAT GACCCTGCCG ATATTTACAA GGCGATTGAA TTTGCGGACG CCTTGCGTTT
CTACGATCTA CAGTACCGGG AAGGTGGTAC TTATGAAACG GAAGCAATTG ACACTTACAA
CAAAGTAGTT GGTCTTGTGG TAGCCAAACG GAATAAGCTG GTTGCTGCAA ATCAACCGAC
CAATGTTTCC TTGAATGGTT CGCGAACGAA GTCCGTGAGC GACGAAGTCA CTCTGGACTA
TGCCTCAAAG AGCGCTGACG GCCTCGTTTG TGCTGTATAC ACGGCGCTTG GGAAAGTCTA
CTACATGGCC AACATGTTTG AACGAGCGGC GAAGAGTTAC ACGGAATGTC TAGAAATTGC
ACCAAGCTAC TTGGATGCGG TTAACGCACG CGCGTCAACG AACATTATCT TGGGCAAGTA
CGCTGAGGCT GGTGCCGATT TTTTGAAAGT CGTTCGGGAG GATGAACAAC GATTGTTTCC
AGATTCTTTC TCCGGAATTG CTCGTGTACT CGAAGCACAA GAGGATGCGA TTCCTGGAGG
GTGGGGGCCA GTAGTTGAGT TGCTGGATCA ATTAATTCCT TCTTTTGAAG CACAATGGGC
GTCTGCCCCG CCCCAGACAA AGCAAATTTT CGGGAATGGT CTCAATCGAT TCCATCATTC
GCTCTTCACC TATCACGACA AGAAGACGAA AGCATATTCC GAGGCATGGC ATCATCTCAC
CGAAGCATAC CAGTACAAAA TGGCAAATTT ACCTGTTTGG CAGTCAGGGC AAGAGTCGAC
AAAATCATTC CAGACCAAGC AAATTTTCAA GCCAGGATTC TGGTCCCCGG GAGTAGGCAG
CGAGACCGAG ACGCCAATCT TCATCATTGG CTTTGTCCGC AGTGGGTCAA CTCTTCTCGA
ACGAATATTG GACGCCCATC CAAAAATTGT TGGCACCGGC GAAAATTCTG TCTTTAACGG
ACGCCTTGAC GATATTCGCA ATAAGATTGT TCAAGTCAGT ATGGGTGGGC GGCGCGAGCA
GCTGGGGGAA GTCACTAGAC GGCTGGCGGA AGAAGTCGTC GATGGCATGC GAAAGCGTTG
GCGAATTTTG CAAGCTACCA CAGAAACGAG CGGAGTTAGA GACGACATCC CACTGCGATT
TGTGGATAAA ATGCTCACCA ACTACTATAA TGTCGGCTTC ATTCATCTAC TGTATCCAAA
AGCCTTGATA CTTCACGTTT ACCGCAATCC AATGGATACG ATCTTTTCGG CTTACAAGCA
TGAATTTCCG AGCGGTACGT TGGACTACAC ATCCGACTTT GACGCTCTAG CCGAGCTTTA
TCACTCGTAC CGTGACATTA TCGACCATTG GGACGACGCT CTGCCGGGAC GCGTAACACA
CGTCCGCTAC GAGGACATGG TTCAAGATAT GCCCGGTATG GCAAGGGCGA TCATCGATGC
CACCGGTTTG CCCTGGGATG ACAGCGTTCT GCAATTCCAC AAGCAGAAGC ACGCAGTCAA
TACTTTATCC ACCACACAGG TGCGCAAGGG AATCTATAAG GACAGTTTGA AATCATGGGC
GAAGTACGAG AATGAGCTTC AGCCAATGGT ACAACTGATT GGCGGGCGTG TCCACTTCAA
TATAAAAGCA ACGCTGCAAC CTGTTCCGAC TAAGGAGGAG TTGTGA
 
Protein sequence
MMSEDKRSRD APSDLSEGGV TASHSIESLE PRNSLTGAMF LYWVVPVLLF AVFSRLTVDT 
NVGTVKTKPL KSIPIQLDQD YIDSTPSVRP TQAPIPVRQA DNPSLPSRWP TSYRATIEKI
ERRRPHWKNK PTLSPSAQPS AADSKKLEDT VTSSPLPNGR NAGGRPRGRA SDPNRLMIIE
KIDAMRQDVV DDPADIYKAI EFADALRFYD LQYREGGTYE TEAIDTYNKV VGLVVAKRNK
LVAANQPTNV SLNGSRTKSV SDEVTLDYAS KSADGLVCAV YTALGKVYYM ANMFERAAKS
YTECLEIAPS YLDAVNARAS TNIILGKYAE AGADFLKVVR EDEQRLFPDS FSGIARVLEA
QEDAIPGGWG PVVELLDQLI PSFEAQWASA PPQTKQIFGN GLNRFHHSLF TYHDKKTKAY
SEAWHHLTEA YQYKMANLPV WQSGQESTKS FQTKQIFKPG FWSPGVGSET ETPIFIIGFV
RSGSTLLERI LDAHPKIVGT GENSVFNGRL DDIRNKIVQV SMGGRREQLG EVTRRLAEEV
VDGMRKRWRI LQATTETSGV RDDIPLRFVD KMLTNYYNVG FIHLLYPKAL ILHVYRNPMD
TIFSAYKHEF PSGTLDYTSD FDALAELYHS YRDIIDHWDD ALPGRVTHVR YEDMVQDMPG
MARAIIDATG LPWDDSVLQF HKQKHAVNTL STTQVRKGIY KDSLKSWAKY ENELQPMVQL
IGGRVHFNIK ATLQPVPTKE EL