Gene PHATRDRAFT_40174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40174 
Symbol 
ID7195819 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp317447 
End bp320196 
Gene Length2750 bp 
Protein Length897 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184111 
Protein GI219127790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCCCA AACCTCAAGA ACACCAGCAA GGGGAACCCG AACAAGACGC AGACGAGCAC 
CACTCGAGAA CTTCCAGCAA GAGTAGTCTG TCAGGGCGCG AACCACTCCT GTTTGACAAT
GCAACTCTGT TGTCCGCGTC TGGGCATTTA CCATCGGAGA CTTCTCGTAG TACTTTAGAC
GGTGAATTCG CTGTCAATAC TGCTACGGAG ATGGACTGTG ATATCTCACT TTCAATCATG
AATCGTCCAT CACGGGATGT TGTTCTTCAA CGATTATCGG AAGCCCTGCT CCGGAGATCT
CTGACAAAGG TATGTGGTAG CGACATATAT GATCCTTGGT GCCAAGACTG GTGCTATCTA
CAAAGTTCGC TCTGACACAC TTTCTTTCTC TTGCTGCTTT CATTAAACTT TCGTCACTCG
CCGTAATTTC AAAGATTGAC TTGTCCCAAC GAGGAATTCG AACCTCAGAT GCTCGACTCA
TCAACATGGC ACTGGCCCAG AACGCATCCC TGACAACCTT GAAACTGGGA TACAATGATC
TCGGGGACGA CGGCTTGCGA ACATTGGCAA ACGGTATCGC TCGCCACGGG GCTTTAGAAA
GCCTGGATCT AGGGTTCAAC AACATCGGCG ATAATGGTTG CCGCGCTTTG GCCGAAGCAA
TTACTGCGCA ACCCATGTCA CTTTCCAGAT TGCGGACTCT GTATCTTGCA GGAAACGCAT
TGGGCGAAGA TGGAGCATTG GCCATTGCCA AAATCGTCCA GCACGGATCT CTGGAAAAGC
TGTATCTCAC TGGAAATCGC CTAGGACCAG ATGGAGTTAG AGCCATTGCT GAGGCAGCGT
TGGAGCTACA GCTGGAGAAA ATTCACAAAG TCAACATCAA TTGCGTAAAC AGTCTTCAAG
CAAGCAGACG TGGGATCAAG GAACTGTTTC TCGGAGGCAC TGGATTGGGC GGTGTTGGTT
GTCAGGCTAT CGCGGACTTG TTAGGGCAAT CGTCAACTCT GCAAGTTTTG TCTCTGGCGA
ACTGCGACCT AGACAATGAT TCTTTGTCAG TCTTGGCATC CAGTATCAAG TCAAATAGGG
AGCAATTACC CCTGGAGTCT CTTCAATTGT CATTTAACCA GATATCTTGC AAAGGAGTCG
AGAGTCTTTC AAATGCTATA TGGGGATGCC GTTCACTTCG AGAGCTGCTT CTCGACAACA
ATCAGATCGG AGACCGCGGG GCGGGGCAAA TCGCTGCCGT CTTAGCCTCT GCGAATCGCT
TGGAGACCCT AAACGTTGGC TTCAATCGAA TCAAAGCAGT AGGCATTAAG GCGATCATGA
AGACTGTTCC CGAAAGCGAG AGTTTACATT CCCTTTCTCT GTCGGGGAAC ACCGTTGACG
CCAGTGCCGC GAGAAGTATT GCCTATGCTC TGGCATTCAA TCATTCTCTT CTTTCGCTCT
CGCTAGTGAA CACCTCAATT CAACATGAAG GACAGCGACA TATTACTGCA GGAATTGTTT
CCAATAGTCA CATTAAACTC TTGCAACTGA ACGGCTTCCG AATTGGCCCG ATCGTTGTCA
CGCTCGGCTT TCCAGCTGCC TTGGAACATT GGAGCAATGA TCAGATTCTC AACTTTATTC
ATTTAATGTG GGACAAATCC GCTGAGTTGG TGGCACAACA GGAGCACGAG GCAAAACCAG
TCTTTGACAC ATCACGCTTT TTCTCGAAGG CGAATCCTCG AGATCGAGCG GCCCCGTTGG
ACGCCGCGGT CGTGGTTGAC GTGGCAAAGA AGGCCTATGT GGAACTTGTT ACGGAGGGGG
TTGATATTTT TTCAAAGCGA CCTGGCAATA TGCACGAGCT GTCGCCGCTA CCAGGTGATA
ACTTCATAGT AGAGTCGACG AGGAAGGTCG GAGAGAACAG CTATGCCGAA TCGTCGCTTG
AAAGCCATGT TCAAGCTCGT TCTTTCGTGA CATCCCCAGA ATTAGCCGGC TCTGAAACCT
ATGTTCCAGA TCCTCAGCGA AAGAAACGCG TCATTGAGTG GCTTTGTTCT AATATTCAGA
ACCTGAACAA AATGGCCCAG CAGCCCTTTA ACTCGAAAGA GTTATGGGCG CTTCACCAGC
GATACTTTAC GCCAGTCGTC AACGAGTGTG GCGGAAGCGT CAACCCTACT TCAGAAACGT
CCAACAATCA AAACGGGAAT CCAAAATTAC ACGCTTCCAG GGTTTCTCGA TCTAACTCGA
CTGAGAACCC AGCTGATATG ATGAATGATT CAACGGATGA CACACTGATG ACCCAATCGA
GTGACCCTTT CATCTTGGAT TCGCCTCAAG GCATCGTTTC CTTGCCCGTT CTGAAAAGAA
AAGTGTCTTA CCGATTTCTT GGTGATGCAA TGGTAAATTC AGCCCCTCGA ATGTCAAATT
GTGTGGAGAT GCGCGGGCCT GAGACAGAAC AGCCAATTTC GAACGGAATG GTGTCCATGA
TGATTGAAGG AGGCCCGGTT GGCCACTCGA TGCCTCGCAA AACCAAACGT GCACGGAGGA
ATCGCACTCG CATTTCATTT CTACCTCGTG TGAAGGTAAA GCTGGATTCG TATTTGGACG
TCTGTCACGA GAAAGCGTTG ACGATGATGA GGCAACTGTA CTTTGTTGAA CGAGCAATCT
TGCTGGGTCA GTTGAATTCA GATGTGAACT CGATGCCGTA CAGTGCCCGC ATGCACTTGC
ACGGCGTCCT TGCTATGGAC GCTGAAATGA TTTTAGTTGA CATGATATAG
 
Protein sequence
MPPKPQEHQQ GEPEQDADEH HSRTSSKSSL SGREPLLFDN ATLLSASGHL PSETSRSTLD 
GEFAVNTATE MDCDISLSIM NRPSRDVVLQ RLSEALLRRS LTKFALTHFL SLAAFIKLSS
LAVISKIDLS QRGIRTSDAR LINMALAQNA SLTTLKLGYN DLGDDGLRTL ANGIARHGAL
ESLDLGFNNI GDNGCRALAE AITAQPMSLS RLRTLYLAGN ALGEDGALAI AKIVQHGSLE
KLYLTGNRLG PDGVRAIAEA ALELQLEKIH KVNINCVNSL QASRRGIKEL FLGGTGLGGV
GCQAIADLLG QSSTLQVLSL ANCDLDNDSL SVLASSIKSN REQLPLESLQ LSFNQISCKG
VESLSNAIWG CRSLRELLLD NNQIGDRGAG QIAAVLASAN RLETLNVGFN RIKAVGIKAI
MKTVPESESL HSLSLSGNTV DASAARSIAY ALAFNHSLLS LSLVNTSIQH EGQRHITAGI
VSNSHIKLLQ LNGFRIGPIV VTLGFPAALE HWSNDQILNF IHLMWDKSAE LVAQQEHEAK
PVFDTSRFFS KANPRDRAAP LDAAVVVDVA KKAYVELVTE GVDIFSKRPG NMHELSPLPG
DNFIVESTRK VGENSYAESS LESHVQARSF VTSPELAGSE TYVPDPQRKK RVIEWLCSNI
QNLNKMAQQP FNSKELWALH QRYFTPVVNE CGGSVNPTSE TSNNQNGNPK LHASRVSRSN
STENPADMMN DSTDDTLMTQ SSDPFILDSP QGIVSLPVLK RKVSYRFLGD AMVNSAPRMS
NCVEMRGPET EQPISNGMVS MMIEGGPVGH SMPRKTKRAR RNRTRISFLP RVKVKLDSYL
DVCHEKALTM MRQLYFVERA ILLGQLNSDV NSMPYSARMH LHGVLAMDAE MILVDMI