Gene PHATRDRAFT_12431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_12431 
Symbol 
ID7201033 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp796946 
End bp799556 
Gene Length2611 bp 
Protein Length801 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180318 
Protein GI219119103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTATTACTGC ACGGATACCT TCATGTTCAA ATCGTCCGCG CCAAAGAATT ACGAGACTTC 
GACTGTATGA TCGGCTGTCG CAAAGCCAGT ACTGCGTTGC GTTGTTGCGA CAATGTTAGC
GACCCCTACG TCACTGTACA CGCTGGAGAT CATCGATTGA TCAAGACCTC CGTTATGACG
AATAGGTTGA ACCCTCATTG GAAGGAAAGT TTTGTTGTAC CGATCAGCCA TCATGTGGAA
GCACTCGAAT TTAGAGTAAA GGACTCGGAC TTCAACGGTG CGATGAATCT TCTCGGGAAG
ACATTTCTCT CTATAAGTGA TATACTTAAA CTCAATAAAG AGGGAAAACC TCGCAGGACA
GGGATTCACA AAGTCGTTCA TTTGGATGGA AAACCTCGAC ATGGGTCGTT TGAATACTTT
GTGGAGTACG TTCCAGCCGA AATGATGAAG GACGGCGTTG CAGTTCCGGG GACATATTTC
AAACCTAAAC AAGGAAACAA AGTCAAACTG TACATAAATG CTGACGACCG CGGTTCAGAA
AAAGGAACTC CAGAAGTGAC ATACGGTGCA AACAATGACC AGGTGTGGAA ACCAAATCGT
CTCTGGAAGG ATATTTACGA ATCCATTTGC CAAGCAAACG AATTGATCTA TATTGCAGGA
TGGGCCGTCG ATTACGAACA GAGTTTTCTA CGAGGAGAGG AACGTGAGCA AGCATTGGAC
AGTGACAAGT ATAGCCCATA CATCGGGGAT CTCTTGAAGG CCAAAGCAGA GGAGGGCGTA
ACAGTAAACG TTCTTGTATG GGACGACCAG ACCTCAAATG GATTTAATGG TGAAGGAATG
ATGGCCACGA AAGATGAGGA GCTCCGACAG TTTTTCAAAG GAACAAAGGT CAACCTACAC
CTCGCACCCA TGTTAGGAGG TGAATCCAAT CCATACTTCG AACAAATTCG AAATTCGATG
TGCTTTACAC ATCATCAAAA GATAGTCATA TGCGACGAAA AATCTGAGTT AGTGGGATAT
GTTGGTACGT TGCTACTTTT GTGTCCTGAC GCTGAGCAGT TTGGCAGCTG CCCGAGTCAA
CTTACCTCAG TTCTTTGCTG TAGGTGGGAT TGATTTGACG TATGGGCGAT TCGATAACAG
TGAATATTCC CTCTTTCGAA CCTTAGCGTC TGACCACAAA GGAGACTTCC ATAATGGATG
CCATATATTG AAATCTGGAG ATACACTTGG ACCTCGCCAG CCTTGGCATG ACATTCATTT
ATGCGTTAGA GGACCGGCTG CGCAGGATCT ATTGCAAAAT TTTGAAGAGA GGTGGCGCCG
TCAAGCTATA TCAGATGCCG ATCAGCTCGT TGACCGTGCA AAAAAAGAAA TTGTCGCCAA
GTCGCTAGAT CAAGACCACG GTGGAGTGTG GAGTACGCAG CTTTTTCGGT CTATTGACGC
TCGTACGGCC AGCTTTGATC CAGAGTTGAT GTCTCATTTT TCTTCGCCAT CCTTCGACGA
AATCAAAGGC GTTAAGTTTC TCAATAGCGG GAAAAAATCA ACGTCGCACC GTAAACTACG
TCGCAAATTT TGGAAGATCA GTGTCGAGTA TGATAGAAGA TTCGTCTCTG ATTCTGCCGA
TGGCTTTGTT TTCCCTCGGA CTTTGGACCA GAAAAAAGGC CGAGCAATAG ACCACAGTGC
CCATGATGCT ATGGTGTATC ACATTCGCAG AGCGCACCAC ACTGTTTACA TTGAAAGTCA
ATACTTCTTA AGCAGTTCAC ACATATGGTC GGAAGACACT GCAACGAAGT GTTACAACCT
AATTGCAGCG GAGTTAACCT GGAAGATCTG CCAAAAGATA GAAGCTAGAG AGCGATTTGC
CGCATACATT GTAATCCCTA TGTGGCCAGA AGGCGTTCCA GAATCAGGAT CTGTCCAAGA
AATTCTGCGT TGGCAAAGGC TGACGATTGA AAGTATGTAT AGGCGTGTGT GTAAAGCTAT
ACAGCGACAG AAAGATCTTG CGAGGCAGTC AGGATCGCAG TTTGATATGG AAGCAACCGA
CTACTTAAAT TTTTACTGTC TTGCAAATCG GGAGACCGAA GATGGAAGTG AAGCCCAAGG
GGTCCCAAGG CCGCAATCAA TCGAAGAAAC GCTAAGCAAG TCCCGCCGGC ACTTAATATA
CGTGCATTCT AAGTTGATGA TTGTTGACGA TGCAGTAGCA TTGATTGGTT CAGCCAATAT
TAACCAAAGG AGTCTCGATG GAACGAGAGA CAGTGAAATT GTTCAAGGGG TATGGCAGCC
TGATCATTTG GCAACAAATA AGAGTATCGC CGTCGGCGAT ATTCATGGAT TTCGCCTACA
CTGCTGGAGT CACCTGACGG GGAAAATGGA GGACATATTC CGAGATCCGT CAAATTTGGA
TTGCGTTCGT CGGCTGAATA CCATTGCCAA AGAAAACTGG AAAATATTTT CGCAGGAACA
AGTCGCGGAA ATGAATTCGT ATCTGGTTTC CTACCCAATT CGTGTTGATG CGGACGGAAA
GTTGTCAGGT ATCGAAGGCG ATGTATTTCC TGATACGAAG GCCCAAATCC TGGGTTCGAA
GTCCTTTCTA CCTGAGTACT TGACAACATA A
 
Protein sequence
VLLHGYLHVQ IVRAKELRDF DCMIGCRKAS TALRCCDNVS DPYVTVHAGD HRLIKTSVMT 
NRLNPHWKES FVVPISHHVE ALEFRVKDSD FNGAMNLLGK TFLSISDILK LNKEGKPRRT
GIHKVVHLDG KPRHGSFEYF VEYVPAEMMK DGVAVPGTYF KPKQGNKVKL YINADDRGSE
KGTPEVTYGA NNDQVWKPNR LWKDIYESIC QANELIYIAG WAVDYEQSFL RGEEREQALD
SDKYSPYIGD LLKAKAEEGV TVNVLVWDDQ TSNGFNGEGM MATKDEELRQ FFKGTKVNLH
LAPMLGGESN PYFEQIRNSM CFTHHQKIVI CDEKSELVGY FFAVGGIDLT YGRFDNSEYS
LFRTLASDHK GDFHNGCHIL KSGDTLGPRQ PWHDIHLCVR GPAAQDLLQN FEERWRRQAI
SDADQLVDRA KKEIVAKSLD QDHGGVWSTQ LFRSIDARTA SFDPEFVSDS ADGFVFPRTL
DQKKGRAIDH SAHDAMVYHI RRAHHTVYIE SQYFLSSSHI WSEDTATKCY NLIAAELTWK
ICQKIEARER FAAYIVIPMW PEGVPESGSV QEILRWQRLT IESMYRRVCK AIQRQKDLAR
QSGSQFDMEA TDYLNFYCLA NRETEDGSEA QGVPRPQSIE ETLSKSRRHL IYVHSKLMIV
DDAVALIGSA NINQRSLDGT RDSEIVQGVW QPDHLATNKS IAVGDIHGFR LHCWSHLTGK
MEDIFRDPSN LDCVRRLNTI AKENWKIFSQ EQVAEMNSYL VSYPIRVDAD GKLSGIEGDV
FPDTKAQILG SKSFLPEYLT T