Gene PHATRDRAFT_47492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47492 
Symbol 
ID7202596 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp766577 
End bp768928 
Gene Length2352 bp 
Protein Length640 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181625 
Protein GI219122591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00475484 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTAGCAGCAA CCCTGTCTCG CGACGAACTC TATTTCACCT GTCATTCACT GTAGTCATTC 
ACACGGCTTT GGTCTTTTTG TGCAACCAAC GAAAAACGAA CCAAGTGGAG GCGGGAAGAA
GGAGACCTTT TACATACCGT TGGGCCGGGC CATTGAGACC AGGGATTGAC TGACTGCGAG
TGTTATTCTC TCTCGTTTTC TCACACACAC GCACACATAC ATACACACAC AATCACACAC
ACATCCACAG TCCATCATGG CGCTCGGTAC ACGATCCAAC CAGACACTAG TGGTTGGTTT
GGCGGCTGCG GCGACCGCTA GTCTCTTGTT GTACTACGTC GTGCAAGCCC GGACCCGACC
AACTTCGGAT TCCTCGCCCC GCGACACCAC CGATCGCAAG TCTCGATCGG TAGAATTCAC
CACACCGTCA CCGAAAAAGA CCTCCGGGCC TTTAACGAAG GCGGTCGGGA AAGGAGAAGC
AGCGGACGGA TCCGAGGACC AGACACCCAT TGTGAAGAAC ACGGCGAAAG ACGCGGAAAA
AGAAGTAAAC GCCAAAATTG AACTCTTGGA CAAGAAGGGG AAGGATCTCT TCAAGAACAA
ACAGGTACGT AACAAGGAGG AGTAAGAAGA AAATAGAGGC CAGATCTTTC CAAGTGGGAC
GAGACAACTC CTCGATACAA CCAGTGCAAT CCTCCCACTC ACCCCCGTTC CCCAACACTA
TTTCCTTTGG CAGTACCTCG ATGCCGCGGA AACATTCACG GAAGCGCTCA CGCTCATTGA
AACCAGCGAT ACGAGCGGTT CCGTCCAGAA CACATCGTCC TCTTTGCATC GCCAATTGAT
TACCTTGCTC AACAATCGCT CCGCCATGTA CGAAAAAGGC AACCAGCCCG AATTGGCCCT
GGAAGATTGC ACGCAAATAC TCGACCAGGA TGTGCATCAC GCCAAAGCCC GCACGCGGAA
ACTCCGCGTT CTGGAAAGTC TCGGTCGTTG GCACGATGCA CTCGTCGAGG TCTGCGCCGT
GCAGCTTTTG TTTATGCGCA AACACCGGGA CAGCATGCGA CTCGGCTTGA AGGTACCGCC
GCCTCCCGTT CCCGAATCCA AAATGCAAGA AATTCTCACC AACGTTGTGC CGTTGGAAAT
GGAACCCTAC ATCCAAGCGC TCAACGAAAA AACCACGCGA CCACTGCCAT CGGGCTACAC
GATCCTGCAA TTGCTCCGCT CCTTTACGTC CTACAATAGC TGGATGGCGC AAGCGGCCAA
AGACGGCAAC GTCGCGAACA TTGACAAGGA GTTGGTGGAA GGCGTGGATG CGGCGAGTAA
AGCACAGCGC GTGCACGTCC TACTGAAACG CGGACGCCGG CACGTGTACG ACCGGGCTTT
TGAAAACGCC AGTGACGATT TCGAACAAGC CTACGCTCTG GCCGAAACCA ATGAGGTGCA
ACTATTGTTG GAAGGAGACG ACTACGCTCG CGTGTTGGAA TGGACGGGCA TGGTCAAGCA
CTGGCGATAC AAGCTGGACG AAGCCTCGGC TTGTTACGAA AAGTGTGCCG ATCTAGAGCC
GACCAATGCC TTGGTGCTGG TAAAGAATGC CGGAGTCAAA ATGGACGGTT CCCACCAAGA
CGAAGCCATG AAACTGTTCG ACACTGCCCT GGGACTCGAT CCGAAAAACG CCGACGCGCT
CTTGCACCGT GCTAATCTGC GACTGTTGCA AACCAAGCCA GACGAAGCCA AGGAAGATCT
GGAAGCCTGC ATTGCGGTGC GACCCGACCA TATTATGGCC CGTCTCCGAC TGGCGTCCAT
TTTGGCCGCG ACGAACGAAG CGGCCAAGGC GAAAAAGCAC TTGGACGCGG CCGAAAAAGT
GGAGCCCAAA TCGTCCGAAG TGCAATCGTA CCGAGGCGAG CTACACTTTA CGCAAGGAGA
ATTCGACCAG GCCCGTGCGC AGTTTGAAAA GGCCATTGCG CTGGACCCCA CCAATCCCAC
TCCGTACGTG AACGCCGCCA TGGCTATTTT GCAGACGCCA CCGCCGCCGG GACAAATGCC
GGATGCGCAG GAGGTAATTC GTCTGTTGGA AGAAGCCATT CGTGTCGATC CGTCGTTTAC
GTTGGCGTAC ACGCATCTCG GCAACGTGAA ACTTGGAACC GCCACGGAAC TTTCCAGTGC
TCGTGAAGTG GTGACGCTGT ACGACCAGGC ACTGGCCAAC TGCCGATCGG AAGAAGAAAT
CAAGGAACTG TGCAGTATGC GCATTCTAGC GGTAGCTCAA GTAGAGGCAG CCAGCATGCT
GAAAATGGAA TCCTTCAACA TGCAATAGAG CGTACGAGAA CAGTCATTAA GCAACTACTC
GACGAGGAGA CT
 
Protein sequence
MALGTRSNQT LVVGLAAAAT ASLLLYYVVQ ARTRPTSDSS PRDTTDRKSR SVEFTTPSPK 
KTSGPLTKAV GKGEAADGSE DQTPIVKNTA KDAEKEVNAK IELLDKKGKD LFKNKQYLDA
AETFTEALTL IETSDTSGSV QNTSSSLHRQ LITLLNNRSA MYEKGNQPEL ALEDCTQILD
QDVHHAKART RKLRVLESLG RWHDALVEVC AVQLLFMRKH RDSMRLGLKV PPPPVPESKM
QEILTNVVPL EMEPYIQALN EKTTRPLPSG YTILQLLRSF TSYNSWMAQA AKDGNVANID
KELVEGVDAA SKAQRVHVLL KRGRRHVYDR AFENASDDFE QAYALAETNE VQLLLEGDDY
ARVLEWTGMV KHWRYKLDEA SACYEKCADL EPTNALVLVK NAGVKMDGSH QDEAMKLFDT
ALGLDPKNAD ALLHRANLRL LQTKPDEAKE DLEACIAVRP DHIMARLRLA SILAATNEAA
KAKKHLDAAE KVEPKSSEVQ SYRGELHFTQ GEFDQARAQF EKAIALDPTN PTPYVNAAMA
ILQTPPPPGQ MPDAQEVIRL LEEAIRVDPS FTLAYTHLGN VKLGTATELS SAREVVTLYD
QALANCRSEE EIKELCSMRI LAVAQVEAAS MLKMESFNMQ