Gene PHATRDRAFT_46561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46561 
Symbol 
ID7201845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp714426 
End bp715862 
Gene Length1437 bp 
Protein Length453 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181060 
Protein GI219120652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0084892 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCAC AGAGGAGAGG AAGTAGAAGA CGACGACGAC AATCGCATTT GACGCAGCAA 
TGTACTCGCG TCGTTTGTCT TCTGATATTG GGGAAAACGG CCTCCCAACC CGCTGAAGAA
ACGGTCGCCG CCTCGCTACC ATTGTCGCAA CCACACCTTC GCAGACGTCA CGACAACGGT
AATACCGTGG AACTTGTTCC GAATGCGACC GTCCGTCTGC CTCTGCACGC CGTCGCGGGT
ACGCATCACG TGACGGCTTG GATGGGGGAA CCGCCGCAGG CGCAAACGCT GATTGTCGAC
ACCGGGTCGC GGTTGACGGC GACCGCGTGC GAGCCCTGTT CGCAATGCGG GACGACGCAC
GCACACCCGT TCCCCCATTT GGACCCCCAG CGGTCCAGCA CGCTGCGATA CACGCAGTGT
GGATCCTGTC TGCTCAGCGG CATCCAGGAA TGCGCAGCGG AACAAAAGTG TGGTATTAAT
CAAAGGTATA CTGAAGGCTC CAGCTGGACA GCAGTGGAAG TCAGCGATAC GTTTGTCCTG
GGAGGACCGG AGATATCCAG TTTGGAACAG TACGTGAGCT TTACGATTAT CTTTGCGTTC
GGATGCCAGC AAAAAGTCAG GGGATTGTTC CGAACACAGT ACGCCAACGG TATATTGGGT
TTGGAACGGT CCGACCTCTC GCTCATTAAG CGATTGTGGA AGGAAAATGT CATTCCTCGC
GAGTCATTCT CCCTATGCAT GACACCTTTT GAAGGCTACA TTGGACTGGG AGGACCACTA
CGAGACAAGC ATACGGAATC GATGAAATAC ACGCCGTTCA CTTCCACTCA GAGTTGGTAT
GCTGTCCACG TAGTCCGAGT GTTTGTAGGG GACGAATGCT TGACAAGCAA TGACCAGCAC
GACACTGTTG TCGAGCATGC ATTGGTCGAA GCCTTTGCAG AGGGCAAGGG TACTATACTG
GACTCGGGAA CGACGGACAC GTATCTCCCC AAGGCAGTTG CGGGTCGTAT GCGAGAAATA
TGGGCGCGCC TTTCCAACAC ACCCTTTCAA CCGTCGAGCA CGTACGCCTA CACATACGAT
GAGTTTAGAT CGCTGCCCAT CGTGACCTTT GAGCTCGCCA ACAACGTAAC CTTACAGGCC
CTGCCTAAAA ATTTCATGGA AGACCTTCCC GAGCCTTTGC GGCCCTGGAC GGGACGGAGG
AAACTAATGA ACCGCCTGTA CGCGGACGAA GTACAAGGTG CCGTGGTGGG ATTGAATACA
ATGGTGGGCT ATGACTTGCT CTTTGACGTC CAAGGCAATC GTTTTGGTGT CGCCCCGGCC
CTATGTGGAA TTGCGAACAG TACACCAGCA GCGACTCATT AAAACGGAAG CGTTTGTAAA
GGTTCTTTTG ACAATTAAGA ATCTTCGATA TACTTAATGG TCATCGGGGT TCCCGTT
 
Protein sequence
MVPQRRGSRR RRRQSHLTQQ CTRVVCLLIL GKTASQPAEE TVAASLPLSQ PHLRRRHDNG 
NTVELVPNAT VRLPLHAVAG THHVTAWMGE PPQAQTLIVD TGSRLTATAC EPCSQCGTTH
AHPFPHLDPQ RSSTLRYTQC GSCLLSGIQE CAAEQKCGIN QRYTEGSSWT AVEVSDTFVL
GGPEISSLEQ YVSFTIIFAF GCQQKVRGLF RTQYANGILG LERSDLSLIK RLWKENVIPR
ESFSLCMTPF EGYIGLGGPL RDKHTESMKY TPFTSTQSWY AVHVVRVFVG DECLTSNDQH
DTVVEHALVE AFAEGKGTIL DSGTTDTYLP KAVAGRMREI WARLSNTPFQ PSSTYAYTYD
EFRSLPIVTF ELANNVTLQA LPKNFMEDLP EPLRPWTGRR KLMNRLYADE VQGAVVGLNT
MVGYDLLFDV QGNRFGVAPA LCGIANSTPA ATH