Gene PHATR_43981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43981 
Symbol 
ID7204195 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp610446 
End bp612519 
Gene Length2074 bp 
Protein Length418 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186092 
Protein GI219113017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.321285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAA GACAGCGCAC CGATCGCCTG CTTTCTTCAG CGATGACTTC CCACCACATC 
AACAGCAACT TCTCGCTGGC TATCTTGGCC AGTGTGGGCG CTGGTGCAGC GCTCACCATT
GCCGGCCAAT ACTGGCTTGC TCGTCGAAAA CCGGATGGCG AGACGGATCC TTCCATTGGG
CACAGCCTTG TCTGGTCCAG CATCGGCGGG GCTATGAACG CGGCCATGAT GTATACTGGC
GATCGTCTGA AACTTTACGA AACTCTTCGA GAAATGTGTG CAAAGCCTGC CTCTTCGGTG
ACGGCTATTG AGTTGGCCGA AGCCACGGTA TGTTTACCAA AGCAACTCAC AGTCAATATT
GTCAGCAAAG AGAGCTTACA CTAATACTTC ACTTCGCAGG GGCTAAATCA ACGATGGTTG
CGAGAGTGGC TAGCGCAACA AGCGGCGATG GGTGTCCTCA AGCTTTTATC TGGAACGGAA
AACGACGATG CGGCACTTAG ATACCGATTA CCGAAAGCAA CGGCTGAAGT TCTGGCCGAC
CCGGATTCTA GAGAATACGA CATTGCCATG ATTCAAGCCG TACCGTCCCT TGTAAATCGC
GCCAAAACGA TGTTACCGGA AGCCTTCGCA ACAGGAATGG GACGGCCTTA TGACGAAGCA
GACGTGGCCG AAGCCATTGA CCGGAATCAT CGGAAGCACG TTCGCGACGT GTTCATCCCG
CTCGTCCTAA GGCCCGCTCT CGGGGGAAGT ATAGCGCAGC ATTTGGAGGA TGGCTGCGAT
GTGGCGGATC TGGGATGCGG TGCAGGAGTC ATGCTCATTT TACTAGCCAA ATCATTTCCT
AAATCGAGCT TTCACGGGTT TGAAGTCTCT CAAGTAGCCT TGGAAAAGGC CGCTTTTCAC
GTTGCGGCGG CTCGCGTGTC TAATGTCTTT CTGCATAACG CCACCGAGCC TGGCGAATCT
TTGCGCGATC AACCAAGTCA ATTTGATCTT GTAGTGGTCT TTGATGTTCT GCACGACTCT
CCCTTTCCTG ATGATTTGAT CCAGCAAGTC AAAACTGCTT TGAAGCCCTC TGGTGCCTGG
TTGCTGGCGG ACATACCGAG CGCTCCCACG ACACGAGAAA ATCTCGTACA AATGCCCACC
GCGTCAACGT ATTTTGCCTT TTCCACCTGC CTTTGTATGA GCTGCTCATT ATCCGAGGAA
GGCGGCGCTG GTCTGGGTAC TTTAGGATTT TCCGTCCCTG TAGCTGAGAA AATGCTCCGA
GAAGGCGGTT TTAAATTCGT CAAGGTCTTG CTAGAGAAAG ACAATGCACG GTGGTTCCTT
GTGCATTGAG TGACCCAGTA TATGTGCTTC AATACAGTGA ACAAGACGTA GCAGTACAGC
GGTAACACCG GGGGTACATA ACGTATCGCT GAAGAAGGCG TTCATGGATG TGGCGTGGAC
GAAGGACTAA TGAAGAATTG CGCCTTCATA TCCAACGAAT AACGGAAAGA CTATATCTAA
TTTTACAGTT TTCGGTTGGT CCCTACCTTG CTGTGTCACA GGTATCCAAC AGCGGTTGGA
GTTTAATCAT CCGAACCCTT TCAGCGCCTC TATTGTGATC GCCTTGATGG AAGGATTTTC
TATTTTTTTC TTCTCCTCCA CTGAAACGAC GCGTTCGCGA TCCGCCATTT TCGTATTGGG
ACCACCTGGA TATCCCGCAT GATGCATGAG ATACAAGCGC ATCGAAATGG TAGCTACCGT
CGAGCCGATG ACGCCTACTC CTAGCAAGAG ACGTTTTGCC TGAGAGGTTT TGACGTCACG
GTATAATGTA ACGCAAATAT GCGCCACGGG TGCAGCGAGG ATGGGGCCAA GATAGACCCA
CTGCCGTTCC CTCACTCGAG CCATGACAAT TTCCTGGTCA TGACTTTTGC CCGTTGACAT
TGCTGCAAAT ACTGTTGAAA GCGCAGAGAC GCTAAAAGTT GATTCATTTG TTTGCACTAC
AGCTCAATGG CGTGATATGA GCGTGGGATA GGCAAATGAT GCTATTCGGT CCTATCTAAT
GGAAAATGAA ATTTTAAATA GCCTGTTTTC GGAC
 
Protein sequence
MTQRQRTDRL LSSAMTSHHI NSNFSLAILA SVGAGAALTI AGQYWLARRK PDGETDPSIG 
HSLVWSSIGG AMNAAMMYTG DRLKLYETLR EMCAKPASSV TAIELAEATG LNQRWLREWL
AQQAAMGVLK LLSGTENDDA ALRYRLPKAT AEVLADPDSR EYDIAMIQAV PSLVNRAKTM
LPEAFATGMG RPYDEADVAE AIDRNHRKHV RDVFIPLVLR PALGGSIAQH LEDGCDVADL
GCGAGVMLIL LAKSFPKSSF HGFEVSQVAL EKAAFHVAAA RVSNVFLHNA TEPGESLRDQ
PSQFDLVVVF DVLHDSPFPD DLIQQVKTAL KPSGAWLLAD IPSAPTTREN LVQMPTASTY
FAFSTCLCMS CSLSEEGGAG LGTLGFSVPV AEKMLREGGF KFVKVLLEKD NARWFLVH