Gene PHATR_43972 details

Gene Information

Locus tag: PHATR_43972
Symbol:
ID: 7204189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 579116
End bp: 580726
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186085
Protein GI: 219113003
COG category:
COG ID:
TIGRFAM ID:
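
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 579116
end_bp = 580726
gene_length_bp = 1611
protein_length_aa = 536

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons and its last codon is a stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the values reported in the record.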


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.549222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGTCT TCCAGTTACG AAAAACGATG CATGTGCTAG AATGCGTGCC CAACTGGTTC 
CAGGTGTCGG AGATGCAAAA ACCTAAGTAT GGTGACAGAA ATGATACCGG TAAATGGTTT
GTCTTCACCA TGCCTCGAGA ATATTATACC GTTCTGGTTA CCTTTCTCTG TTCAATTGGC
ACTACCAATG GTTTTGGTCT CGGTGTTGAC TTTCAAAAAG AACAGGGCCG GGAGCATCGT
CTATTTTTGA AAAGGAAATC GCGCCTGGCC TCTGCGCTGT TTATGGAGGT TCCGGTAGGA
ATTCCTATCG ATCCCGAAGG TCGGCCATAC AAATTTCCGG CCAAAGAACA TTGTTCCAAA
TGTGGCTTAT GTGAAACAAG CTACGTGGCG CGTGTGAAGG AGGCCTGCGC ATTCTTGGAA
CCGGGCATGT CTCGGATCGA CACACTCGAG ACGAAAGTTC ACGGCCGGCG ACGAAAGACG
ACCGACGACA AAACAATTGT GCAGGCTGAT GAACGACGCT TTGGGGTTCA GTACCAGCCA
CTTCGACTCG CTCGAGGAAT CAGCATGCCG GGTGCACAAT GGACAGGCGT GGTGTCTTCT
ATTGCTATTT CGATGTTGGA GACCAGACAA GTCGATGCCG TTGCCTGTGT GGCTTCCAAT
GAAGAAACCT GGAGTAATCC CAATCCAATA CTAGCCCAAA CTACCGACGA AGTTCTGAAA
GGAAGAGGTG TAAAGCCGTC TCTTGCTCCT AGTCTTAACA TTCTGGACGA AGTAAAAAAT
GATCCATCGA TAAGGCGACT CCTGTTTTGC GGAGTTGGCT GCTCCGTCCA GGCGCTTCGT
TCCATCGAAA ATGAGTTGGG TATAGAAATT TTCATATTGG GCACCAATTG TGTTGATAAC
AGCCCTTCCC CAGGAGCTGC AGCTGCATTT ATCGAGAAGG GGGCGAAGGT CTTTTCAGAT
TCGGTCCGTG GCTATGAGTT TATGCAGGAT TTTCGTGTTC ATGTCAAAAC CGAGGAGACC
TACTTGACAA TACCTTATTT TTGTCTACCT GGCACTATTG CTGAATCGTC TATTGCCAAG
TCATGCCGAT CTTGTTTCGA CTATACAAAT GCTTTGGCGG ATGTAGTGGT TGGATACATG
GCAGCGCCAC TTGATGGAAA GTCGAGAATG GACGAATCTT GGCAGACTGT CACAGTCAGG
AACGAACGAG GCAATCAGAT GGTTGAGACT GCGATTACAC AAGGACGTCT AGAAGTTGGA
GACATTGTAC GAGGATCTGG CGATCACCAA CAACTTGCAA TTGCGACTAC GAAATCCGAT
GCTCTTGTGC AAGCCATGGT GGGTGGCAAA GTTCAAGAGA ATGGGATGCC GCTATGGCTA
GGGAACATAA TGGCAACGGT TCTCCGAAAA GTTAGTGCCA AAGGAATCGC ATTCGCCCGG
TACAGCATTG ACTACCACAT AGTGAGGAAC TATTTTCATG TTCTGAACGA GTGGGGCGAG
CATCGCGCTC GATCCTCAAC ACCGCAATTC GCTTTGGAAA TTGTCGACGA ATACCTCGAA
ATGGATTCTA CGCTAAAGGG ATATGCCGCT AAACTTACTT CGAAACATTG A
 
Protein sequence
MTVFQLRKTM HVLECVPNWF QVSEMQKPKY GDRNDTGKWF VFTMPREYYT VLVTFLCSIG 
TTNGFGLGVD FQKEQGREHR LFLKRKSRLA SALFMEVPVG IPIDPEGRPY KFPAKEHCSK
CGLCETSYVA RVKEACAFLE PGMSRIDTLE TKVHGRRRKT TDDKTIVQAD ERRFGVQYQP
LRLARGISMP GAQWTGVVSS IAISMLETRQ VDAVACVASN EETWSNPNPI LAQTTDEVLK
GRGVKPSLAP SLNILDEVKN DPSIRRLLFC GVGCSVQALR SIENELGIEI FILGTNCVDN
SPSPGAAAAF IEKGAKVFSD SVRGYEFMQD FRVHVKTEET YLTIPYFCLP GTIAESSIAK
SCRSCFDYTN ALADVVVGYM AAPLDGKSRM DESWQTVTVR NERGNQMVET AITQGRLEVG
DIVRGSGDHQ QLAIATTKSD ALVQAMVGGK VQENGMPLWL GNIMATVLRK VSAKGIAFAR
YSIDYHIVRN YFHVLNEWGE HRARSSTPQF ALEIVDEYLE MDSTLKGYAA KLTSKH
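
Both sequences above are displayed in space-separated blocks of 10 characters, 60 per line. To use them in a script, the whitespace has to be removed first. The following is a minimal sketch of that step, with a simple GC-fraction helper. The function names `unblock` and `gc_fraction` are hypothetical and not part of any database tool. Only the first line of the gene sequence is used here as sample input.

```python
def unblock(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above, copied with its block spacing.
first_line = "ATGACTGTCT TCCAGTTACG AAAAACGATG CATGTGCTAG AATGCGTGCC CAACTGGTTC "

seq = unblock(first_line)
print(len(seq))          # number of bases on this line
print(seq[:3])           # start codon
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Passing the full gene sequence through `unblock` and `gc_fraction` gives a value that can be compared with the GC content field, assuming that field is computed over the gene sequence alone.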