Gene PHATR_44208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44208 
Symbol 
ID7203927 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1312237 
End bp1315224 
Gene Length2988 bp 
Protein Length937 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186227 
Protein GI219113287 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAA GTGTCTGCAT AATATATTGT CACGTGCTTC TCCTCCTGTG CTTGCTGCAC 
GACGATGGTT GGCGGGCCTA CGGACAAGAA ACCGAAGACA TTCTCGATAC GACCACAACT
CGTCCAGTTG TAGTCCACTT GCCAGATCTT GGACGCGTCC AAGGCAAACG ACAGTCGGGA
ATCGACTTTT TCGGGGGGTT GCCCTATGCC GCTCCCCCCG TCGGTCACTT GCGATGGGCT
CCACCGGAAC CACCAGCGCC CTGGGCACCC GCCAAACTAG ACGCTACCCA CTTCGGTCCC
GACTGTTGGC AGCTCGTCGA TCCCTTGCTC AATCCAGGAG CCGAAGTCGC ACGCATGTCG
GAAGATTGTT TGTATCTTAA CGTATTTACT CCGGCGGGAC ACGCTTCGCG GCACGAGCGA
CTGCCCGTGC TTGTCTGGTT GCACGGCGGT GCCTTTCAAC AAGGGGGCGC TCGACGCTCC
GAATACGACG GACGTCGTCT CGCCGAGCGC GGCACCATTG TGGTGACAAT CAACTACCGG
CTTGGTGCGC TGGGATTCTT GGTCAGTAGC GTCGACGGCT TGTTTGGAAA CTTTGGACTC
ATGGATCAGC GCGCCGCCTT GCACTGGGTA CAGGAGAATA TTGCTAAATT TGGAGGAGAT
CCGGATAGCG TCACGTTGTT TGGAGAGTCG GCGGGGGCAG TCATGACAGG ATTGCACCTC
ATGATGGAAG GGGCGGGATC GCTTTTTCAT CGAGCTATTA TACAGAGCAA TCCGCTGGGC
TGGCAAGTAC GAGCCATTGT AGTAGCTGAC TTTATCGGTG AAGCAATGAA ACGTTCCGTA
GATTGTCGAG ATGTGGCCTG TCTCCGGGCG GAGCGTGTGG AAGAGATTAT GCGTGCGCAG
TCCAGCCTCA TGGGAGTCCC AAGGAGTGTG GGCGATTTTT TTACCTGGGG TCCAACCTTG
ACGGAAGAGC TCAAGCTCAC CGTCGGAGGG CGCACACCGT TTGGCTCCAC TTCCCCCCTT
AGTCGTGAGC ACGTCATGTT CCGAGACTTG GACTCTTGGA AATGGCAAAA CAACCGCGAT
ACGTCCTGGG CTGCCGTCAA CGTCACACAG CCGCTGAAAA ATTTGAATCT CATACCCGAC
GATATACCCG TCATTATTGG TGCCAACAAG CATGAAGGCG AAATGTTTGT ACACGGTGCT
TTCCCCATCA CCATGTCGAA AACTGTCTAT TGGATGTTCG TTGGGGCCCT ATTTCGAGAT
AGTGCTTCGA GGGTATTAAA ACATTACCGC GCGTACGTGG ATCAAATAGA GCGGGAAGCC
GAAGAACTTG CCCGATGCCA AATCGAAGAA GAGGAGAACC GGCAATACTA TTTAGAGCAC
AAAGAGCAAC TCGATCACGA GTATCAGTTA CTACTGGAAA TGAATTCGAC TAAGGAAGGA
GTCGAAGCTA TCTCGGACAT TGAAACGTTG GTACAGACCT GGAGCCGCGG TGGCGCATTC
TTTCACCGAG ACCAACACGA TGACACAACC AATCATACAC CGTGGCATCG TCGTGTCTGG
CCTTTTGCGC GGAACAATAC AGAAGAAGCA ATTTTGGAGC GTGCCAGACT ACGCGAGGAG
CGCCGAAAAC TTCGAATCAA AGAACGTGCT TTGAAAGCAG CGGCCAGGGT AGTGGTGGAC
TATCGTCCCG TCATGAGTCG GATCATTGAC GACTACTTGT TTCGATGTCC GGCGTGGCAC
TACGCTCATT CTTTAAGCCG CAACCGCATT TATCGTGGCA AGCGAAACAA TGTGTATGTG
TATCAGTTCA GCCATTCGAC GCACATCCCA GGCTACGAGG AGTGCTGGGG CAAATCTTGT
CATACGTCCG AGATTCCCTA TGTCTTTCAG GCCATGGATA TCATTCGGAG CAACTATTCT
ACACTCGGTC CGCACGCTCA AAGGGAAGCC CCGTCCACTC CGGAGTACCC GTACACCGAT
ATGTTGGTAG CGTACCGTGA GGCCATGGAT GCAGCCTATC GGCAATACGA TGATGAAGAG
GACGCTGACG TGGAGACTCC CTCAAACGCT ACCAACCATG GTAGCACAAG CAGCAATCTC
TTTCAGCACT CGATGCGATT TCAACGATTG GTGAATCACT TTTTTGGCGA TTACTTCAAA
GAAGACGCGG ACGAAGAAAT CGCCAGTGAC ATGGCTGACC GATGGGTTTC CTTTGCCAAA
ACAGGCGACC CAAATTACGA AGGCAGTAAA GCATACTGGC GACCTTGGCG ATATATACTG
GACGAACGGT TGGGCCGAGA CAAGGAAAGA CCTTGGGAAC CTCAGGACTT TGACAAAATA
TTTGATCCCG AGATCGAGGA CGACTGGGAC GAAAACGATA CCACCCTAAT TGAGCGGTAT
GTTTGGTCAG ACGATCCAGG AGAACGTACC TACCGCCGCC GGGCGTTGCA CGCGCTCGCA
ATGGAGGTTG TCGATGAAGA CGTCTTCCAA ACCATGCTAC GTCGGACACC AAGAGGTCAC
GAAGACGATA ATCCTTTTAA CAGCTTTTTG TTCGGCAGCG CATCAAAACC AAAAGACGGT
CACCAGGAAC GGCTTATGTC GCGACAAGCC ATGCGCCAGC TACAGGAGAT TGCTCAAAAT
ATGGGTGTAC TGGGTACGGG GCTACAGGGG GAAGCGCGCC GGGGACACGT CGGCGATACC
TGGGATGAAG ACTTCTTTCC TGAAATTTTG GAGCTCAAAT GGCCACCGGA AGGACGCCTC
GTCGAACGTG ATTGTACTTG CGACATGTGG GACCGGATCC GATGTAAGCA ACCACCGTCC
TTTGACTTGT TGAATTCTAT GCGACACTGT TGCTCACACC CCTTCTGAAA TCTGTGCCAT
GCTTATGCTC GCCCCCTCTT TTCGACGATG GATGTTTTTC ATTAACATCA GACCGCTACT
AGCTAAGAAA CTGTAAAATA TTATGTAGGC AGTTCATAAC ACACCGCT
 
Protein sequence
MYKSVCIIYC HVLLLLCLLH DDGWRAYGQE TEDILDTTTT RPVVVHLPDL GRVQGKRQSG 
IDFFGGLPYA APPVGHLRWA PPEPPAPWAP AKLDATHFGP DCWQLVDPLL NPGAEVARMS
EDCLYLNVFT PAGHASRHER LPVLVWLHGG AFQQGGARRS EYDGRRLAER GTIVVTINYR
LGALGFLVSS VDGLFGNFGL MDQRAALHWV QENIAKFGGD PDSVTLFGES AGAVMTGLHL
MMEGAGSLFH RAIIQSNPLG WQVRAIVVAD FIGEAMKRSV DCRDVACLRA ERVEEIMRAQ
SSLMGVPRSV GDFFTWGPTL TEELKLTVGG RTPFGSTSPL SREHVMFRDL DSWKWQNNRD
TSWAAVNVTQ PLKNLNLIPD DIPVIIGANK HEGEMFVHGA FPITMSKTVY WMFVGALFRD
SASRVLKHYR AYVDQIEREA EELARCQIEE EENRQYYLEH KEQLDHEYQL LLEMNSTKEG
VEAISDIETL VQTWSRGGAF FHRDQHDDTT NHTPWHRRVW PFARNNTEEA ILERARLREE
RRKLRIKERA LKAAARVVVD YRPVMSRIID DYLFRCPAWH YAHSLSRNRI YRGKRNNVYV
YQFSHSTHIP GYEECWGKSC HTSEIPYVFQ AMDIIRSNYS TLGPHAQREA PSTPEYPYTD
MLVAYREAMD AAYRQYDDEE DADVETPSNA TNHGSTSSNL FQHSMRFQRL VNHFFGDYFK
EDADEEIASD MADRWVSFAK TGDPNYEGSK AYWRPWRYIL DERLGRDKER PWEPQDFDKI
FDPEIEDDWD ENDTTLIERY VWSDDPGERT YRRRALHALA MEVVDEDVFQ TMLRRTPRGH
EDDNPFNSFL FGSASKPKDG HQERLMSRQA MRQLQEIAQN MGVLGTGLQG EARRGHVGDT
WDEDFFPEIL ELKWPPEGRL VERDCTCDMW DRIRYRY