Gene PHATRDRAFT_37719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37719 
Symbol 
ID7202274 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp776737 
End bp777973 
Gene Length1237 bp 
Protein Length389 aa 
Translation table 
GC content51% 
IMG OID 
Productformamidase-like protein 
Protein accessionXP_002181627 
Protein GI219122595 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.990726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTG GCATCGTACG CCACACCATT GTTGGTGCTT TGCTTTTCGT CACCAGCACC 
CATGTTCCAG GACTCGTCTC GGCACAAACA ACGACTCTTC CCTTGAGTGC GGCAAATGTT
CATTGGGGAT ATTTTAGCAA GACCCTCGAC CCAGTTCTCA CTGTTGCTTC CGGTACCGAA
GTCGTTGTTG AAATGGCAAC TCACCACGCT TGCGATGATT GGGACAAGAT GATCATGGGA
GACGCTGCCA TGGAAGAGAT CTACACGTGG ACCGGAGACA TTGTCGGCGA GGAGTTTCGT
GGCGCCAGTG GTGGTGGCGA CGGTGTGCAC ATTCTCACGG GCCCTATCTT TGTTGAAGAT
GCCGAGCCGG GAGATATCCT CAAAGTGGAG ATATTGGATC TTCAGCCCCG ACCTAACCAG
GATGGCAAGA CCTTTGGGTC CAACGCTGCT GCCTGGTGGG GCTTTCAAGC CCGGGTCAAC
AAGGCGGACA ACACTCCTTT CTACGCCGGG TCGTTCTCCG ACACGCCAAC CCAGAACGAC
GAAATCGTCA CAATCTACGA GATTGTGGAA GAAAACGGTC AGAGTTTCGC GGTGCCTTCG
TACCAGTTTG AATGGCCCAT CATGACGGAT CCCAATGGTG TTGAACGCGA TTACATTGCA
TACCCAGGTA CATGTGTTCC ACATGACACT CATAGCATAA CCATACCGTC TTCGGATGTT
ACTGATATGG GATGGACCAA AGCGGGAGCC ATCACTTACT ACGACAATGT GTTCAAGGCT
AAGATTCCTA TCAACTACCA TGTGGGTTGT ATGGGGCTTG CTCCCGCTTC CCATGACTTT
GTCGATTCCA TTCCGCCAAT GCCAACCGGT GGCAACCTGG ACAATAAGCG TATTGGTGTT
GGCACCACCA TGTACTACCC GGTGGAAGTT GCGGGAGGCT TGATCTCAAT GGGTGATGCA
CACGCTGCTC AGGGCGACTC GGAACTTGAT GGTACAGGAA TCGAAACCTC AATTACGGGC
AAGTTTAAGC TAACGGTCAT CAAACAGGAA GATTTTACAG CTTCTCAGGC AGTGTTGGAC
TTTCCCTTGG GCGAGACGGC AACGGACTGG ATTATTCATG GTTTTACGGC AACCGACTAC
CTCGAGACAT ATGCAGATAA CCCAGCTGCC ATATACAACG CTTCAAGTAT TGATGCAGCA
GCAAAAAATA CATTTACACA AACCCGCAAG TTTCTAA
 
Protein sequence
MSIGIVRHTI VGALLFVTST HVPGLVSAQT TTLPLSAANV HWGYFSKTLD PVLTVASGTE 
VVVEMATHHA CDDWDKMIMG DAAMEEIYTW TGDIVGEEFR GASGGGDGVH ILTGPIFVED
AEPGDILKVE ILDLQPRPNQ DGKTFGSNAA AWWGFQARVN KADNTPFYAG SFSDTPTQND
EIVTIYEIVE ENGQSFAVPS YQFEWPIMTD PNGVERDYIA YPGTCVPHDT HSITIPSSDV
TDMGWTKAGA ITYYDNVFKA KIPINYHVGC MGLAPASHDF VDSIPPMPTG GNLDNKRIGV
GTTMYYPVEV AGGLISMGDA HAAQGDSELD GTGIETSITG KFKLTVIKQE DFTASQAVLD
FPLGETATDW IIHVLMQQQK IHLHKPASF