Gene CA2559_05350 details

Gene Information

Locus tag: CA2559_05350
Symbol:
ID: 9296563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 1206697
End bp: 1208457
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: phenylalanine 4-monooxygenase (phenylalanine-4-hydroxylase)
Protein accession: YP_003715833
Protein GI: 298207654
COG category:
COG ID:
TIGRFAM ID:
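
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of an unspliced CDS, assuming one stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1206697, 1208457)
print(length_bp)                  # 1761
print(protein_length(length_bp))  # 586
```

Both results agree with the Gene Length and Protein Length values listed in this record.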


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAA CCATACAGAC AAACCCACTT ATAGATAGAT TGCCACCACA CTTAAAGCAA 
TTTATAAAAC CTCAAAACTA CGACGATTAT ACAGCTATTA ATCAAGCAGT ATGGCGTTAT
GTAATGCGTA AAAATGTTGA ATATTTAGGT AAGGTTGCAC ACGAAAGTTA TTTGTCTGGA
TTAAAGAAAA CAGGTATTTC TATAAATGAA ATCCCTAGTA TGTATGGTAT GAACCGTATC
CTTAAAGATA TTGGTTGGGC AGCAGTTGCT GTTGATGGTT TTATACCACC AAACGCTTTT
ATGGAGTTTC AAGCTTATAA GGTGTTAGTG ATTGCTAGTG ATATTAGACA ATTAGAAAAT
ATTGAGTATA CACCAGCACC AGACATCATT CACGAAGGTG CAGGACACGC ACCTATTATC
GCAAGTCCAG ATTATGCAGA GTACTTGCGT CGATTTGGTG AAATTGGAAG TAAAGCAATT
TCTAGTGCTC ACGATATTGA GATGTATGAA GCAGTAAGAG CGGTTTCTAT ACTAAAAGAA
GCAGAAGGTA CACCACAAGA GCAAATTGAT GCTGCTGAAG CACTTGTTGA GGAGCTTCAG
AATAAAAAGG TTGAGCCAAG TGAAATTTCA TTAATTAGAA ATTTACATTG GTGGACTGTA
GAGTATGGTC TTGTTGGAAC AGTCGAAGAC CCAAAGATAT ATGGCGCAGG TTTGCTGTCC
TCTATTGGAG AAAGCAAGAA CTGTATGACA GATGCCGTGA AAAAAATACC ATATTCTATT
GATGCAGCAT ACCAAGATTT TGACATCACA AAACAACAAC CGCAACTATT TGTAACTCCA
GATTTTGCTT ATTTACAAGA GGTTTTAGAG GAGTTTGCCA ATACAATGGC AGTAAGAAAA
GGTGGTTGGC GCGGCCTTAA AAAACTTATT GAAAGCAAAC AATTAGGTAC TATAGAATTG
AGTACTGGCT TGCAAGTCTC AGGTGTGTTT AAGCGAATGA TACAGAATGA GGATAATGAA
GTTGTTTACT TTGAAACCGA AGGTGAAACC GCATTATCTT ACCGTGAAAA AGAATTGATA
GGTCATGGTA TAGAGCGCCA TATCAATGGA TTTCGATCGC CTTTAGGGAA ATTAAAAGGT
ATTAATCTCG CCATTGAAAA TATGGGTCCA CGTGATTTAC AAGCTTATAA TTTTTATGAC
GGTAAGCACA TTCAATTTGA GTTTGAAAGT GGCATTACTG TAGAAGGTAT GAACGTTACA
GGTATAAGAA ACCTAAAAAG TGAATTGATG TTGATTCAAT TTACAGATTG TACCGTGAAA
TATAAAGATG AGGTATTGTT TAGTCCAGAG GATGGCGATT TTGATTTTGC CGTTGGAAAA
GACATTGTTT CAGCCTTTGC AGGTATGGCA GATTACAGAT CTTTTAATGT AAACACACAT
AATCCTTCAA CAACGACAAC GATAAAAACA GAACGTTCTG CAAAACAAAA AGAACTTATC
GAGTTGTATG AGGCTATTAG GAATTACCGC AACGGTGAAA CCACTAAGTT TTCACCAGAT
GCTGTTTTTG ATATCTTGAA GAAACACCAT AATGAAGATT GGTTACTACC TTTAGAGATT
TATGAATTAG AAGTTGAGCG AAACACAACT CTATCAAAAG AGGTGTTCAG ATATTTAAAT
GAATTAAAAG AGCGCAGACC AGAAGTTAGT CATTTAATTG AAGGTGGTTT AGAGCTTTTA
GAAACGCCAG AGCATGTTTA G
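
The GC content field of this record is the percentage of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows, assuming the sequence is pasted in whitespace-separated 10-base blocks as shown above. The function names are hypothetical, and the example uses only the first two blocks, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
print(gc_percent("ATGAACGAAA CCATACAGAC"))  # 40.0
```

Passing the full gene sequence to `gc_percent` gives the figure to compare against the reported GC content.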
 
Protein sequence
MNETIQTNPL IDRLPPHLKQ FIKPQNYDDY TAINQAVWRY VMRKNVEYLG KVAHESYLSG 
LKKTGISINE IPSMYGMNRI LKDIGWAAVA VDGFIPPNAF MEFQAYKVLV IASDIRQLEN
IEYTPAPDII HEGAGHAPII ASPDYAEYLR RFGEIGSKAI SSAHDIEMYE AVRAVSILKE
AEGTPQEQID AAEALVEELQ NKKVEPSEIS LIRNLHWWTV EYGLVGTVED PKIYGAGLLS
SIGESKNCMT DAVKKIPYSI DAAYQDFDIT KQQPQLFVTP DFAYLQEVLE EFANTMAVRK
GGWRGLKKLI ESKQLGTIEL STGLQVSGVF KRMIQNEDNE VVYFETEGET ALSYREKELI
GHGIERHING FRSPLGKLKG INLAIENMGP RDLQAYNFYD GKHIQFEFES GITVEGMNVT
GIRNLKSELM LIQFTDCTVK YKDEVLFSPE DGDFDFAVGK DIVSAFAGMA DYRSFNVNTH
NPSTTTTIKT ERSAKQKELI ELYEAIRNYR NGETTKFSPD AVFDILKKHH NEDWLLPLEI
YELEVERNTT LSKEVFRYLN ELKERRPEVS HLIEGGLELL ETPEHV
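
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this in Python, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and unrecognized codons are reported as `X` by assumption.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAACGAAA CCATACAGAC AAACCCACTT"))  # MNETIQTNPL
```

The output matches the first block of the protein sequence above. Applied to the full gene sequence, the same function should stop at the terminal TAG codon.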