Gene Information |
Locus tag | CA2559_05350 |
Symbol | |
ID | 9296563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 1206697 |
End bp | 1208457 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | phenylalanine 4-monooxygenase (phenylalanine-4-hydroxylase) |
Protein accession | YP_003715833 |
Protein GI | 298207654 |
COG category | |
COG ID | |
TIGRFAM ID | |
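The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1206697
end_bp = 1208457

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1761
print(protein_length)  # 586
```

Under these assumptions the computed values agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields.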
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAA CCATACAGAC AAACCCACTT ATAGATAGAT TGCCACCACA CTTAAAGCAA TTTATAAAAC CTCAAAACTA CGACGATTAT ACAGCTATTA ATCAAGCAGT ATGGCGTTAT GTAATGCGTA AAAATGTTGA ATATTTAGGT AAGGTTGCAC ACGAAAGTTA TTTGTCTGGA TTAAAGAAAA CAGGTATTTC TATAAATGAA ATCCCTAGTA TGTATGGTAT GAACCGTATC CTTAAAGATA TTGGTTGGGC AGCAGTTGCT GTTGATGGTT TTATACCACC AAACGCTTTT ATGGAGTTTC AAGCTTATAA GGTGTTAGTG ATTGCTAGTG ATATTAGACA ATTAGAAAAT ATTGAGTATA CACCAGCACC AGACATCATT CACGAAGGTG CAGGACACGC ACCTATTATC GCAAGTCCAG ATTATGCAGA GTACTTGCGT CGATTTGGTG AAATTGGAAG TAAAGCAATT TCTAGTGCTC ACGATATTGA GATGTATGAA GCAGTAAGAG CGGTTTCTAT ACTAAAAGAA GCAGAAGGTA CACCACAAGA GCAAATTGAT GCTGCTGAAG CACTTGTTGA GGAGCTTCAG AATAAAAAGG TTGAGCCAAG TGAAATTTCA TTAATTAGAA ATTTACATTG GTGGACTGTA GAGTATGGTC TTGTTGGAAC AGTCGAAGAC CCAAAGATAT ATGGCGCAGG TTTGCTGTCC TCTATTGGAG AAAGCAAGAA CTGTATGACA GATGCCGTGA AAAAAATACC ATATTCTATT GATGCAGCAT ACCAAGATTT TGACATCACA AAACAACAAC CGCAACTATT TGTAACTCCA GATTTTGCTT ATTTACAAGA GGTTTTAGAG GAGTTTGCCA ATACAATGGC AGTAAGAAAA GGTGGTTGGC GCGGCCTTAA AAAACTTATT GAAAGCAAAC AATTAGGTAC TATAGAATTG AGTACTGGCT TGCAAGTCTC AGGTGTGTTT AAGCGAATGA TACAGAATGA GGATAATGAA GTTGTTTACT TTGAAACCGA AGGTGAAACC GCATTATCTT ACCGTGAAAA AGAATTGATA GGTCATGGTA TAGAGCGCCA TATCAATGGA TTTCGATCGC CTTTAGGGAA ATTAAAAGGT ATTAATCTCG CCATTGAAAA TATGGGTCCA CGTGATTTAC AAGCTTATAA TTTTTATGAC GGTAAGCACA TTCAATTTGA GTTTGAAAGT GGCATTACTG TAGAAGGTAT GAACGTTACA GGTATAAGAA ACCTAAAAAG TGAATTGATG TTGATTCAAT TTACAGATTG TACCGTGAAA TATAAAGATG AGGTATTGTT TAGTCCAGAG GATGGCGATT TTGATTTTGC CGTTGGAAAA GACATTGTTT CAGCCTTTGC AGGTATGGCA GATTACAGAT CTTTTAATGT AAACACACAT AATCCTTCAA CAACGACAAC GATAAAAACA GAACGTTCTG CAAAACAAAA AGAACTTATC GAGTTGTATG AGGCTATTAG GAATTACCGC AACGGTGAAA CCACTAAGTT TTCACCAGAT GCTGTTTTTG ATATCTTGAA GAAACACCAT AATGAAGATT GGTTACTACC TTTAGAGATT TATGAATTAG AAGTTGAGCG AAACACAACT CTATCAAAAG AGGTGTTCAG ATATTTAAAT GAATTAAAAG AGCGCAGACC AGAAGTTAGT CATTTAATTG AAGGTGGTTT AGAGCTTTTA GAAACGCCAG AGCATGTTTA G
|
Protein sequence | MNETIQTNPL IDRLPPHLKQ FIKPQNYDDY TAINQAVWRY VMRKNVEYLG KVAHESYLSG LKKTGISINE IPSMYGMNRI LKDIGWAAVA VDGFIPPNAF MEFQAYKVLV IASDIRQLEN IEYTPAPDII HEGAGHAPII ASPDYAEYLR RFGEIGSKAI SSAHDIEMYE AVRAVSILKE AEGTPQEQID AAEALVEELQ NKKVEPSEIS LIRNLHWWTV EYGLVGTVED PKIYGAGLLS SIGESKNCMT DAVKKIPYSI DAAYQDFDIT KQQPQLFVTP DFAYLQEVLE EFANTMAVRK GGWRGLKKLI ESKQLGTIEL STGLQVSGVF KRMIQNEDNE VVYFETEGET ALSYREKELI GHGIERHING FRSPLGKLKG INLAIENMGP RDLQAYNFYD GKHIQFEFES GITVEGMNVT GIRNLKSELM LIQFTDCTVK YKDEVLFSPE DGDFDFAVGK DIVSAFAGMA DYRSFNVNTH NPSTTTTIKT ERSAKQKELI ELYEAIRNYR NGETTKFSPD AVFDILKKHH NEDWLLPLEI YELEVERNTT LSKEVFRYLN ELKERRPEVS HLIEGGLELL ETPEHV
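The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing and translate the nucleotide sequence for comparison with the protein field. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons; the helper names `clean` and `translate` are hypothetical, and only the first and last few codons of the gene are used as input.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(raw):
    """Remove the spacing between ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
gene_start = clean("ATGAACGAAA CCATACAGAC AAACCCACTT")
print(translate(gene_start))  # MNETIQTNPL

# Last four codons of the gene sequence, ending in the TAG stop codon.
gene_end = clean("GAGCATGTTT AG")
print(translate(gene_end))  # EHV
```

Both results match the corresponding ends of the protein sequence field. Passing the full gene sequence through `clean` and `translate` would allow the whole protein to be compared in the same way.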