Gene CA2559_08166 details

Gene Information

Locus tag: CA2559_08166
Symbol:
ID: 9297117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 1788520
End bp: 1789923
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase
Protein accession: YP_003716380
Protein GI: 298208201
COG category:
COG ID:
TIGRFAM ID:
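
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 1788520, 1789923

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```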


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA CCATTGCTCA AAAGCAACAG ACCATAGCTA AAGAAGTTAC TCTTAAAGGA 
GTAGGTCTTC ACACTGGTAA AGAAGTAACA TTAACTTTTA AGCCGGCACC CGAAAATTTT
GGATACGCTT TTAAACGTGT TGATCTTGAA GGAGAACCTG TTATAGAGGC AGATGCCAAT
TACGTGGTAA ACACTCAACG AGGAACTAAC CTTGAGAAAA ACGGCGTTAG CATACAGACA
AGTGAACACG TACTTGCTGC TTGTGTAGGC TTAGAGATAG ATAATGTTTT AATTGAATTA
AATGCATCTG AGCCTCCAAT TATGGATGGG TCTTCAAAAT TCTTTGTTGA AGCTTTAGAA
AAAGCAGGAA TACAAGAACA AGAAAAGAAT AGAGAAGTTT ATGTTGTTAA AGAAAACATC
TCTTATATCG ATGAAGAAAC TGGTAGCGAG ATACTTTTAA TGCCTTCAGA CGATTACCAA
ATTACCACAA TGGTAGATTT TGGTACTAAG ATTTTAGGAA CCCAAAATGC TTCAATAAAA
AATCTTTCAG AATTTAAAGA TGAAATATCA GACGCACGTA CATTTAGTTT CCTTCATGAA
TTAGAAATGC TTTTAGAACA CGGCTTAATA AAAGGTGGTG ATCTAAATAA TGCAATTGTT
TATGTAGATA AAGAGATAAG CTCAGATACT GTCGAAAAAT TAAAGAAAGC ATTTAATAAA
GAAACAATCT CTGTTAAACC TAATGGTATA TTAGATAACC TAACGTTACA TTATCCTAAT
GAAGCAGCAC GTCATAAACT ATTAGATGTA ATAGGAGATT TAGCGTTGGT AGGTACAAGA
ATACAAGGTA AAATTATTGC CAATAAACCA GGACACTTTG TAAATACTCA ATTTGCTAAA
AAACTATCTA AAATTATCAA GATAGAAAAA CGTAATGCAG TTCCTCAGGT AGATTTAAAT
CAAAAACCAT TGATGGATGT TGTGCAAATC ATGAAAATGT TACCGCACAG ACAACCATTT
TTATTAATAG ATAAGATTTT TGAGTTATCT GATACACATG TATTAGGATC AAAAAATGTA
ACCATGAATG AAGACTTTTT TAGAGGTCAC TTTCCTGGTT CACCTGTAAT GCCAGGTGTC
CTAATTGTTG AGGCAATGGC ACAAACCGGA GGCATATTAA TATTGAGTAC CGTTCCAGAT
CCAGAAAATT ACTTAACCTA CTTCATGAAG ATAGATAATG TTAAGTTTAA ACAAATGGTC
GTACCTGGAG ATACATTAGT CTTTAAGTGT GATTTAATAT CACCTATACG ACGAGGCATT
TGTCATATGC AAGGTTACGC CTATGCAAAC GGCAAATTAG CTTGTGAAGC CGAACTTATG
GCACAAATTT CAAAAGTAAA GTAA
 
Protein sequence
MTETIAQKQQ TIAKEVTLKG VGLHTGKEVT LTFKPAPENF GYAFKRVDLE GEPVIEADAN 
YVVNTQRGTN LEKNGVSIQT SEHVLAACVG LEIDNVLIEL NASEPPIMDG SSKFFVEALE
KAGIQEQEKN REVYVVKENI SYIDEETGSE ILLMPSDDYQ ITTMVDFGTK ILGTQNASIK
NLSEFKDEIS DARTFSFLHE LEMLLEHGLI KGGDLNNAIV YVDKEISSDT VEKLKKAFNK
ETISVKPNGI LDNLTLHYPN EAARHKLLDV IGDLALVGTR IQGKIIANKP GHFVNTQFAK
KLSKIIKIEK RNAVPQVDLN QKPLMDVVQI MKMLPHRQPF LLIDKIFELS DTHVLGSKNV
TMNEDFFRGH FPGSPVMPGV LIVEAMAQTG GILILSTVPD PENYLTYFMK IDNVKFKQMV
VPGDTLVFKC DLISPIRRGI CHMQGYAYAN GKLACEAELM AQISKVK
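
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons and that the sequence is in frame from its first base. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling. The 10-base spacing used in this record is stripped before processing.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGACTGAAA CCATTGCTCA AAAGCAACAG"))  # MTETIAQKQQ
```

Passing the full gene sequence to `translate` should give a result that can be compared with the protein sequence listed above, and `gc_fraction` can be compared with the reported GC content.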