Gene Acid345_1935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1935 
Symbol 
ID4071411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2328032 
End bp2329045 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content61% 
IMG OID637983947 
ProductUDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
Protein accessionYP_591010 
Protein GI94968962 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1044] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
TIGRFAM ID[TIGR01853] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.920136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCT CCGAGATCGC CCGCCGCCTC GGCTGCACCC TCGACAACTG TCCCGACCCT 
GACGCGGTTG AGATCACCGC CGTCACCGGC ATCGAAGCAG CCGGGCCCAC AGATATCACC
TTCGTCTCGA ACCCGCGCTA CGCTGCGGCC GCGAAGACAA CGCACGCCGG CGCGATCATC
GTCTCCGACG ACTTCACCGC CGGTCGCGCG CCGCTCGTCC GCAGCAAGAA TCCGTACCTC
ACGTTCGCGA AGGCGATCGA GCTCTTCTAC CAAGCGCCAA AGTACGCTCC GGGCATTCAC
CCCACCGCGG TCATCTCTCC CACGGCGAAG GTGGGCGCGA ACGCTTCGAT TGGCCCTTAC
GTGGTGATTG AGGACAACGT TGCCATCGGC GCGAATTGCG TTCTTCGCGC GCACGTCGTC
ATCTACGAAG GCGTGACTAT TGGCGACAAT TTCTTCGCGC ACGCGCACGC GGTTGTCCGC
GAGCACTGCC GCATTGGCAA CAACGTCATC CTGCAGAACG GCGTGGTAAT TGGCGCCGAC
GGCTACGGCT TCGCCCGCGA CACCGACGGC TGGTACAAGA TCGCCCAATC TGGCACTACC
ATCCTCGACG ACAACGTTGA AGTACAAGCC AACTCCACCG TCGACCGGGC CTCAATCGGC
GAGACTCACA TCTATGCCGA CGCCAAGATC GACAACCTCG TAATGATCGG CCACGGCAGC
TCCGTCGGCG AACATTCCCT GCTCTGCTCA CAGGTTGGAC TCGCCGGTTC CAGCCACGTC
GGCAAAAACG TAATTCTTGC GGGTCAAGTC GGGGTCGCCG GACATCTACA CATTGGTGAC
GGGGTAATCG CGGCCGGCCA AACCGGTGTG CAGAACGACA TCGAGCCCGG CAAACGCATT
GGCGGCTCGC CGTCATACGA CCACAAGCAG TGGATCCGTT CCTGGCAAAT CCAGACGAGA
TTGCCGGAAA TTGTGAAGGA ACTGCGAAAT CTTGCATCCA AGAAAAGTGA GTAG
 
Protein sequence
MKLSEIARRL GCTLDNCPDP DAVEITAVTG IEAAGPTDIT FVSNPRYAAA AKTTHAGAII 
VSDDFTAGRA PLVRSKNPYL TFAKAIELFY QAPKYAPGIH PTAVISPTAK VGANASIGPY
VVIEDNVAIG ANCVLRAHVV IYEGVTIGDN FFAHAHAVVR EHCRIGNNVI LQNGVVIGAD
GYGFARDTDG WYKIAQSGTT ILDDNVEVQA NSTVDRASIG ETHIYADAKI DNLVMIGHGS
SVGEHSLLCS QVGLAGSSHV GKNVILAGQV GVAGHLHIGD GVIAAGQTGV QNDIEPGKRI
GGSPSYDHKQ WIRSWQIQTR LPEIVKELRN LASKKSE