Gene Clim_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1539 
Symbol 
ID6354186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1660084 
End bp1661136 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content51% 
IMG OID642669145 
ProductUDP-3-O-(3-hydroxymyristoyl) glucosamine N-acyltransferase 
Protein accessionYP_001943568 
Protein GI189347039 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1044] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
TIGRFAM ID[TIGR01853] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTA GCGAAATCCG GGACTATCTC GGCCAATTTT TTCCAGGCAT TGAACTGCAT 
GGCGGCAGCG ACAGGATAAT TACCGGTCCG GCCAAGATTG AAGAAGCCCG ATCAGGGGAG
GTCAGCTTTA TTTCAAATGA GAAATATGTC AGGTTTCTCG ATACCACCTG CGCTTCGCTT
GTCATTGTCG GGAACTCGGT TCCGGTCGAT GCCTATATCG AAAAAACCTC ATTCATCAGG
GTTCGTGATC CATATACGGC ATTTATGCTT CTTTTGCAGC GATTTTCCAA ACCGAGGAGC
GTTGCTCTTC CTGGCATTGC CGATACAGCC GTGGTCGGCA TCGATGTGCG GATCGGATCA
AATGTGGCGA TCGGTGATTA TGCGGTTATC GGAGACCGTT GTTCGATAGG CGATAATGCT
GTTATCGGAC CGCATGCGGT GCTCCTTCAT GACGTTTCTG TCGGTAATGA TACCGTCATC
AATCCCCATG TTATCTGTTA TGACGGTTCG GTCATAGGAT CGCGGGTGAT CATTCATTCC
GGAAGCGTTA TCGGAGCAGA CGGCTTCGGT TTTGCTCCGC AGGCTGACGG CTCATACCTT
AAAATACCCC AGATGGGTAT TGTCGAGATC GGTGACGATA CCGAAATCGG TGCCAATGCA
ACCATTGATC GTGCTACCAT GGGCAGTACG GTCATTGGCA GAGGTGTGAA AATTGACAAT
CATGTACAGA TTGCCCATAA CTGCAGGATT GGCGATCATA CCGTCATCGC CGCCCAGGCG
GGCATATCCG GCAGTGTCAC TCTCGGCTGT TTCTGCATGA TCGGCGGGCA GGCAGGATTA
GCCGGTCATC TGGAGCTTGC CGATCGTACT CATGTCGCCG CTCAGGCGGG GATCTCAAAA
TCGTTTTTGC AGCAAGGTGT CGCTTTGCGC GGTTATCCGG CACAACCCAT GCGTGAACAG
CTTAAGCAGG AAGCTCTTGT GAGAAGTCTT GGCTCTATGA AGGCTCGCCT TGATACGCTG
GAAGTCGAGG TTCGACTGAT GCAGCAGAGT TGA
 
Protein sequence
MIISEIRDYL GQFFPGIELH GGSDRIITGP AKIEEARSGE VSFISNEKYV RFLDTTCASL 
VIVGNSVPVD AYIEKTSFIR VRDPYTAFML LLQRFSKPRS VALPGIADTA VVGIDVRIGS
NVAIGDYAVI GDRCSIGDNA VIGPHAVLLH DVSVGNDTVI NPHVICYDGS VIGSRVIIHS
GSVIGADGFG FAPQADGSYL KIPQMGIVEI GDDTEIGANA TIDRATMGST VIGRGVKIDN
HVQIAHNCRI GDHTVIAAQA GISGSVTLGC FCMIGGQAGL AGHLELADRT HVAAQAGISK
SFLQQGVALR GYPAQPMREQ LKQEALVRSL GSMKARLDTL EVEVRLMQQS